PREFACE


The objective of this book is to offer students of science and engineering 
a concise, general, and easy-to-understand account of some of the most 
important concepts and methods of classical and computational solid
mechanics. The classical part is mainly a re-issue of Fung’s Foundations of
Solid Mechanics, with a major addition to the modern theories of plasticity, 
and a major revision of the theory of large elastic deformation with finite 
strains. The computational part consists of five new chapters, which focus 
on numerical methods to solve many major linear and nonlinear boundary- 
value problems of solid mechanics. 

We hold ease of understanding for the reader to be an objective
of our presentation. We believe that to be easily understood, the
presentation must be precise, the definitions and hypotheses must be clear,
the arguments must be concise yet sufficiently detailed, and the
conclusions must be drawn very carefully. We strive to pay strict attention
to these requirements. We believe that the method must be general and 
the notations should be unified. Hence, we presented the tensor analysis 
in general coordinates, but kept the indicial notations for tensors in the 
first fifteen chapters. In Chapters 16-21, however, the dyadic notations of 
tensors and the notations for matrix operations are used to shorten the 
formulas. 

This book was written for engineers who invent and design things for
humankind and want to use solid mechanics to help implement their designs
and applications. It was written also for engineering scientists who enjoy 
solid mechanics as a discipline and would like to help develop and advance 
the subject further. It was further designed to serve those physical and
natural scientists, biologists, and bioengineers whose activities might be
helped by classical and computational solid mechanics. For example,
biologists are discovering that the functional behavior of cells depends on 
the stresses acting on the cell. It is widely recognized that the molecular 
mechanics of the cell must be developed as soon as possible. 

Solid mechanics deals with deformation and motion of “solids.” The
displacement that connects the instantaneous position of a particle to its
position in an “original” state is of general interest. The preoccupation
with particle displacements distinguishes solid mechanics from the
mechanics of fluids.

This book begins with an introductory chapter containing a brief sketch 
of history, an outline of some prototypes of theories, and a description of 
some more complex features of solid mechanics. In Chapter 2, an
introduction to tensor analysis is given. The bulk of the text from Chapters
3 to 16 is concerned with the classical theory of elasticity, but the
discussion also includes thermodynamics of solids, thermoelasticity,
viscoelasticity, plasticity, and finite deformation theory. Fluid mechanics
is excluded, but methods
that are common to both fluid and solid mechanics are emphasized. Both 
dynamics and statics are treated; the concepts of wave propagation are 
introduced at an early stage. Variational calculus is emphasized, since it 
provides a unified point of view and is useful in formulating approximate 
theories and computational methods. The large deflection theory of plates 
presented in the concluding section of Chapter 16 illustrates the elegance 
of the general approach to the large deformation theory. 

Chapters 17 to 21 are devoted to computational solid mechanics to deal 
with linear, nonlinear, and nonhomogeneous problems. It is recognized
that the incremental approach is the most practical approach; hence the
updated Lagrangian description is used. Chapter 17 develops the incremental
theory in considerable detail. Chapter 18 is devoted to numerical methods,
with the finite element methods singled out for a detailed discussion of
the theory of elasticity. Chapter 19 presents methods of calculations based
on the mixed and hybrid variational principles, illustrating the broadening
of the computational power with a less restrictive (or weaker) hypothesis
in formulating the variational principle. Chapter 20 deals with finite
element methods for plates and shells, making the computational methods
accessible to the analysis of the structures of aircraft, marine architecture,
land vehicles, and shell-like structures in human beings, animals, plants,
earth, globe, and space. Finally, the book concludes with Chapter 21,
dealing with finite element modeling of nonlinear elasticity, viscoelasticity,
plasticity, viscoplasticity, and creep. Thus, a broad sweep of modern,
advanced topics is covered.

Overall, this book lays emphasis on general methodology. It prepares 
the students to tackle new problems. However, as it was said in the original 
preface of the Foundations of Solid Mechanics, no single path can embrace 
the broad field of mechanics. As in mountain climbing, some routes are 
safe to travel, others more perilous; some may lead to the summit, others 
to different vistas of interest; some have popular claims, others are less
traveled. In choosing a particular path for a tour through the field, one
is influenced by the curricula, the trends in literature, and the interest in
engineering and science. Here, a particular way has been chosen to view 
some of the most beautiful vistas in classical and computational mechanics. 
In making this choice, we have aimed at straightforwardness and interest, 
and practical usefulness in the long run. 

Holding the book to a reasonable length did not permit inclusion of
many numerical examples, which have to be supplemented through problems
and references. Fortunately, there are many excellent references to
meet this demand. We have presented an extensive bibliography in this
book, but we suggest that the reader consult the review journal Applied
Mechanics Reviews (AMR), published by the American Society of Mechanical
Engineers International since 1947, for current information. The
reader is referred to the periodic in-depth reviews of the literature in specific
issues of AMR.

We are indebted to many authors and colleagues as acknowledged in 
the preface of the Foundations of Solid Mechanics. In the preparation of 
the present edition, we are especially indebted to Professors Satya Atluri 
and Theodore Pian. We would like to record our gratitude to many
colleagues who wrote us to discuss various points and sent us errata in the
Foundations of Solid Mechanics, especially to Drs. Pao-Show D. Cheng,
Shun Cheng, Ellis H. Dill, Clive L. Dym, J. B. Haddow, Manohar P. Kamat,
Hans Krumhaar, T. D. Leko, Howard A. Magrath, Sumio Murakami,
Theodore Pian, R. S. Rivlin, William P. Rodden, Bertil Storakers, Howard
J. White, Jr., and John C. Yao. We would also like to thank Professors
Y. Ohashi, S. Murakami, and N. Kamiya for translating the Foundations
book into Japanese, and Professors Oyuang Zhang, Ma Wen-Hua, and Wang
Kai-Fu for translating the Foundations book into Chinese.

On the cover of this book, portraits of some pioneers of mechanics
are presented. They are arranged, from left to right and top to bottom,
according to their dates of birth. They are: Galileo Galilei, 1564-1642; Isaac
Newton, 1642-1727; Daniel Bernoulli, 1700-1782; Leonhard Euler,
1707-1783; Joseph Louis Lagrange, 1736-1813; Claude Louis Marie Henri
Navier, 1785-1836; Augustin Louis Cauchy, 1789-1857; Lord Kelvin
(William Thomson), 1824-1907; Gustav Robert Kirchhoff, 1824-1887; Lord
Rayleigh (John William Strutt), 1842-1919; Ludwig Eduard Boltzmann,
1844-1906; Augustus Edward Hough Love, 1863-1940; and Stephen P.
Timoshenko, 1878-1972. These portraits were supplied by Dr Stephen Juhasz,
who was the editor of the Applied Mechanics Reviews for 30 years, from 1953
to 1983. In 1973, Dr Juhasz published an article entitled “Famous Mechanics
Scientists” in Appl. Mech. Rev. Vol. 26, No. 2. These portraits are from
that article, in which acknowledgement to original sources is recorded. We
thank you, Dr Juhasz.

We would like to take this opportunity to mention a few editorial
notes:

(1) The bibliography is given at the end of the book. 

(2) Equations in each Section are numbered sequentially. When referring 
to equations in other Sections, we use the format (Sec. no: Eq. no), e.g. 
(5.6:4). 

(3) Formulas are a concise way of saying lots of things. The most important
formulas are marked with a triangular star, ▲. They are worthy of being
committed to memory.


Y. C. Fung 
P. Tong 
INTRODUCTION


Mechanics is the science of force and motion of matter. Solid mechanics is 
the science of force and motion of matter in the solid state. Physicists are 
of course interested in mechanics. The greatest advances in physics in the 
twentieth century are identified with mechanics: the theory of relativity, 
quantum mechanics, and statistical mechanics. Chemists are interested
in the mechanics of chemical reactions, the formation of molecular
aggregates, the formation of crystals, the creation of new materials with
desirable properties, the polymerization of larger molecules, etc. Biologists
are interested in biomechanics that relates structure to function at all
hierarchical levels: from biomolecules to cells, tissues, organs, and
individuals. Although a living cell is not a homogeneous continuum, it is a
protein machine, a protein factory, with internal machinery that moves and
functions in an orderly way according to the laws of mechanics. Therefore,
all scientists are interested in mechanics, and mechanics is developed by
scientists continuously.

Engineers, especially aeronautical, mechanical, civil, chemical, materials,
biomedical, biotechnological, space, and structural engineers, are the real
developers and users of fluid and solid mechanics because of their
professional needs. They design. They invent. They are concerned about the
safety and economy of their products. They want to know the function
of their products as precisely as possible. They want results fast. They
experiment. They theorize. They test, compute, and validate. To them
mechanics is a toy, bread and butter, a feast, or a delicacy.

Engineering is quite different from science. Scientists try to understand 
nature. Engineers try to make things that do not exist in nature. Engineers 
stress invention. To embody an invention the engineer must put his idea in 
concrete terms, and design something that people can use. That something 
can be a device, a gadget, a material, a method, a computing program, an 
innovative experiment, a new solution to a problem, or an improvement on 
what already exists. Since a design has to be concrete, it must have its
geometry, dimensions, and characteristic numbers. Almost all engineers working
on new designs find that they do not have all the needed information. Most
often, they are limited by insufficient scientific knowledge. Thus they study
mathematics, physics, chemistry, biology and mechanics. Often they have 
to add to the sciences relevant to their profession. Thus engineering sciences 
are born. 

This book is written by engineering scientists, for engineering scientists, 
and this determines its style. The qualities we want are: 

• Easy to read, 

• Precise, concise, and practical, 

• First priority on the formulation of problems, 

• Presenting the classical results as gold standard, and 

• Numerical approach as everyday tool to obtain solutions. 

If the book is a banquet, we offer some hors d’oeuvres in this introductory 
chapter. 

1.1. HOOKE’S LAW 

Historically, the notion of elasticity was first announced in 1676 by 
Robert Hooke (1635-1703) in the form of an anagram, ceiiinosssttuv. He 
explained it in 1678 as 

Ut tensio sic vis,

or “the power of any springy body is in the same proportion with the
extension.”†

As stated in the original form, Hooke’s law is not very clear. Our first 
task is to give it a precise expression. Historically, this was done in two 
different ways. The first way is to make use of the common notion of 
“springs,” and consider the load-deflection relationship. The second way is 
to state it as a tensor equation connecting the stress and strain. Although 
the second way is the proper way to start a general theory, the first, simpler 
and more restrictive, is not without interest. In this section, we develop 
the first alternative as a prototype of the theory of elasticity. 

Let us consider the static equilibrium state of a solid body under the 
action of external forces (Fig. 1.1:1). Let the body be supported in some 
manner so that at least three points are fixed in a space which is
described with respect to a rectangular Cartesian frame of reference. We shall
make three basic hypotheses regarding the properties of the body under
consideration.

(H1) The body is continuous and remains continuous under the action
of external forces.

† Edme Mariotte enunciated the same law independently in 1680.

Under this hypothesis the atomistic structure of the body is ignored
and the body is idealized into a geometrical copy in Euclidean space whose
points are identified with the material particles of the body. Continuity is
defined in the mathematical sense as an isomorphism of the real number
system. Neighboring points remain neighbors under any loading condition. No
cracks or holes may open up in the interior of the body under the action of
external load.

A material satisfying this hypothesis is said to be a continuum. The
study of the deformation or motion of a continuum under the action of forces
is called continuum mechanics.

To introduce the second hypothesis, let us consider the action of a set
of forces on the body. Let every force be fixed in direction and in point of
application, and let the magnitudes of all the forces be increased or decreased
together, always bearing the same ratio to each other. Let the forces be
denoted by $\mathbf{P}_1, \mathbf{P}_2, \ldots, \mathbf{P}_n$ and their magnitudes by $P_1, P_2, \ldots, P_n$. Then
the ratios $P_1 : P_2 : \cdots : P_n$ remain fixed. When such a set of forces
is applied on the body, the body deforms. Let the displacement at an
arbitrary point in an arbitrary direction be measured with respect to a
rectangular Cartesian frame of reference fixed with the supports. Let this
displacement be denoted by $u$. Then our second hypothesis is

(H2) Hooke’s law:

\[ u = a_1 P_1 + a_2 P_2 + \cdots + a_n P_n , \tag{1} \]

where $a_1, a_2, \ldots, a_n$ are constants independent of the magnitudes of $P_1$,
$P_2, \ldots, P_n$. The constants $a_1, a_2, \ldots, a_n$ depend, of course, on the
location of the point at which the displacement component is measured and
on the directions and points of application of the individual forces of the
loading.

Fig. 1.1:1. Static equilibrium of a body under external forces.

Hooke’s law in the form (H2) is one that can be subjected readily to 
direct experimental examination. 

To complete the formulation of the theory of elasticity, we need a third
hypothesis:

(H3) There exists a unique unstressed state of the body, to which the
body returns whenever all the external forces are removed.

A body satisfying these three hypotheses is called a linear elastic solid.
A number of deductions can be drawn from these assumptions. We shall
list a few important ones.

(A) Principle of superposition

By a combination of (H2) and (H3), we can show that Eq. (1) is valid not
only for systems of loads for which the ratios $P_1 : P_2 : \cdots : P_n$ remain fixed
as originally assumed, but also for an arbitrary set of loads $\mathbf{P}_1, \mathbf{P}_2, \ldots, \mathbf{P}_n$.
In other words, Eq. (1) holds regardless of the order in which the loads are
applied.

Proof. If a proof of the statement above can be established for an
arbitrary pair of loads, then the general theorem can be proved by mathematical
induction.

Let $\mathbf{P}_1$ and $\mathbf{P}_2$ (with magnitudes $P_1$ and $P_2$) be a pair of arbitrary
loads acting at points 1 and 2, respectively. Let the deflection in a specific
direction be measured at a point 3 (see Fig. 1.1:1). According to (H2), if
$\mathbf{P}_1$ is applied alone, then at the point 3 a deflection $u_3 = c_{31} P_1$ is produced.
If $\mathbf{P}_2$ is applied alone, a deflection $u_3 = c_{32} P_2$ is produced. If $\mathbf{P}_1$ and $\mathbf{P}_2$
are applied together, with the ratio $P_1 : P_2$ fixed, then according to (H2)
the deflection can be written as

\[ u_3 = c'_{31} P_1 + c'_{32} P_2 . \tag{a} \]

The question arises whether $c'_{31} = c_{31}$, $c'_{32} = c_{32}$. The answer is
affirmative, as can be shown as follows. After $\mathbf{P}_1$ and $\mathbf{P}_2$ are applied, we
take away $\mathbf{P}_1$. This produces a change in deflection, $-c''_{31} P_1$, and the total
deflection becomes

\[ u_3 = c'_{31} P_1 + c'_{32} P_2 - c''_{31} P_1 . \tag{b} \]

Now only $\mathbf{P}_2$ acts on the body. Hence, upon unloading $\mathbf{P}_2$ we shall have

\[ u_3 = c'_{31} P_1 + c'_{32} P_2 - c''_{31} P_1 - c_{32} P_2 . \tag{c} \]

Now all the loads are removed, and $u_3$ must vanish according to (H3).
Rearranging terms, we have

\[ (c'_{31} - c''_{31}) P_1 = (c_{32} - c'_{32}) P_2 . \tag{d} \]

Since the only possible difference of $c'_{31}$ and $c''_{31}$ must be caused by the
action of $\mathbf{P}_2$, the difference $c'_{31} - c''_{31}$ can only be a function of $P_2$ (and not
of $P_1$). Similarly, $c_{32} - c'_{32}$ can only be a function of $P_1$. If we write Eq. (d)
as

\[ \frac{c'_{31} - c''_{31}}{P_2} = \frac{c_{32} - c'_{32}}{P_1} , \tag{e} \]

then the left-hand side is a function of $P_2$ alone, and the right-hand side
is a function of $P_1$ alone. Since $P_1$ and $P_2$ are arbitrary numbers, the only
possibility for Eq. (e) to be valid is for both sides to be a constant $k$ which
is independent of both $P_1$ and $P_2$. Hence,

\[ c'_{32} = c_{32} - k P_1 . \tag{f} \]

But a substitution of (f) into (a) yields

\[ u_3 = c'_{31} P_1 + c_{32} P_2 - k P_1 P_2 . \tag{g} \]

The last term is nonlinear in $P_1$, $P_2$, and Eq. (g) will contradict (H2) unless
$k$ vanishes. Hence, $k = 0$ and $c'_{32} = c_{32}$. An analogous procedure shows

\[ c'_{31} = c''_{31} = c_{31} . \]

Thus the principle of superposition is established for one and two forces.
An entirely similar procedure will show that if it is valid for $m$ forces,
it is also valid for $m + 1$ forces. Thus, the general theorem follows by
mathematical induction. Q.E.D.

The constants $c_{31}$, $c_{32}$, etc., are seen to be of significance in defining the
elastic property of the solid body. They are called influence coefficients or,
more specifically, flexibility influence coefficients.
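The principle of superposition can be illustrated numerically. The following is a minimal Python sketch, assuming a hypothetical 2 x 2 table of flexibility influence coefficients; the numerical values and the names `C` and `displacements` are illustrative and do not come from the text. It checks that the response to two loads applied together equals the sum of the responses to each load acting alone.

```python
# Hypothetical flexibility influence coefficients C[i][j]: displacement at
# point i per unit load at point j. The numbers are illustrative only.
C = [[2.0e-3, 0.5e-3],
     [0.5e-3, 1.0e-3]]


def displacements(C, P):
    """Return u_i = sum_j C[i][j] * P[j] for a list of load magnitudes P."""
    return [sum(c_ij * P_j for c_ij, P_j in zip(row, P)) for row in C]


P_a = [100.0, 0.0]    # first load acting alone
P_b = [0.0, 250.0]    # second load acting alone
P_ab = [a + b for a, b in zip(P_a, P_b)]   # both loads together

u_a = displacements(C, P_a)
u_b = displacements(C, P_b)
u_ab = displacements(C, P_ab)

# Superposition: the combined response equals the sum of the two responses.
u_sum = [a + b for a, b in zip(u_a, u_b)]
print(u_ab, u_sum)
```

Because the relation is linear, the same check succeeds for any number of loads and any values of the coefficients.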

(B) Corresponding forces and displacements and the unique meaning of 
the total work done by the forces. 

Let us now consider a set of external forces $\mathbf{P}_1, \ldots, \mathbf{P}_n$ acting on the body
and define the set of displacements at the points of application and in the
direction of the loads as the displacements “corresponding” to the forces
at these points. The reactions at the points of support are considered as
external forces exerted on the body and included in the set of forces.

Under the loads $\mathbf{P}_1, \ldots, \mathbf{P}_n$, the corresponding displacements may be
written as

\[
\begin{aligned}
u_1 &= c_{11} P_1 + c_{12} P_2 + \cdots + c_{1n} P_n , \\
u_2 &= c_{21} P_1 + c_{22} P_2 + \cdots + c_{2n} P_n , \\
&\;\;\vdots \\
u_n &= c_{n1} P_1 + c_{n2} P_2 + \cdots + c_{nn} P_n .
\end{aligned}
\tag{2}
\]

If we multiply the first equation by $P_1$, the second by $P_2$, etc., and add, we
obtain

\[
\begin{aligned}
P_1 u_1 + P_2 u_2 + \cdots + P_n u_n ={}& c_{11} P_1^2 + c_{12} P_1 P_2 + \cdots + c_{1n} P_1 P_n \\
&+ c_{21} P_1 P_2 + c_{22} P_2^2 + \cdots + c_{nn} P_n^2 .
\end{aligned}
\tag{3}
\]

The quantity above is independent of the order in which the loads are
applied. It is twice the total work done by the set of forces as they are
increased slowly from zero to their final values [compare the factor 1/2 in
Eq. (6)]; the total work is therefore also independent of the order of loading.

(C) Maxwell’s reciprocal relation 

The influence coefficients for corresponding forces and displacements are
symmetric:

\[ c_{ij} = c_{ji} . \tag{4} \]

In other words, the displacement at a point $i$ due to a unit load at another
point $j$ is equal to the displacement at $j$ due to a unit load at $i$, provided
that the displacements and forces “correspond,” i.e., that they are measured
in the same direction at each point.

The proof is simple. Consider two forces $\mathbf{P}_1$ and $\mathbf{P}_2$ (Fig. 1.1:1). When
the forces are applied in the order $\mathbf{P}_1$, $\mathbf{P}_2$, the work done by the forces is
easily seen to be

\[ W = \tfrac{1}{2} (c_{11} P_1^2 + c_{22} P_2^2) + c_{12} P_1 P_2 . \]

When the order of application of the forces is interchanged, the work done
is

\[ W' = \tfrac{1}{2} (c_{22} P_2^2 + c_{11} P_1^2) + c_{21} P_1 P_2 . \]

But according to (B) above, $W = W'$ for arbitrary $P_1$, $P_2$. Hence, $c_{12} = c_{21}$,
and the theorem is proved.
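The bookkeeping behind $W$ and $W'$ can be sketched in a few lines of Python. The function name `work_sequential` and all numerical values are assumptions made for illustration. Each load is imagined to grow slowly from zero while the loads already in place are held at full value, which is the loading sequence used in the proof above.

```python
def work_sequential(c, P, order):
    """Work done when the loads are applied one at a time, each increased
    slowly from zero to P[k], in the sequence given by `order`."""
    done = []   # indices of loads already at full value
    W = 0.0
    for k in order:
        # The growing load does work (1/2) c_kk P_k^2 on its own deflection.
        W += 0.5 * c[k][k] * P[k] ** 2
        # Loads already in place act at full value through the additional
        # deflection c_jk P_k produced at their points of application.
        W += sum(P[j] * c[j][k] * P[k] for j in done)
        done.append(k)
    return W


P = [100.0, 250.0]
c_sym = [[2.0e-3, 0.5e-3], [0.5e-3, 1.0e-3]]    # c_12 = c_21
c_asym = [[2.0e-3, 0.5e-3], [0.7e-3, 1.0e-3]]   # c_12 != c_21, for contrast

W_12 = work_sequential(c_sym, P, [0, 1])   # order P_1, then P_2
W_21 = work_sequential(c_sym, P, [1, 0])   # order P_2, then P_1

# With unsymmetric coefficients the two orders differ by (c_12 - c_21) P_1 P_2.
gap = work_sequential(c_asym, P, [0, 1]) - work_sequential(c_asym, P, [1, 0])
print(W_12, W_21, gap)
```

The unsymmetric table is included only to show what the theorem rules out: the work would then depend on the order of loading, contrary to (B).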

(D) Betti-Rayleigh reciprocal theorem 

Let a set of loads $\mathbf{P}_1, \mathbf{P}_2, \ldots, \mathbf{P}_n$ produce a set of corresponding
displacements $u_1, u_2, \ldots, u_n$. Let a second set of loads $\mathbf{P}'_1, \mathbf{P}'_2, \ldots, \mathbf{P}'_n$, acting in
the same directions and having the same points of application as those of
the first, produce the corresponding displacements $u'_1, u'_2, \ldots, u'_n$. Then

\[ P_1 u'_1 + P_2 u'_2 + \cdots + P_n u'_n = P'_1 u_1 + P'_2 u_2 + \cdots + P'_n u_n . \tag{5} \]

In other words, in a linear elastic solid, the work done by a set of forces
acting through the corresponding displacements produced by a second set of
forces is equal to the work done by the second set of forces acting through
the corresponding displacements produced by the first set of forces.

A straightforward proof is furnished by writing out the $u_i$ and $u'_i$ in
terms of $P_i$ and $P'_i$ ($i = 1, 2, \ldots, n$), with appropriate influence
coefficients, comparing the results on both sides of the equation, and utilizing
the symmetry of the influence coefficients.

In the form of Eq. (5), the reciprocal theorem can be generalized to
include moments and rotations as the corresponding generalized forces and
generalized displacements. An illustration is given in Fig. 1.1:2. These
theorems are very useful in practical applications.
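A quick numerical check of the reciprocal theorem is sketched below in Python. The symmetric coefficient table and the two load sets are hypothetical values chosen for illustration.

```python
def displacements(c, P):
    """Corresponding displacements u_i = sum_j c[i][j] * P[j]."""
    return [sum(c_ij * P_j for c_ij, P_j in zip(row, P)) for row in c]


# Hypothetical symmetric flexibility influence coefficients (c_ij = c_ji).
c = [[2.0e-3, 0.5e-3, 0.1e-3],
     [0.5e-3, 1.0e-3, 0.3e-3],
     [0.1e-3, 0.3e-3, 4.0e-3]]

P_first = [100.0, 0.0, -40.0]    # first set of loads
P_second = [10.0, 250.0, 75.0]   # second set, same points and directions

u_first = displacements(c, P_first)
u_second = displacements(c, P_second)

# Left- and right-hand sides of the reciprocal relation.
lhs = sum(P * u for P, u in zip(P_first, u_second))
rhs = sum(P * u for P, u in zip(P_second, u_first))
print(lhs, rhs)
```

The two sums agree only because the table is symmetric; this is the step in the proof that uses Maxwell’s reciprocal relation.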


Fig. 1.1:2. Illustration of the reciprocal theorem. (a) Forces and
corresponding displacements; for the same beam, $c_{21} = c_{12}$. (b) Generalized
force (moment) and the corresponding generalized displacement (moment ~
rotation of angle).

(E) Strain energy 

Further insight can be gained from the first law of thermodynamics. When 
a body is thermally isolated and thermal expansions are neglected the first 
law states that the work done on the body by the external forces in a certain 
time interval is equal to the increase in the kinetic energy and internal 
energy in the same interval. If the process is so slow that the kinetic energy 
can be ignored, the work done is seen to be equal to the change in internal 
energy. 

If the internal energy is reckoned as zero in the unstressed state, the
stored internal energy shall be called strain energy. Writing $U$ for the
strain energy, we have, from (3) and (4),

\[ U = \frac{1}{2} \sum_{i=1}^{n} \sum_{j=1}^{n} c_{ij} P_i P_j = \frac{1}{2} \sum_{i=1}^{n} c_{ii} P_i^2 + \frac{1}{2} \sum_{i=1}^{n} \sum_{j \neq i} c_{ij} P_i P_j . \tag{6} \]

If we differentiate Eq. (6) with respect to $P_i$, we obtain

\[ \frac{\partial U}{\partial P_i} = c_{ii} P_i + \sum_{j \neq i} c_{ij} P_j , \qquad i = 1, 2, \ldots, n. \]

But the right-hand side is precisely $u_i$; hence, we obtain

(F) Castigliano’s theorem

\[ \frac{\partial U}{\partial P_i} = u_i , \qquad i = 1, \ldots, n. \tag{7} \]

In other words, if a set of loads $\mathbf{P}_1, \ldots, \mathbf{P}_n$ is applied on a perfectly elastic
body as described above and the strain energy is expressed as a function
of the set $P_1, \ldots, P_n$, then the partial derivative of the strain energy, with
respect to a particular load, gives the corresponding displacement at the
point of application of that particular load in the direction of that load.
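Castigliano’s theorem lends itself to a simple numerical check. The Python sketch below assumes a hypothetical symmetric coefficient table and uses a central difference to approximate the partial derivative of the strain energy; the function names and the step size `h` are illustrative choices.

```python
def strain_energy(c, P):
    """U = (1/2) sum_i sum_j c[i][j] P[i] P[j]."""
    n = len(P)
    return 0.5 * sum(c[i][j] * P[i] * P[j] for i in range(n) for j in range(n))


def castigliano_displacement(c, P, i, h=1.0e-2):
    """Central-difference estimate of dU/dP_i (assumed step size h)."""
    P_plus = list(P)
    P_minus = list(P)
    P_plus[i] += h
    P_minus[i] -= h
    return (strain_energy(c, P_plus) - strain_energy(c, P_minus)) / (2.0 * h)


c = [[2.0e-3, 0.5e-3], [0.5e-3, 1.0e-3]]   # hypothetical, symmetric
P = [100.0, 250.0]

# Displacements from the influence coefficients directly ...
u_direct = [sum(c[i][j] * P[j] for j in range(2)) for i in range(2)]
# ... and from differentiating the strain energy.
u_castigliano = [castigliano_displacement(c, P, i) for i in range(2)]
print(u_direct, u_castigliano)
```

Since $U$ is quadratic in the loads, the central difference reproduces the derivative up to rounding error, so the two lists agree closely.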

(G) The principle of virtual work

On the other hand, for a body in equilibrium under a set of external forces,
the principle of virtual work can be applied to show that, if the strain energy
is expressed as a function of the corresponding displacements, then

\[ \frac{\partial U}{\partial u_i} = P_i , \qquad i = 1, \ldots, n. \tag{8} \]

The proof consists in allowing a virtual displacement $\delta u$ to take place in the
body in such a manner that $\delta u$ is continuous everywhere but vanishes at
all points of loading except under $\mathbf{P}_i$. Due to $\delta u$, the strain energy changes
by an amount $\delta U$, while the virtual work done by the external forces is the
product of $P_i$ times the virtual displacement, i.e., $P_i \, \delta u_i$. According to the
principle of virtual work, these two expressions are equal, $\delta U = P_i \, \delta u_i$. On
rewriting it in the differential form, the theorem is established.

The important result (8) is established on the principle of virtual work
as applied to a state of equilibrium under the additional assumption that a
strain energy function that is a function of displacement exists. It is
applicable also to elastic bodies that follow a nonlinear load-displacement
relationship.
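To illustrate that the derivative of the strain energy with respect to displacement returns the load even when the load-displacement law is nonlinear, consider a single elastic spring with an assumed cubic hardening law. The law, the constants `k` and `b`, and the function names are hypothetical and serve only as a sketch.

```python
# Hypothetical nonlinear elastic spring: P = k*u + b*u**3 (illustrative values).
k = 2000.0
b = 5.0e6


def load(u):
    """Load required to hold the spring at displacement u."""
    return k * u + b * u ** 3


def strain_energy(u):
    """Work stored in reaching u: the integral of load(u) from 0 to u."""
    return 0.5 * k * u ** 2 + 0.25 * b * u ** 4


def dU_du(u, h=1.0e-6):
    """Central-difference estimate of dU/du (assumed step size h)."""
    return (strain_energy(u + h) - strain_energy(u - h)) / (2.0 * h)


u = 0.01
P_from_energy = dU_du(u)
P_direct = load(u)
print(P_from_energy, P_direct)
```

The two values agree to within the truncation error of the finite difference, although the spring does not obey Hooke’s law.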
1.2. LINEAR SOLIDS WITH MEMORY: MODELS OF VISCOELASTICITY

Most structural metals are nearly linear elastic under small strain, as
measurements of the load-displacement relationship reveal. The existence of
normal modes of free vibrations which are simple harmonic in time is often
quoted as an indication (although not as a proof) of the linear elastic
character of the material. However, when one realizes that the vibration
of metal instruments does not last forever, even in a vacuum, it becomes
clear that metals deviate somewhat from Hooke’s law. Thus, other
constitutive laws must be considered. The need for such an extension becomes
particularly evident when organic polymers are considered.

In this section we shall consider a simple class of materials which retains 
linearity between load and deflection, but the linear relationship depends 
on a third parameter, time. For this class of material, the present state of 
deformation cannot be determined completely unless the entire history of 
loading is known. 

A linear elastic solid may be said to have a simple memory: it remembers 
only one configuration; namely, the unstrained natural state of the body. 
Many materials do not behave this way: they remember the past. Among 
such materials with memory there is one class that is relatively simple in 
behavior. This is the class of materials named above, for which the cause 
and effect are linearly related. 



Fig. 1.2:1. Models of linear viscoelasticity: (a) Maxwell, (b) Voigt, (c) Kelvin.

Let us consider some simple examples. In Fig. 1.2:1 are shown three
mechanical models of material behavior, namely, the Maxwell model, the
Voigt model, and the Kelvin model, all of which are composed of
combinations of linear springs with spring constant $\mu$ and dashpots (pistons moving
in a viscous fluid) with coefficient of viscosity $\eta$. A linear spring is
supposed to produce instantaneously a deformation proportional to the load.
A dashpot is supposed to produce a velocity proportional to the load at any
instant. The load-deflection relationships for these models are

\[ \text{Maxwell model:} \quad \dot{u} = \frac{\dot{F}}{\mu} + \frac{F}{\eta} , \qquad u(0) = \frac{F(0)}{\mu} , \tag{1} \]

\[ \text{Voigt model:} \quad F = \mu u + \eta \dot{u} , \qquad u(0) = 0 , \tag{2} \]

\[ \text{Kelvin model:} \quad F + \tau_\varepsilon \dot{F} = E_R (u + \tau_\sigma \dot{u}) , \qquad \tau_\varepsilon F(0) = E_R \tau_\sigma u(0) , \tag{3} \]

where $\tau_\varepsilon$, $\tau_\sigma$ are two constants. When these equations are to be integrated,
the initial conditions at $t = 0$ must be prescribed as indicated above.

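The creep functions $c(t)$, which are the displacement $u(t)$ in response
to a unit-step force $F(t) = \mathbf{1}(t)$ defined in Eq. (7) below, are the solutions
of Eqs. (1)-(3). They are:

\[ \text{Maxwell solid:} \quad c(t) = \left( \frac{1}{\mu} + \frac{1}{\eta} t \right) \mathbf{1}(t) , \tag{4} \]

\[ \text{Voigt solid:} \quad c(t) = \frac{1}{\mu} \left( 1 - e^{-(\mu/\eta) t} \right) \mathbf{1}(t) , \tag{5} \]

\[ \text{Kelvin solid:} \quad c(t) = \frac{1}{E_R} \left[ 1 - \left( 1 - \frac{\tau_\varepsilon}{\tau_\sigma} \right) e^{-t/\tau_\sigma} \right] \mathbf{1}(t) , \tag{6} \]

where the unit-step function $\mathbf{1}(t)$ is defined as

\[ \mathbf{1}(t) = \begin{cases} 1 & \text{when } t > 0 , \\ \tfrac{1}{2} & \text{when } t = 0 , \\ 0 & \text{when } t < 0 . \end{cases} \tag{7} \]

A body which obeys a load-deflection relation like that given by Maxwell’s
model is said to be a Maxwell solid. Similarly, Voigt and Kelvin solids are
defined. These models are called models of viscoelasticity.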
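The creep responses can be evaluated directly. The Python sketch below codes the closed-form solutions obtained by integrating Eqs. (1)-(3) for a unit-step force with the stated initial conditions; the function names and the parameter values used in the checks are illustrative assumptions.

```python
import math


def unit_step(t):
    """Unit-step function 1(t), with the value 1/2 at t = 0."""
    if t > 0:
        return 1.0
    if t == 0:
        return 0.5
    return 0.0


def creep_maxwell(t, mu, eta):
    """Instantaneous spring deflection 1/mu followed by steady dashpot creep."""
    if t < 0:
        return 0.0
    return (1.0 / mu + t / eta) * unit_step(t)


def creep_voigt(t, mu, eta):
    """No instantaneous deflection; approaches 1/mu with time constant eta/mu."""
    if t < 0:
        return 0.0
    return (1.0 / mu) * (1.0 - math.exp(-mu * t / eta)) * unit_step(t)


def creep_kelvin(t, E_R, tau_eps, tau_sig):
    """Jumps to tau_eps/(E_R*tau_sig), then approaches 1/E_R."""
    if t < 0:
        return 0.0
    decay = math.exp(-t / tau_sig)
    return (1.0 / E_R) * (1.0 - (1.0 - tau_eps / tau_sig) * decay) * unit_step(t)


# Illustrative parameters (assumed values).
mu, eta = 2.0, 4.0
E_R, tau_eps, tau_sig = 1.0, 1.0, 3.0

print(creep_maxwell(4.0, mu, eta))
print(creep_voigt(1000.0, mu, eta))
print(creep_kelvin(1.0e-12, E_R, tau_eps, tau_sig))
```

For the Kelvin solid the deflection immediately after loading is $\tau_\varepsilon / (E_R \tau_\sigma)$, which is the value required by the initial condition in Eq. (3) with $F(0) = 1$.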

Interchanging the roles of $F$ and $u$, we obtain the relaxation function as
a response $F(t) = k(t)$ corresponding to an elongation $u(t) = \mathbf{1}(t)$, Eq. (7).

\[ \text{Maxwell solid:} \quad k(t) = \mu e^{-(\mu/\eta) t} \, \mathbf{1}(t) , \tag{8} \]

\[ \text{Voigt solid:} \quad k(t) = \eta \, \delta(t) + \mu \, \mathbf{1}(t) , \tag{9} \]

\[ \text{Kelvin solid:} \quad k(t) = E_R \left[ 1 - \left( 1 - \frac{\tau_\sigma}{\tau_\varepsilon} \right) e^{-t/\tau_\varepsilon} \right] \mathbf{1}(t) . \tag{10} \]

Here we have used the symbol $\delta(t)$ to indicate the unit-impulse function,
or Dirac-delta function, which is defined as a function with a singularity at
the origin:

\[ \delta(t) = 0 \quad \text{for } t < 0 \text{ and } t > 0 , \qquad \int_{-\epsilon}^{\epsilon} f(t) \, \delta(t) \, dt = f(0) , \quad \epsilon > 0 , \tag{11} \]

where $f(t)$ is an arbitrary function continuous at $t = 0$. These functions,
$c(t)$ and $k(t)$, are illustrated in Figs. 1.2:2 and 1.2:3, respectively, for which
we add the following comments.

Fig. 1.2:2. Creep function of (a) Maxwell, (b) Voigt, (c) Kelvin solid. A negative phase
is superposed at the time of unloading.


For the Maxwell solid, a sudden application of a load induces an
immediate deflection by the elastic spring, which is followed by “creep” of the
dashpot. On the other hand, a sudden deformation produces an immediate
reaction by the spring, which is followed by stress relaxation according to
an exponential law, Eq. (8). The factor $\eta/\mu$, with dimensions of time, may
be called a relaxation time: it characterizes the rate of decay of the force.

For the Voigt solid, a sudden application of force will produce no
immediate deflection because the dashpot, arranged in parallel with the
spring, will not move instantaneously. Instead, as shown by Eq. (5) and
Fig. 1.2:2(b), a deformation will be gradually built up, while the spring
takes a greater and greater share of the load. The dashpot displacement
relaxes exponentially. Here the ratio $\eta/\mu$ is again a relaxation time: it
characterizes the rate of decay of the deflection.

Fig. 1.2:3. Relaxation function of (a) Maxwell, (b) Voigt, (c) Kelvin solid.

For the Kelvin solid, a similar interpretation is applicable. The
constant $\tau_\varepsilon$ is the time of relaxation of load under the condition of constant
deflection [see Eq. (10)], whereas the constant $\tau_\sigma$ is the time of relaxation
of deflection under the condition of constant load [see Eq. (6)]. As $t \to \infty$,
the dashpot is completely relaxed, and the load-deflection relation becomes
that of the springs, as is characterized by the constant $E_R$ in Eqs. (6) and
(10). Therefore, $E_R$ is called the relaxed elastic modulus.
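The meaning of the relaxation times can be checked numerically. The Python sketch below evaluates, for $t > 0$, the relaxation responses that follow from Eqs. (1)-(3) under a unit-step elongation. The impulsive term $\eta \, \delta(t)$ of the Voigt solid acts only at $t = 0$ and is therefore omitted. Function names and parameter values are assumptions for illustration.

```python
import math


def relax_maxwell(t, mu, eta):
    """Maxwell relaxation for t > 0: exponential decay with time eta/mu."""
    return mu * math.exp(-mu * t / eta)


def relax_voigt(t, mu, eta):
    """Voigt relaxation for t > 0 (impulse at t = 0 not represented)."""
    return mu


def relax_kelvin(t, E_R, tau_eps, tau_sig):
    """Kelvin relaxation for t > 0: decays from E_R*tau_sig/tau_eps to E_R."""
    return E_R * (1.0 - (1.0 - tau_sig / tau_eps) * math.exp(-t / tau_eps))


# Illustrative parameters (assumed values).
mu, eta = 2.0, 4.0
E_R, tau_eps, tau_sig = 1.0, 1.0, 3.0

# After one relaxation time eta/mu the Maxwell force has dropped to mu/e.
force_after_one_relaxation_time = relax_maxwell(eta / mu, mu, eta)

# Kelvin solid: initial (unrelaxed) and long-time (relaxed) stiffness.
k_initial = relax_kelvin(1.0e-12, E_R, tau_eps, tau_sig)
k_final = relax_kelvin(1000.0, E_R, tau_eps, tau_sig)
print(force_after_one_relaxation_time, k_initial, k_final)
```

With these values the Kelvin response starts at $E_R \tau_\sigma / \tau_\varepsilon$ and decays to the relaxed modulus $E_R$ with time constant $\tau_\varepsilon$.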

Load-deflection relations such as (l)-(3) were proposed to extend the 
classical theory of elasticity to include anelastic phenomena. Lord Kelvin 
(Sir William Thomson, 1824-1907), on measuring the variation of the rate 
of dissipation of energy with frequency of oscillation in various
materials, showed the inadequacy of the Maxwell and Voigt equations. A more
successful generalization using mechanical models was first made by John
H. Poynting (1852-1914) and Joseph John Thomson (“J.J.,” 1856-1940) in
their book Properties of Matter (London: C. Griffin and Co., 1902).

1.3. SINUSOIDAL OSCILLATIONS IN A VISCOELASTIC MATERIAL

It is interesting to examine the relationship between the load and the 
deflection in a body when it is forced to perform simple harmonic oscil¬ 
lations. A simple harmonic oscillation can be described by the real or 
imaginary part of a complex variable. Thus, if Fo is a complex variable 
Fo = Ae 1 ^ = A(cos d> + * sin </>), then a simple harmonic oscillatory force can 
be written as 

F(f) = Foe Mt = A(cosd> + isin^)(cosw£ + isinwf) 

= A cos(o >t + <f>) +iA sin(wt + cfi). 


( 1 ) 
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Similarly, if uq = then a simple harmonic oscillatory displacement 

u(t) = uoe tu}t is either the real part or the imaginary part of 

(2) u(t) — uoe tult = B cos (ut + ip) + iB sin(cvf + ip). 

On substituting (1) and (2) into Eqs. (1)-(3) of Sec. 1.2, we can obtain the
ratio $u_0/F_0$, which is a complex number. The inverse, $F_0/u_0$, is called
the complex modulus of a viscoelastic material, and is often designated by
$M$:

(3)    $M = F_0/u_0 = |M| e^{i\delta}$,

where $|M|$ is the magnitude and $\delta$ is the phase angle by which the
strain lags behind the stress. The tangent of $\delta$ is often used as a
measure of the internal friction of a linear viscoelastic material:

(4)    $\tan\delta = \dfrac{\text{imaginary part of } M}{\text{real part of } M}$.


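For the Kelvin model, we have

(5)    $M = \dfrac{1 + i\omega\tau_\sigma}{1 + i\omega\tau_\varepsilon}\,E_R$,
       $|M| = \left(\dfrac{1 + \omega^2\tau_\sigma^2}{1 + \omega^2\tau_\varepsilon^2}\right)^{1/2} E_R$,

(6)    $\tan\delta = \dfrac{\omega(\tau_\sigma - \tau_\varepsilon)}{1 + \omega^2(\tau_\sigma\tau_\varepsilon)}
       = \dfrac{\tau_\sigma - \tau_\varepsilon}{(\tau_\sigma\tau_\varepsilon)^{1/2}} \cdot
         \dfrac{\omega(\tau_\sigma\tau_\varepsilon)^{1/2}}{1 + \omega^2(\tau_\sigma\tau_\varepsilon)}$.
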


When $|M|$ and $\tan\delta$ in (5) and (6) are plotted against the logarithm
of $\omega$, curves as shown in Fig. 1.3:1 are obtained. Experiments with
torsional oscillations of metal wires at various temperatures, reduced to
room temperature according to a certain thermodynamic formula, yield a
typical “relaxation spectrum” as shown in Fig. 1.3:2. Many peaks are seen in
the internal-friction-versus-frequency curve. It has been suggested that
each peak should be regarded as representing an elementary process as
described above, with a particular set of relaxation times $\tau_\sigma$,
$\tau_\varepsilon$. Each set of relaxation times $\tau_\sigma$,
$\tau_\varepsilon$ can be attributed to some process at the atomic or
microscopic level. A detailed study of such a relaxation spectrum tells a
great deal about the structures of metals; and the study of internal friction
has provided a very effective key to metal physics.

Fig. 1.3:1. Frequency dependence of internal friction and elastic modulus.

Fig. 1.3:2. A typical relaxation spectrum. (After C. M. Zener, Elasticity and
Anelasticity of Metals, The University of Chicago Press, 1948.)
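
The frequency dependence described by Eqs. (5) and (6) can be explored
numerically. The following is a minimal sketch, not part of the original
text; the function names and the values of $\tau_\sigma$, $\tau_\varepsilon$
and $E_R$ are illustrative assumptions. It evaluates the complex modulus of
the Kelvin model and checks that the internal friction computed from Eq. (4)
agrees with the closed form of Eq. (6).

```python
import math

def kelvin_modulus(omega, tau_sigma, tau_eps, E_R=1.0):
    """Complex modulus M of the Kelvin model, first line of Eq. (5)."""
    return E_R * (1 + 1j * omega * tau_sigma) / (1 + 1j * omega * tau_eps)

def modulus_magnitude(omega, tau_sigma, tau_eps, E_R=1.0):
    """|M| of the Kelvin model, second line of Eq. (5)."""
    return E_R * math.sqrt((1 + (omega * tau_sigma) ** 2) /
                           (1 + (omega * tau_eps) ** 2))

def internal_friction(omega, tau_sigma, tau_eps):
    """tan(delta) of the Kelvin model, Eq. (6)."""
    return omega * (tau_sigma - tau_eps) / (1 + omega ** 2 * tau_sigma * tau_eps)

# Illustrative (assumed) relaxation times; any tau_sigma > tau_eps > 0 will do.
tau_sigma, tau_eps = 2.0, 0.5
# Eq. (6) is largest where omega * sqrt(tau_sigma * tau_eps) = 1.
omega_peak = 1.0 / math.sqrt(tau_sigma * tau_eps)

for omega in (0.01, 0.1, omega_peak, 10.0, 100.0):
    M = kelvin_modulus(omega, tau_sigma, tau_eps)
    # Eq. (4): tan(delta) is the ratio of imaginary to real part of M.
    print(f"omega = {omega:8.3f}   |M| = {abs(M):.4f}   "
          f"tan(delta) = {M.imag / M.real:.4f}")
```

In this sketch the modulus rises from $E_R$ at low frequency toward
$E_R\tau_\sigma/\tau_\varepsilon$ at high frequency, while the internal
friction passes through a single peak, which is the behavior sketched in
Fig. 1.3:1.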

1.4. PLASTICITY 

Take a small steel rod. Bend it. When the deflection is small, the rod
will spring back to its original shape when you release the load. This is
elasticity. When the deflection is sufficiently large, a permanent
deformation will remain when the load is released. That is plasticity. The
load-deflection relationship of an ideal plastic material is shown in
Fig. 1.4:1(a). For an ideal plastic material, the deflection is zero when
the load is smaller than a critical value. Then, when the load reaches the
critical value, the deflection continues to increase (the material flows) as
long as the same load remains. If, at a given deflection, the load becomes
smaller than the critical value, then the flow stops. There is no way the
load can be increased beyond the critical value.
Fig. 1.4:1. (a) Ideal plasticity. (b) Linear elastic-ideal plastic material.

Structural steel behaves much like an ideal plastic material, except
that when the load (measured in terms of the maximum shear stress) is
smaller than the critical load (called the yield shear stress), the
load-deflection curve is an inclined straight line (Hooke’s law). Upon
unloading, there is a small rebound. There are some details at the yield
point that were ignored in the statement above. Other metals, such as
copper, aluminum, lead, stainless steel, etc., behave in a somewhat similar,
but more complex manner. Metals at a sufficiently high temperature may
behave more like a fluid. Theories that deal with these features of
materials are called the theories of plasticity, which are presented in
Chapter 6.

1.5. VIBRATIONS 

We know vibrations by experience while driving a car, flying an airplane,
or playing a musical instrument. The trees sway in the wind. A building
shakes in an earthquake. Sometimes we want to know if a structure is
safe in vibration. Sometimes we want to design a cushion that isolates
an instrument from vibrations. A prototype of this kind of problem is
shown in Fig. 1.5:1(a). A body with mass $M$ is attached to an initially
vertical massless spring, which has a spring constant $k$ and a damping
constant $c$, and is “built-in” to a “ground” which moves horizontally with
a displacement history $s(t)$. Let $x(t)$ denote the horizontal displacement
of the mass and a dot over $x$ or $s$ denote a differentiation with respect
to time. Then $\ddot{x}$ is the acceleration of the body, $k(x - s)$ is the
spring force acting on the body, and $c(\dot{x} - \dot{s})$ is the viscous
damping force acting on the body. Newton’s second law requires that

(1)    $M\ddot{x} + c(\dot{x} - \dot{s}) + k(x - s) = 0$.

If we let

(2)    $y = x - s$

represent the displacement of the body relative to the ground, then Eq. (1)
may be written as

(3)    $M\ddot{y} + c\dot{y} + ky = -M\ddot{s}$.

Fig. 1.5:1.

The solution of Eq. (3) represents a forced vibration of a damped system.
If the forcing function $M\ddot{s}(t)$ were zero, the equation

(4)    $M\ddot{y} + c\dot{y} + ky = 0$

would describe the free vibration of the damped system. If the damping
constant $c$ vanishes, then the free vibration of the undamped system is
described by the equation

(5)    $M\ddot{y} + ky = 0$.

Equation (5) is satisfied by the solution

(6)    $y = A\cos\omega t + B\sin\omega t$,

in which $A$ and $B$ are arbitrary constants, and

(7)    $\omega = \sqrt{k/M}$,

as can be verified by direct substitution of (6) into (5). When $k$ and $M$
are real positive values, $\omega$ is real. The motion $y(t)$ given by (6) is
an oscillation of the body at a circular frequency of $\omega$ rad/sec. It
can exist without external load. Hence $\omega$ is called the natural
frequency of a free vibration. If the damping constant $c$ is zero and the
forcing function $s$ is periodic with the same frequency as the natural one,
then the amplitude of the oscillation grows without bound, and we have the
phenomenon of resonance.

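The solution of Eq. (4) must be an exponential function of time, because
the derivative of an exponential function is another exponential function.
Thus

(8)    $y(t) = Ae^{\lambda t}$ implies $\dot{y}(t) = A\lambda e^{\lambda t}$, $\ddot{y}(t) = A\lambda^2 e^{\lambda t}$.

On substituting (8) into Eq. (4), we have

(9)    $M\lambda^2 + c\lambda + k = 0$.

If we write

(10)   $k/M = \omega^2$,   $\varepsilon = c/(2\sqrt{kM})$,

then the two roots of Eq. (9) may be written as $\lambda_1$ and $\lambda_2$:

(11)   $\lambda_1 = -\varepsilon\omega + i\omega\sqrt{1 - \varepsilon^2}$,   $\lambda_2 = -\varepsilon\omega - i\omega\sqrt{1 - \varepsilon^2}$.

When $k$ and $M$ are real positive numbers, $\omega$ is real, and the
solution of Eq. (4) under the initial conditions

(12)   $y = y_0$,   $\dot{y} = \dot{y}_0$   when $t = 0$

is

(13)   $y(t) = y_0 e^{-\varepsilon\omega t}\cos\left(\omega\sqrt{1 - \varepsilon^2}\,t\right)$
            $+ \dfrac{1}{\omega\sqrt{1 - \varepsilon^2}}(\dot{y}_0 + y_0\varepsilon\omega)\,e^{-\varepsilon\omega t}\sin\left(\omega\sqrt{1 - \varepsilon^2}\,t\right)$.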
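
Equation (13) can be checked against a direct numerical integration of
Eq. (4). The sketch below is an illustration added here, not the authors’
procedure; the classical fourth-order Runge-Kutta scheme, the function names,
and the values of $M$, $c$, $k$ and the initial conditions are all
assumptions chosen for the example (with $\varepsilon < 1$, so that the
motion is oscillatory).

```python
import math

def free_response(t, y0, v0, omega, eps):
    """Closed-form free vibration of Eq. (13), valid for eps < 1."""
    wd = omega * math.sqrt(1.0 - eps ** 2)      # damped circular frequency
    decay = math.exp(-eps * omega * t)
    return decay * (y0 * math.cos(wd * t)
                    + (v0 + y0 * eps * omega) / wd * math.sin(wd * t))

def rk4_response(t_end, y0, v0, M, c, k, steps=20000):
    """Integrate M*y'' + c*y' + k*y = 0, Eq. (4), with classical RK4."""
    def accel(y, v):
        return -(c * v + k * y) / M
    h = t_end / steps
    y, v = y0, v0
    for _ in range(steps):
        k1y, k1v = v, accel(y, v)
        k2y, k2v = v + 0.5 * h * k1v, accel(y + 0.5 * h * k1y, v + 0.5 * h * k1v)
        k3y, k3v = v + 0.5 * h * k2v, accel(y + 0.5 * h * k2y, v + 0.5 * h * k2v)
        k4y, k4v = v + h * k3v, accel(y + h * k3y, v + h * k3v)
        y += h * (k1y + 2 * k2y + 2 * k3y + k4y) / 6.0
        v += h * (k1v + 2 * k2v + 2 * k3v + k4v) / 6.0
    return y

# Assumed illustrative system: mass, damping constant, spring constant.
M, c, k = 2.0, 0.8, 50.0
omega = math.sqrt(k / M)                 # Eq. (7)
eps = c / (2.0 * math.sqrt(k * M))       # Eq. (10)

y_exact = free_response(3.0, 0.01, 0.2, omega, eps)
y_num = rk4_response(3.0, 0.01, 0.2, M, c, k)
print(f"omega = {omega:.3f} rad/sec, eps = {eps:.3f}")
print(f"Eq. (13): {y_exact:.8f}   RK4: {y_num:.8f}")
```

The two values should agree to many digits, which is a quick way to catch a
sign error when Eq. (13) is coded by hand.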

Now, we can return to the solution of Eq. (3). A particular solution
of Eq. (3) can be obtained by Laplace or Fourier transformation or other
methods. By direct substitution, it can be verified that a particular
solution satisfying the initial conditions $y = \dot{y} = 0$ at $t = 0$ is

(14)   $y(t, \omega, \varepsilon) = -\dfrac{1}{\omega\sqrt{1 - \varepsilon^2}} \displaystyle\int_0^t \ddot{s}(\xi)\, e^{-\varepsilon\omega(t - \xi)} \sin\left[\omega\sqrt{1 - \varepsilon^2}\,(t - \xi)\right] d\xi$.

The general solution of Eq. (3) is the sum of the functions given in (13) and
(14). From this solution we can examine the nature of the forced oscillation
and its dependence on the parameters $\omega$, $\varepsilon$, and the
frequency spectrum of the forcing function $s(t)$.

The simple solution given by Eqs. (13) and (14) has important
applications to the problems of isolation of delicate instruments in
shipping, response of buildings to earthquakes, impact of an airplane on
landing, and landing a robot on the moon. A frequently asked question is:
what are the maximum absolute values of the displacement $y$ and the
acceleration $\ddot{y}$ as functions of the peak values of the ground
displacement $s$ and ground acceleration $\ddot{s}$, the time course of the
ground motion, the system characteristics $M$, $c$, $k$, and the initial
conditions $y_0$, $\dot{y}_0$? That is,

(15)   $\max|y(t)| / \max|s(t)|$ = function of $s(t), M, c, k, y_0, \dot{y}_0$,
       $\max|\ddot{y}(t)| / \max|\ddot{s}(t)|$ = function of $s(t), M, c, k, y_0, \dot{y}_0$.

These functions are called shock spectra. Studies of shock spectra are
important not only for technological applications, but also for mathematics.
Note that although Eq. (3) is linear in $y(t)$, the shock spectra are
nonlinear functions of the parameters of the system. Qualitatively, we
notice that the most important characteristic time of the free oscillation is
$1/\omega$. The most important characteristic time of the excitation $s(t)$
is the length of time it takes to rise from 0 to the peak value, $t_r$. We
call $t_r$ the rise time of the signal. Alluding to resonance, we see that a
parameter of importance is the ratio of the two characteristic times
$1/\omega$ and $t_r$. Hence the shock spectra are principally functions of
$\omega t_r$. See some references in the bibliography at the end of the book.
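
The dependence of a shock spectrum on $\omega t_r$ can be illustrated with a
small numerical experiment. The sketch below is an assumption-laden example
rather than a method from the text: the ground displacement is taken to rise
smoothly as $s(t) = s_{\max}(1 - \cos(\pi t/t_r))/2$ for $0 \le t \le t_r$
and to stay at $s_{\max}$ afterwards, the system starts from rest, and
Eq. (3), divided by $M$ and written with the parameters of Eq. (10), is
integrated by a Runge-Kutta scheme. The function names and the damping value
are hypothetical.

```python
import math

def ground_accel(t, t_r, s_max=1.0):
    """Second time derivative of the assumed ground displacement
    s(t) = s_max*(1 - cos(pi*t/t_r))/2 on [0, t_r]; zero afterwards."""
    if 0.0 <= t <= t_r:
        return 0.5 * s_max * (math.pi / t_r) ** 2 * math.cos(math.pi * t / t_r)
    return 0.0

def shock_ratio(omega, eps, t_r, s_max=1.0):
    """Estimate max|y| / max|s| for Eq. (3), written as
    y'' + 2*eps*omega*y' + omega**2*y = -s'', with y = y' = 0 at t = 0."""
    period = 2.0 * math.pi / omega
    h = min(t_r, period) / 200.0          # resolve the faster time scale
    t_end = t_r + 2.0 * period            # follow two periods after the rise

    def f(t, y, v):
        return v, -ground_accel(t, t_r, s_max) - 2.0 * eps * omega * v - omega ** 2 * y

    t, y, v, y_peak = 0.0, 0.0, 0.0, 0.0
    while t < t_end:
        k1y, k1v = f(t, y, v)
        k2y, k2v = f(t + 0.5 * h, y + 0.5 * h * k1y, v + 0.5 * h * k1v)
        k3y, k3v = f(t + 0.5 * h, y + 0.5 * h * k2y, v + 0.5 * h * k2v)
        k4y, k4v = f(t + h, y + h * k3y, v + h * k3v)
        y += h * (k1y + 2 * k2y + 2 * k3y + k4y) / 6.0
        v += h * (k1v + 2 * k2v + 2 * k3v + k4v) / 6.0
        t += h
        y_peak = max(y_peak, abs(y))
    return y_peak / s_max                 # max|s| equals s_max for this pulse

# With omega = 1 the rise time t_r is numerically equal to omega * t_r.
for omega_tr in (0.1, 1.0, 3.0, 10.0, 50.0):
    ratio = shock_ratio(omega=1.0, eps=0.05, t_r=omega_tr)
    print(f"omega*t_r = {omega_tr:5.1f}   max|y|/max|s| = {ratio:.4f}")
```

For this particular pulse, a rise that is fast compared with $1/\omega$
leaves the mass momentarily behind, so the relative displacement approaches
the full ground displacement, whereas a slow rise produces almost no relative
motion.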

The picture shown in Fig. 1.5:1(b) may represent, in a very crude way,
a concrete floor of a steel frame building during an earthquake. A system
shown in Fig. 1.5:1(c) may represent an astronaut in flight. A dumbbell
shown in Fig. 1.5:1(d) has been used to model a molecule. It is clear
that fairly comprehensive models of dynamic systems can be devised by
this approach; see, for example, Fig. 1.5:1(e). When there are too many
particles of mass and springs of elasticity, however, simplicity may be lost,
and one turns naturally to the continuum approach outlined in the next
section.

1.6. PROTOTYPE OF WAVE DYNAMICS 

A wire with an infinite number of particles of mass connected together
elastically is an obvious candidate for the continuum approach. As an
example, consider a wire as shown in Fig. 1.6:1. Let an axis $x$ run along
the length of the wire, with an origin $O$ chosen at the lower end. When
the wire is loaded, each particle in the wire will be displaced longitudinally
from its original position by an amount $u$. We shall consider only axial
loading and assume that the plane cross sections remain plane, so that $u$ is
parallel to the $x$-axis and is a function of $x$. We shall assume further
that $u$ is infinitesimal, so that the strain in the wire is

(1)    $e = \dfrac{\partial u}{\partial x}$.

Fig. 1.6:1.        Fig. 1.6:2.

The change in cross-sectional dimensions of the wire due to the axial load
will be ignored. The wire is assumed to be elastic, and is so thin that all
stress components other than the axial may be neglected. Then the axial
stress $\sigma$ is given by Hooke’s law

(2)    $\sigma = Ee = E\dfrac{\partial u}{\partial x}$,

in which $E$ is a constant called Young’s modulus of elasticity. Now
consider an element of the wire of a small length $dx$ (see Fig. 1.6:2). The
force acting at the lower end is $\sigma A$, where $A$ is the cross-sectional
area of the wire. The force acting at the upper end is $A(\sigma + d\sigma)$.
For a continuous function $\sigma(x)$ the differential $d\sigma$ is equal to
$(\partial\sigma/\partial x)\,dx$. The acceleration of the element,
$\partial^2 u/\partial t^2$, must be caused by the difference in axial forces.
Now, the mass of the element is $\rho A\,dx$, with $\rho$ denoting the
density of the material. Hence, equating the mass times acceleration with the
net axial load $A\,d\sigma$, and canceling the nonvanishing factor $A\,dx$,
we obtain the equation of motion

(3)    $\rho\dfrac{\partial^2 u}{\partial t^2} = \dfrac{\partial\sigma}{\partial x}$.

A substitution of Eq. (2) yields the wave equation

(4)    $\dfrac{\partial^2 u}{\partial x^2} = \dfrac{1}{c^2}\dfrac{\partial^2 u}{\partial t^2}$,

in which the constant $c$ is the wave speed:

(5)    $c = \sqrt{E/\rho}$.

The wave speed c is a characteristic constant of the material. 
The general solution of Eq. (4) is

(6)    $u = f(x - ct) + F(x + ct)$,

where $f$ and $F$ are two arbitrary functions. This can be verified by
substituting Eq. (6) directly into Eq. (4). The function $u = f(x - ct)$
represents a wave propagating in the positive $x$ direction. The function
$u = F(x + ct)$ represents a wave propagating in the negative $x$ direction.
In either case we have

(7)    $\dfrac{\partial u}{\partial x} = \mp\dfrac{1}{c}\dfrac{\partial u}{\partial t} = \mp\dfrac{v}{c}$,

with $v = \partial u/\partial t$ denoting the particle velocity. A
substitution of (7) and (5) into (2) yields the formula

(8)    $\sigma = \mp\dfrac{E}{c}v = \mp\rho c v$,


where the $-$ sign applies to a wave propagating in the positive $x$
direction, and the $+$ sign applies to a wave in the other direction. This
result is remarkable. It says that the magnitude of the stress is equal to
the product of the mass density of the material, the velocity of the sound
wave, and the longitudinal velocity of the particles of the wire. This result
has a very important application to an experiment initiated by John
Hopkinson (1872), who hung a steel wire vertically from a ceiling, attached a
stopper at the lower end as shown in Fig. 1.6:1, then dropped a massive
weight down the wire from a measured height. When the weight struck the
stopper, it stretched the wire suddenly. Hopkinson’s intention was to measure
the strength of the wire with this method. Using different weights dropped
from different heights, he found that the minimum height from which a weight
had to be dropped to break the wire was, within certain limits, almost
independent of the magnitude of the weight and the diameter of the wire.

Now, when different solid bodies are dropped from a given height, the
velocity reached at any given time is independent of the weight. Thus
Hopkinson explained his result on the basis of elastic wave propagation.
Equation (8) shows that the stress is equal to $\rho c v$ in magnitude. The
speed of sound of longitudinal waves in the wire, given by Eq. (5), is
16,000 ft/sec for steel. The velocity $v$ in the wire, however, is not
necessarily the largest at the instant of impact at the lower end. To find
$v$, we have to solve Eq. (4) with the boundary conditions of the Hopkinson
experiment (Fig. 1.6:1), which has a clamped upper end and a movable lower
end:

(9)    $u = 0$   at $x = L$ for all $t$, and when $t < 0$ for all $x$,

(10)   $M\dfrac{dv}{dt} = A\sigma$   at $x = 0$ for all $t$,

(11)   $v = V_0$   at $x = 0$ when $t = 0$.

Here $V_0$ is the velocity of the stopper, and $M$ is the combined mass of
the dropped weight and the stopper. The mathematical problem is reduced to
finding the arbitrary functions $f(x - ct)$ and $F(x + ct)$ so that $u(x, t)$
given by Eq. (6) satisfies Eqs. (9)-(11).

Physically, the weight generated an elastic wave $u(x, t)$ in the wire,
initiated by a velocity $V_0$ at $x = 0$ when $t = 0$. The wave went up,
reflected at the top, went down, reflected at the lower end by the moving
weight, and so on. In John Hopkinson’s test, a 27-foot long wire and weights
ranging from 7 to 41 lbs. were used, and the absolute maximum of $v$ and
$\sigma$ was reached at the top after a number of reflections. The stress
picture was very complicated. Bertram Hopkinson (1905) repeated his father’s
experiment with a 1 lb. weight impacting on the lower end of a 30-foot long
No. 10 gauge wire weighing 1.3 lbs. Nevertheless, as shown by G. I. Taylor
(1946), the maximum tensile stress in B. Hopkinson’s experiment did not occur
at the first reflection, when the stress was $2\rho c V_0$, but at the third
reflection, i.e., the second reflection at the top of the wire, when the
tensile stress reached $2.15\rho c V_0$.
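
To get a feeling for the magnitudes in Eqs. (5) and (8), the following
sketch evaluates the wave speed and the stress levels $\rho c V_0$,
$2\rho c V_0$ and $2.15\rho c V_0$ for a steel wire. It is only an
illustration: the density of steel, the SI value of Young’s modulus, the
drop height, and the idealization that $V_0$ equals the free-fall velocity
$\sqrt{2gh}$ of the weight are assumptions made here, not data from the
Hopkinson experiments.

```python
import math

E = 207e9        # Pa; assumed Young's modulus of steel (about 3e7 lb/sq in)
RHO = 7850.0     # kg/m^3; assumed mass density of steel
G = 9.81         # m/s^2; acceleration of gravity
FT = 0.3048      # metres per foot

c = math.sqrt(E / RHO)       # Eq. (5): longitudinal wave speed in the wire
c_ft = c / FT

def impact_stress(drop_height_m, factor=1.0):
    """Stress factor*rho*c*V0 from Eq. (8), taking V0 = sqrt(2*g*h).

    Equating V0 with the free-fall velocity of the weight is an
    idealization (a weight much heavier than the stopper)."""
    v0 = math.sqrt(2.0 * G * drop_height_m)
    return factor * RHO * c * v0

print(f"c = {c:.0f} m/s = {c_ft:.0f} ft/sec")
for factor in (1.0, 2.0, 2.15):
    stress = impact_stress(1.0, factor)       # hypothetical 1 m drop
    print(f"{factor:4.2f} * rho*c*V0 = {stress / 1e6:.0f} MPa")
```

The computed wave speed should come out close to the 16,000 ft/sec figure
quoted above. Note that neither the mass of the weight nor the diameter of
the wire enters $\rho c V_0$, which is consistent with Hopkinson’s
observation.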

If the mass of the wire were negligible, the wave velocity would be very
high and the system of Fig. 1.6:1 would become the mass and spring system
of Sec. 1.5.

Thus waves and vibrations are different features of the same dynamic 
system. Waves deliver extremely important information in a solid body. 
They are used in geophysics to probe the earth, to study earthquakes and 
to find oil and gas reserves. In medicine the pulse waves in the artery and 
the sound waves in the lung are used for diagnosis. In music the piano tuner 
and violin player listen to the vibration characteristics of their instruments. 
Some aspects of the general theory are discussed in Chapter 9. 

1.7. BIOMECHANICS 

Historically, the theory of solid mechanics was first developed along 
the lines of the linearized theory of elasticity, then it was expanded to 
the linearized theory of viscoelasticity. Then plasticity came on the scene. 
The nonlinear theory of large deformation and finite strain was developed. 
Further expansion was made possible by computational methods. Many 
significant problems in material science, aerospace structures, metal and 
plastics industries, geophysics, planetary physics, thermo-nuclear reactors, 
etc., were solved. Following this trend of expansion, it is natural to consider 
biological problems. 

Many mechanics scientists have contributed to the understanding of
physiology, e.g., Galileo Galilei (1564-1642), William Harvey (1578-1658),
René Descartes (1596-1650), Giovanni Alfonso Borelli (1608-1679), Robert
Boyle (1627-1691), and Robert Hooke (1635-1703) before Newton (1642-1727),
and Leonhard Euler (1707-1783), Thomas Young (1773-1829), Jean Poiseuille
(1797-1869), and Hermann von Helmholtz (1821-1894) after Newton. In fact,
the Greek book On the Parts of Animals written by Aristotle (384-322 BC) and
the Chinese book Nei Jing (or Internal Classic), attributed to Huangti (the
Yellow Emperor) but believed to have been written by anonymous authors in
the Warring Period (472-221 BC), contain many concepts of biomechanics.

At the time when the manuscript of the first edition of this book was 
prepared, the molecule that is responsible for the genetics of the cells had 
been identified as DNA, and the double helix structure of DNA had been 
discovered. The beginning of a new age of understanding biology in terms 
of chemistry, physics and mechanics was recognized by many people. 





Biomechanics has the following salient features: 

(1) Material Constitution 

Every living organism has a solid structure that gives it a unique shape
and size, and an internal fluid flow that transports materials and keeps the
organism alive. Hence biosolid mechanics is inseparable from biofluid
mechanics. The cells make new materials. Hence the composition, structure,
and ultrastructure of biomaterials change dynamically.

(2) The Constitutive Equations 

Blood is a non-Newtonian fluid. Synovial fluid in the knee joint, and 
body fluid in the abdominal cavity are also non-Newtonian. The bone obeys 
Hooke’s law. The blood vessel, the skin and most other soft tissues in the 
body do not obey Hooke’s law. We must know the constitutive equations 
of biological tissues, whose determination was a major task of the early 
biomechanics. When attention is focused on cells, the determination of 
the constitutive equations of DNA and other molecules becomes a primary 
task. 

(3) Growth and Remodeling of Living Tissues 
Under Stress 

Tissue is made of cells and extracellular matrices. The cell division,
growth, hypertrophy, movement, or death are influenced by the cell geometry
and the stress and strain in the cell, as well as by the chemical and
physical environment. The tissue is, therefore, a changing, living entity.
The determination of the growth law is a major task in biology.

(4) The Existence of Residual Stress 

Organs such as the blood vessel, heart, esophagus and intestine have 
large residual strain and stress at the no-load condition. The residual strain 
changes in life because any birth or death of a cell in a continuum and the 
building and resorption of extracellular matrix by the cells create residual 
strains in the continuum. The changes of residual stress and strain cause 
changes of the stress state in vivo. The growth law and the residual stress 
are coupled. 

(5) Concern about the Hierarchy of Sizes 

In biology the hierarchy of sizes dominates the scene. Consider the
human body. In studying gait, posture and sports, the length scale of
interest is that of the whole body. In the study of the hemodynamics of the
heart valves, a characteristic length is the size of the left ventricle. For
coronary atherosclerosis studies, a characteristic dimension for hemodynamics
in the vessel is the diameter of the coronary arteries; that for the shear
stress on the vessel wall is the thickness of the endothelial cell, in the
micrometer range; and that for the molecular mechanism in the cells must be
in the nanometer range. For microcirculation, features must be measured on
the scale of the diameters of the red blood cells and the capillary blood
vessels. In studying heart muscle contraction we must consider features on
the length scale of the sarcomere. In considering gene therapy we must think
of the functions of the DNA molecule. In determining the active transport
phenomenon we must be concerned with ion channels in the cell membrane.
At each level of the hierarchy, there are important problems to be solved.
Each hierarchy has its own appropriate mechanics. Integration of all the
hierarchies is the objective of physiological studies.

(6) Perspectives 

Life is motion. The science of motion is mechanics. Hence biology
needs mechanics. Molecular biology needs molecular mechanics. Cell biology
needs cell mechanics. Tissue biology needs tissue mechanics. Organ
biology needs organ mechanics. Medicine, surgery, injury prevention, injury
treatment, rehabilitation, and sports need mechanics. Solid and fluid
mechanics have much to offer to biology. Yet, still, biomechanics seems to
be a satellite, not in the main streams of biology and mechanics. Why?
The reason probably lies in certain axiomatic differences between biology
and mechanics. Classical continuum mechanics identifies a continuum as an
isomorphism of the real number system. Biology studies molecules, cells,
tissues, organs, and individuals in an orderly manner. The hierarchies are
well defined. How can a biological entity be identified as a continuous body?
The answer must lie in the clear recognition of the hierarchical structure.
In each hierarchy, one must select a proper length scale as the starting
point. No question with a characteristic length smaller than that selected
dimension can be discussed. Then, in that hierarchy, one can identify a
continuum to study it. This is the first axiom of biomechanics.

Secondly, solids of the classical solid mechanics have fixed structures and
materials, an unchanging zero-stress state, and a set of fixed constitutive
equations for the materials. Biomechanics, however, has to deal with
DNA-controlled changes of materials, structures, zero-stress state, and
constitutive equations. All these can be identified phenomenologically by
suitably designed experiments. This is the second axiom of biomechanics.

Mechanics and biology can be unified only if these axiomatic differences 
are recognized. Knowing the theoretical structure of biomechanics, we can 
then develop classical and computational methods to cover it. Then biology
becomes a new source of fresh water to irrigate our field of mechanics. 

The present book will not deal with biomechanics any further. Some 
references are given in the Bibliography at the end of the book. 

1.8. HISTORICAL REMARKS 

The best-known constitutive equation for a solid is Hooke’s law of
linear elasticity, which was discovered by Robert Hooke in 1660. This law
furnishes the foundation for the mathematical theory of elasticity. By 1821,
Louis M. H. Navier (1785-1836) had succeeded in formulating the general
equations of the three-dimensional theory of elasticity. All questions of the
small strain of elastic bodies were thus reduced to a matter of mathematical
calculation. In the same year, 1821, Fresnel (1788-1827) announced
his wave theory of light. The concept of transverse oscillations through an
elastic medium attracted the attention of Cauchy and Poisson. Augustin
L. Cauchy (1789-1857) developed the concept of stress and strain, and
formulated the linear stress-strain relationship that is now called Hooke’s
law. Siméon D. Poisson (1781-1840) developed a molecular theory of elasticity
and arrived at the same equation as Navier’s. Both Navier and Poisson
based their analysis on the Newtonian conception of the constitution of
bodies, and assumed certain laws of intermolecular forces. Cauchy’s general
reasoning, however, made no use of the hypothesis of material particles.
In the ensuing years, with the contributions of George Green (1793-1841),
George Stokes (1819-1903), Lord Kelvin (William Thomson, 1824-1907),
and others, the mathematical theory was established. The fundamental
questions of continuum mechanics have received renewed attention in recent
years. Through the efforts of C. Truesdell, R. S. Rivlin, W. Noll, J.
L. Ericksen, A. E. Green, and others, theories of finite strain and nonlinear
constitutive equations have been developed. Another focus of development
is in the areas of micron and submicron structures, in which the strain
gradient effect can become dominant. In the biomechanical field, the study of
the growth and remodeling of living tissues will lead continuum mechanics to
another plateau. New developments in continuum mechanics since the late
1940s have been truly remarkable.

The great impact of solid mechanics on civilization, however, is felt
through its application to technology. In this respect a fine tradition was
established by early masters. Galileo Galilei (1564-1642) considered the
question of strength of beams and columns. James (or Jacques) Bernoulli
(1654-1705) introduced the simple beam theory. Leonhard Euler (1707-1783)
gave the column formula. Charles Augustin Coulomb (1736-1806)
considered the failure criterion. Joseph Louis Lagrange (1736-1813) gave
the equation that governs the bending and vibration of plates. Navier and
Poisson gave numerous applications of their general theory to special
problems. The best example was set by Barré de Saint-Venant (1797-1886),
whose general solution of the problems of torsion and bending of prismatical
bars is of great importance in engineering. With characteristic
consideration for those who would use his results, he solved these problems
with a completeness that included many numerical coefficients and graphical
presentations.

From this auspicious beginning, mechanics has developed into the
mainstay of our civilization. Physics and chemistry are dominated by the
theory of relativity, quantum mechanics, and quantum electrodynamics.
Geophysics, meteorology, and oceanography rest on the mechanics of seismic
waves, atmospheric and ocean currents, and acoustic waves. Airplanes,
ships, rockets, spacecraft, bathyscaphs, automobiles, trains, rails,
highways, buildings, internal combustion engines, and jet engines are
designed, constructed, and maintained on the basis of fluid and solid
mechanics. New materials are invented for their mechanical properties.
Artificial heart valves, hearts, kidneys, limbs, skin, and pacemakers are
mechanical marvels. Fundamental mechanics developed side by side with the
developments in science and technology. In the first half of the 20th
century, advancements were mainly concentrated on analytical solutions of
the differential equations. In the second half of the 20th century,
computational methods took center stage. These trends are reflected in this
book.

PROBLEMS 

1.1. Prove that it is possible to generalize Hooke’s law to deal with moments 
and angles of rotations by considering a concentrated couple as the limiting case of 
two equal and opposite forces approaching each other but maintaining a constant 
moment. 

1.2. A pin-jointed truss is shown in Fig. P1.2. Every member of the truss is
made of the same steel and has the same cross-sectional area. Find the tension or
compression in every member when a load $P$ is applied at the point shown in the
figure. Note: A joint is said to be a pin joint if no moment can be transmitted
across it. This problem is statically determinate.

1.3. Find the vertical deflections at a, b, c of the truss in Fig. P1.2 due to a
load $P$ at point b. Assume that $P$ is sufficiently small so that the truss
remains linear elastic. Use the fact that for a single uniform bar in tension
the total change in length of this bar is given by $PL/AE$, where $P$ is the
load in the bar, $L$ is the bar length, $A$ is the cross-sectional area, and
$E$ is the Young’s modulus of the material. For structural steels $E$ is about
$3 \times 10^7$ lb/sq in, or $20.7 \times 10^7$ kPa.
Hint: Use Castigliano’s theorem.

1.4. A “rigid” frame of structural steel shown in Fig. P1.4 is acted on by
a horizontal force $P$. Find the reactions at the points of support a and d.
The cross-sectional area $A$ and the bending rigidity constant $EI$ of the
various members are shown in the figure.

By a “rigid” frame is meant a frame whose joints are rigid; e.g., welded. Thus,
when we say that corner b is a rigid joint, we mean that the two members meeting
at b will retain the same angle at that corner under any loading. The frame under
loading will deform. It is useful to sketch the deflection line of the frame. Try to
locate the points of inflection on the members. The points of inflection are points
where the bending moments vanish.

The reactions to be found are the vertical force $V$, the horizontal force $H$,
and the bending moment $M$, at each support.

Use engineering beam theory for this problem. In this theory, the change of
curvature of the beam is $M/EI$, where $M$ is the local bending moment and $EI$
is the local bending rigidity. The strain energy per unit length due to bending is,
therefore, $M^2/2EI$. The strain energy per unit length due to a tensile force
$P$ is $P^2/2EA$. The strain energy due to transverse shear is negligible in
comparison with that due to bending.
1.5. A circular ring of uniform linear elastic material and uniform cross
section is loaded by a pair of equal and opposite forces at the ends of a diameter
(Fig. P1.5). Find the change of diameters aa and bb. Note: This is a statically
indeterminate problem. You have to determine the bending moment distribution
in the ring.

1.6. Compare the changes in diameters aa, bb, and cc when a circular ring of
uniform linear elastic material and uniform cross section is subjected to a pair of
bending moments at the ends of a diameter (Fig. P1.6).

Fig. P1.5.        Fig. P1.6.

1.7. Beggs’ Deformeter. G. E. Beggs used an experimental model to
determine the reactions of a statically indeterminate structure. For example,
to determine the horizontal reaction $H$ at the right-hand support b of an
elastic arch under the load $P$, he imposes at b a small displacement $\delta$
in the horizontal direction and measures at $P$ the deflection $\delta'$ in the
direction corresponding with $P$, while preventing the vertical displacement
and rotation of the end b, Fig. P1.7. Show that $H = -P\delta'/\delta$. (Use
the reciprocal theorem.) [G. E. Beggs, J. Franklin Institute, 203 (1927),
pp. 375-386.]
1.8. A simply supported, thin elastic beam of variable cross section with
bending rigidity $EI(x)$ rests on an elastic foundation with spring constant $k$
and is loaded by a distributed lateral load of intensity $p(x)$ per unit length
(Fig. P1.8). Find an approximate expression for the deflection curve by
assuming that it can be represented with sufficient accuracy by the expression

    $u(x) = \displaystyle\sum_{n=1}^{N} a_n \sin\dfrac{n\pi x}{L}$.

Note: This expression satisfies the end conditions for arbitrary coefficients
$a_n$. Use the minimum potential energy theorem.

1.9. Consider a uniform cantilever beam clamped at x = 0 (Fig. PI.9). 
According to Bernoulli-Euler theory of beams, the differential equation governing 
the deflection of the beam is 


d?w _ M 

~ch? ~ EH' 


Pi.9 

where w is the deflection parallel to the 2 -axis and M is the bending moment 
at station x. This equation is valid if the beam is straight and slender and if 
the loading acts in a plane containing a principal axis of all cross sections of the 
beam, in which case the deflection w occurs also in that plane. 

Use this differential equation and the boundary conditions: 

At clamped end: $w = \dfrac{dw}{dx} = 0$.

At free end: $EI \dfrac{d^2 w}{dx^2} = EI \dfrac{d^3 w}{dx^3} = 0$.

Derive the deflection curve of the beam when it is loaded by a unit force located
at $x = \xi$.

Ans. For $0 \le x \le \xi \le l$: $w = \dfrac{x^2}{6EI} (3\xi - x)$.

For $0 \le \xi \le x \le l$: $w = \dfrac{\xi^2}{6EI} (3x - \xi)$.

The solution fits the definition of influence coefficients and is known as an
influence function.
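
The influence function quoted in the answer can be tabulated at discrete stations. The following is a minimal sketch, assuming EI = 1, l = 1, and stations at x = kl/6 (k = 1, ..., 6); these values and the names `influence`, `stations`, and `C` are illustrative choices and are not taken from the text or from Fig. P1.9.

```python
def influence(x, xi, EI=1.0):
    """Deflection at x due to a unit force at xi (cantilever clamped at x = 0)."""
    if x <= xi:
        return x**2 * (3.0 * xi - x) / (6.0 * EI)
    return xi**2 * (3.0 * x - xi) / (6.0 * EI)

l = 1.0                                        # assumed beam length
stations = [l * k / 6 for k in range(1, 7)]    # assumed station positions

# C[i][j]: deflection at station i due to a unit force at station j.
C = [[influence(x, xi) for xi in stations] for x in stations]

for row in C:
    print(" ".join(f"{c:8.5f}" for c in row))
```

The table is symmetric, C[i][j] = C[j][i], as the reciprocal theorem requires, and the last diagonal entry is the familiar tip deflection l^3/(3EI).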

1.10. Let the beam of Problem 1.9 be divided into six equidistant sections
as marked in Fig. P1.9. Compute the influence coefficients $c_{ij}$, where
i, j = 1, 2, ..., 6.

In attempting to develop the theories outlined in the previous chapter
rigorously and succinctly, we shall first learn to use the powerful tool of
tensor calculus. At first sight, the mathematical analysis may seem involved,
but a little study will soon reveal its simplicity.

2.1. NOTATION AND SUMMATION CONVENTION 

Let us begin with the matter of notation. In tensor analysis one makes
extensive use of indices. A set of n variables $x_1, x_2, \ldots, x_n$ is usually
denoted as $x_i$, $i = 1, \ldots, n$. A set of n variables $y^1, y^2, \ldots, y^n$ is
denoted by $y^i$, $i = 1, \ldots, n$. We emphasize that $y^1, y^2, \ldots, y^n$ are n
independent variables and not the first n powers of the variable y.

Consider an equation describing a plane in a three-dimensional space
$x^1, x^2, x^3$,

(1)     $a_1 x^1 + a_2 x^2 + a_3 x^3 = p$,

where $a_i$ and p are constants. This equation can be written as

(2)     $\sum_{i=1}^{3} a_i x^i = p$.

However, we shall introduce the summation convention and write the
equation above in the simple form

(3)     $a_i x^i = p$.

The convention is as follows: The repetition of an index (whether superscript
or subscript) in a term will denote a summation with respect to that index
over its range. The range of an index i is the set of n integer values 1 to
n. A lower index i, as in $a_i$, is called a subscript, and an upper index i, as
in $x^i$, is called a superscript. An index that is summed over is called a dummy
index; one that is not summed out is called a free index.

Since a dummy index just indicates summation, it is immaterial which
symbol is used. Thus, $a_i x^i$ may be replaced by $a_j x^j$, etc. This is analogous
to the dummy variable in an integral

$$\int_a^b f(x)\,dx = \int_a^b f(y)\,dy .$$


The use of index and summation convention may be illustrated by other
examples. Consider a unit vector $\nu$ in a three-dimensional Euclidean space
with rectangular Cartesian coordinates x, y, and z. Let the direction cosines
$\alpha_i$ be defined as

$$\alpha_1 = \cos(\nu, x), \quad \alpha_2 = \cos(\nu, y), \quad \alpha_3 = \cos(\nu, z),$$

where $(\nu, x)$ denotes the angle between $\nu$ and the x-axis, etc. The set of
numbers $\alpha_i$ (i = 1, 2, 3) represents the projections of the unit vector on the
coordinate axes. The fact that the length of the vector is unity is expressed
by the equation

$$(\alpha_1)^2 + (\alpha_2)^2 + (\alpha_3)^2 = 1,$$

or simply

(4)     $\alpha_i \alpha_i = 1$.

As a further illustration, consider a line element (dx, dy, dz) in a
three-dimensional Euclidean space with rectangular Cartesian coordinates x, y, z.
The square of the length of the line element is

(5)     $ds^2 = dx^2 + dy^2 + dz^2$.

If we define

(6)     $dx^1 = dx, \quad dx^2 = dy, \quad dx^3 = dz$,

and

(7)     $\delta_{11} = \delta_{22} = \delta_{33} = 1$,
        $\delta_{12} = \delta_{21} = \delta_{13} = \delta_{31} = \delta_{23} = \delta_{32} = 0$,

then (5) may be written as

(8)     $ds^2 = \delta_{ij}\, dx^i dx^j$,

with the understanding that the range of the indices i and j is 1 to 3. Note
that there are two summations in the expression above, one over i and one
over j. The symbol $\delta_{ij}$ as defined in Eq. (7) is called the Kronecker delta.
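
The summation convention translates directly into explicit loops. The sketch below is only an illustration: the sample values and the names `delta`, `p_i`, `p_j`, `ds2_double`, and `ds2_direct` are assumptions introduced here. It evaluates $a_i x^i$ with two different dummy-index names and evaluates $\delta_{ij}\, dx^i dx^j$ as a double sum.

```python
def delta(i, j):
    """Kronecker delta for integer indices."""
    return 1 if i == j else 0

a = [2.0, -1.0, 3.0]      # components a_i (sample values)
x = [1.0, 4.0, 0.5]       # components x^i (sample values)

# a_i x^i: the repeated index is summed over its range; its name is immaterial.
p_i = sum(a[i] * x[i] for i in range(3))
p_j = sum(a[j] * x[j] for j in range(3))

dx = [0.1, -0.2, 0.2]     # a line element (dx^1, dx^2, dx^3), sample values

# delta_ij dx^i dx^j: two repeated indices, hence a double summation.
ds2_double = sum(delta(i, j) * dx[i] * dx[j]
                 for i in range(3) for j in range(3))
ds2_direct = sum(c * c for c in dx)

print(p_i, p_j, ds2_double, ds2_direct)
```

Python lists are indexed from 0, so the index ranges 0 to 2 here stand for the ranges 1 to 3 of the text.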

The following determinant illustrates another application:

$$\begin{vmatrix} a_{11} & a_{12} & a_{13} \\ a_{21} & a_{22} & a_{23} \\ a_{31} & a_{32} & a_{33} \end{vmatrix}
= a_{11} a_{22} a_{33} + a_{21} a_{32} a_{13} + a_{31} a_{12} a_{23}
- a_{11} a_{32} a_{23} - a_{12} a_{21} a_{33} - a_{13} a_{22} a_{31} .$$

If we denote the general term in the determinant by $a_{ij}$ and write the
determinant as $|a_{ij}|$, then the equation above can be written as

(9)     $|a_{ij}| = e_{rst}\, a_{r1} a_{s2} a_{t3}$,

where $e_{rst}$, the permutation symbol, is defined by the equations

(10) ▲  $e_{111} = e_{222} = e_{333} = e_{112} = e_{113} = e_{221} = e_{223} = e_{331} = e_{332} = 0$,
        $e_{123} = e_{231} = e_{312} = 1$,
        $e_{213} = e_{321} = e_{132} = -1$.

In other words, $e_{ijk}$ vanishes whenever the values of any two indices
coincide; $e_{ijk} = 1$ when the subscripts permute like 1, 2, 3; and $e_{ijk} = -1$
otherwise.

The Kronecker delta and the permutation symbol are very important
quantities which will appear again and again in this book. They are
connected by the identity

(11) ▲  $e_{ijk}\, e_{ist} = \delta_{js} \delta_{kt} - \delta_{jt} \delta_{ks}$.

This $e$-$\delta$ identity is used sufficiently frequently to warrant special
attention here. It can be verified by actual trial.
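
The "actual trial" can be delegated to a short program. The following sketch checks the identity for all 81 combinations of the free indices j, k, s, t; the closed-form expression used in `perm` and the names `perm`, `delta`, and `mismatches` are choices made for this illustration only.

```python
from itertools import product

def perm(i, j, k):
    """Permutation symbol e_ijk for indices taken from {1, 2, 3}."""
    # The product is +2 for even permutations, -2 for odd ones, 0 otherwise.
    return (i - j) * (j - k) * (k - i) // 2

def delta(i, j):
    """Kronecker delta."""
    return 1 if i == j else 0

R = (1, 2, 3)

# Collect every (j, k, s, t) for which the two sides of the identity differ.
mismatches = [
    (j, k, s, t)
    for j, k, s, t in product(R, repeat=4)
    if sum(perm(i, j, k) * perm(i, s, t) for i in R)
    != delta(j, s) * delta(k, t) - delta(j, t) * delta(k, s)
]

print("identity holds for all index values:", not mismatches)
```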

Finally, we shall extend the summation convention to differentiation
formulas. Let $f(x^1, x^2, \ldots, x^n)$ be a function of n variables
$x^1, x^2, \ldots, x^n$. Then its differential shall be written as

(12)    $df = \dfrac{\partial f}{\partial x^i}\, dx^i$.

The Kronecker delta and the permutation symbol play important roles in
vector or tensor operations in a rectangular Cartesian coordinate system. The
dot product of two vectors $\mathbf{x} = (x_1, x_2, x_3)$ and $\mathbf{y} = (y_1, y_2, y_3)$
is a scalar

$$c = \mathbf{x} \cdot \mathbf{y} = x_1 y_1 + x_2 y_2 + x_3 y_3 ,$$

which can be written as

(13)    $c = \delta_{ij} x_i y_j = x_i y_i$.

The vector product of two vectors x and y is the vector
$\mathbf{z} = \mathbf{x} \times \mathbf{y}$ whose three components are

(14)    $z_1 = x_2 y_3 - x_3 y_2, \quad z_2 = x_3 y_1 - x_1 y_3, \quad z_3 = x_1 y_2 - x_2 y_1$.

This can be shortened by writing

(15)    $z_i = e_{ijk} x_j y_k$.
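
Equations (13) and (15) can be evaluated literally, index by index. The sketch below does so for two sample vectors; the function names `perm`, `dot`, and `cross` and the numerical values are assumptions of this illustration.

```python
def perm(i, j, k):
    """Permutation symbol e_ijk for indices taken from {1, 2, 3}."""
    return (i - j) * (j - k) * (k - i) // 2

def dot(x, y):
    """c = x_i y_i, Eq. (13)."""
    return sum(x[i] * y[i] for i in range(3))

def cross(x, y):
    """z_i = e_ijk x_j y_k, Eq. (15); indices run 1..3, lists are 0-based."""
    R = (1, 2, 3)
    return [sum(perm(i, j, k) * x[j - 1] * y[k - 1] for j in R for k in R)
            for i in R]

x = [1.0, 2.0, 3.0]
y = [4.0, 5.0, 6.0]
z = cross(x, y)

print(z)                      # components agree with Eq. (14)
print(dot(x, z), dot(y, z))   # z is perpendicular to both x and y
```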

PROBLEMS 

2.1. Show that, when i, j, k range over 1, 2, 3,

(a) $\delta_{ij} \delta_{ij} = 3$          (c) $e_{ijk} A_j A_k = 0$

(b) $e_{ijk} e_{jki} = 6$          (d) $\delta_{ij} \delta_{jk} = \delta_{ik}$

2.2. Verify the following identity connecting three arbitrary vectors by
means of the $e$-$\delta$ identity.

$$\mathbf{A} \times (\mathbf{B} \times \mathbf{C}) = (\mathbf{A} \cdot \mathbf{C}) \mathbf{B} - (\mathbf{A} \cdot \mathbf{B}) \mathbf{C} .$$

Note: The last equation is well known in vector analysis. After identifying the
quantities involved as Cartesian tensors, this verification may be construed as a
proof of the $e$-$\delta$ identity.

2.2. COORDINATE TRANSFORMATION 

The central point of view of tensor analysis is to study the change of 
the components of a quantity such as a vector with respect to coordinate 
transformations. 

A set of independent variables $x_1, x_2, x_3$ may be thought of as specifying
the coordinates of a point in a frame of reference. A transformation from
$x_1, x_2, x_3$ to a set of new variables $\bar{x}_1, \bar{x}_2, \bar{x}_3$ through the
equations

(1)     $\bar{x}_i = f_i(x_1, x_2, x_3)$,   i = 1, 2, 3,

specifies a transformation of coordinates. The inverse transformation

(2)     $x_i = g_i(\bar{x}_1, \bar{x}_2, \bar{x}_3)$,   i = 1, 2, 3,

proceeds in the reverse direction. In order to insure that such a
transformation is reversible and in one-to-one correspondence in a certain
region R of the variables $(x_1, x_2, x_3)$, i.e., in order that each set of numbers
$(x_1, x_2, x_3)$ defines a unique set of numbers $(\bar{x}_1, \bar{x}_2, \bar{x}_3)$, for
$(x_1, x_2, x_3)$ in the region R, and vice versa, it is sufficient to meet the
following conditions:

(a) The functions $f_i$ are single-valued, continuous, and possess
continuous first partial derivatives in the region R, and

(b) The Jacobian determinant $J = |\partial \bar{x}_i / \partial x_j|$ does not vanish at
any point of the region R, i.e.,

(3)     $J = \begin{vmatrix}
\dfrac{\partial \bar{x}_1}{\partial x_1} & \dfrac{\partial \bar{x}_1}{\partial x_2} & \dfrac{\partial \bar{x}_1}{\partial x_3} \\
\dfrac{\partial \bar{x}_2}{\partial x_1} & \dfrac{\partial \bar{x}_2}{\partial x_2} & \dfrac{\partial \bar{x}_2}{\partial x_3} \\
\dfrac{\partial \bar{x}_3}{\partial x_1} & \dfrac{\partial \bar{x}_3}{\partial x_2} & \dfrac{\partial \bar{x}_3}{\partial x_3}
\end{vmatrix} \neq 0$ in R.


Coordinate transformations with the properties (a), (b) named above
are called admissible transformations. If the Jacobian is positive
everywhere, then a right-handed set of coordinates is transformed into another
right-handed set, and the transformation is said to be proper. If the
Jacobian is negative everywhere, a right-handed set of coordinates is
transformed into a left-handed one, and the transformation is said to be
improper. In this book, we shall tacitly assume that our transformations
are admissible and proper.
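
Condition (b) is easy to probe numerically. The following sketch takes, purely as an assumed example, the map from cylindrical variables $(r, \phi, z)$ to rectangular Cartesian ones and estimates the Jacobian determinant by central differences; the step size `h` and all function names are illustrative choices.

```python
import math

def f(x):
    """Assumed example map: (r, phi, z) -> (r cos phi, r sin phi, z)."""
    r, phi, z = x
    return [r * math.cos(phi), r * math.sin(phi), z]

def jacobian_matrix(f, x, h=1e-6):
    """m[i][j] approximates d(xbar_i)/d(x_j) by central differences."""
    n = len(x)
    m = [[0.0] * n for _ in range(n)]
    for j in range(n):
        xp, xm = list(x), list(x)
        xp[j] += h
        xm[j] -= h
        fp, fm = f(xp), f(xm)
        for i in range(n):
            m[i][j] = (fp[i] - fm[i]) / (2.0 * h)
    return m

def det3(m):
    """Determinant of a 3 x 3 matrix by expansion along the first row."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

J = det3(jacobian_matrix(f, [2.0, 0.7, 1.0]))        # analytically J = r
J_axis = det3(jacobian_matrix(f, [0.0, 0.7, 1.0]))   # on the axis r = 0

print(J, J_axis)
```

For this map J = r, so the transformation is admissible and proper in any region with r > 0, while the axis r = 0, where J vanishes, must be excluded.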


2.3. EUCLIDEAN METRIC TENSOR 

The first thing we must know about any coordinate system is how to 
measure length in that reference system. This information is given by the 
metric tensor. 

Consider a three-dimensional Euclidean space, with the range of all 
indices 1, 2, 3. Let 

(1)     $\theta_i = \theta_i(x_1, x_2, x_3)$,

be an admissible transformation of coordinates from the rectangular
Cartesian (in honor of Cartesius, i.e., Descartes) coordinates $x_1, x_2, x_3$ to
some general coordinates $\theta_1, \theta_2, \theta_3$. The inverse transformation

(2)     $x_i = x_i(\theta_1, \theta_2, \theta_3)$,

is assumed to exist, and the points $(x_1, x_2, x_3)$ and $(\theta_1, \theta_2, \theta_3)$ are in
one-to-one correspondence.

Consider a line element with three components given by the
differentials $dx^1, dx^2, dx^3$. Since the coordinates $x_1, x_2, x_3$ are assumed to be
rectangular Cartesian, the length of the element ds is determined by
Pythagoras' rule:

(3) ▲   $ds^2 = dx^i dx^i = \delta_{ij}\, dx^i dx^j$.

Here $ds^2$ is the square of ds, not ds with a superscript 2. When a coordinate
transformation (1) is effected, we obtain from Eq. (2), according to the
ordinary rules of differentiation,

(4)     $dx^i = \dfrac{\partial x_i}{\partial \theta_k}\, d\theta^k$.

Substituting (4) into (3), we have

$$ds^2 = \frac{\partial x_i}{\partial \theta_k} \frac{\partial x_i}{\partial \theta_m}\, d\theta^k d\theta^m .$$

If we define the functions $g_{km}(\theta_1, \theta_2, \theta_3)$ by

(5) ▲   $g_{km}(\theta_1, \theta_2, \theta_3) = \dfrac{\partial x_i}{\partial \theta_k} \dfrac{\partial x_i}{\partial \theta_m}$,

then the square of the line element in the general $\theta_1, \theta_2, \theta_3$ coordinates
takes the form

(6) ▲   $ds^2 = g_{km}\, d\theta^k d\theta^m$.

This, of course, stands for

(7)     $ds^2 = g_{11} (d\theta^1)^2 + g_{12}\, d\theta^1 d\theta^2 + g_{13}\, d\theta^1 d\theta^3$
        $\quad + g_{21}\, d\theta^1 d\theta^2 + g_{22} (d\theta^2)^2 + g_{23}\, d\theta^2 d\theta^3$
        $\quad + g_{31}\, d\theta^1 d\theta^3 + g_{32}\, d\theta^2 d\theta^3 + g_{33} (d\theta^3)^2$.

It is apparent from (5) that

(8) ▲   $g_{km} = g_{mk}$ for each k and m,

so the functions $g_{km}$ are symmetric in k and m. The functions $g_{km}$ are
called the components of the Euclidean metric tensor in the coordinate
system $\theta_1, \theta_2, \theta_3$.
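
Let $\bar{\theta}_1, \bar{\theta}_2, \bar{\theta}_3$ be another general coordinate system. Let

(9)     $\bar{\theta}_i = \bar{\theta}_i(\theta_1, \theta_2, \theta_3)$,

be the transformation of coordinates from $\theta_1, \theta_2, \theta_3$ to
$\bar{\theta}_1, \bar{\theta}_2, \bar{\theta}_3$. Now

(10)    $d\theta^k = \dfrac{\partial \theta_k}{\partial \bar{\theta}_l}\, d\bar{\theta}^l$.

Hence, from Eq. (6),

(11)    $ds^2 = g_{km} \dfrac{\partial \theta_k}{\partial \bar{\theta}_l} \dfrac{\partial \theta_m}{\partial \bar{\theta}_n}\, d\bar{\theta}^l d\bar{\theta}^n$.

If we define

(12)    $\bar{g}_{ln}(\bar{\theta}_1, \bar{\theta}_2, \bar{\theta}_3) = g_{km}(\theta_1, \theta_2, \theta_3) \dfrac{\partial \theta_k}{\partial \bar{\theta}_l} \dfrac{\partial \theta_m}{\partial \bar{\theta}_n}$,

then Eq. (11) assumes a form which is the same as Eq. (3) or (6):

(13)    $ds^2 = \bar{g}_{ln}\, d\bar{\theta}^l d\bar{\theta}^n$.

Accordingly, we call $\bar{g}_{ln}$ the components of the Euclidean metric tensor in
the coordinate system $\bar{\theta}_1, \bar{\theta}_2, \bar{\theta}_3$.

The quadratic differential forms (3), (6), and (13) are of fundamental
importance since they define the length of any line element in general
coordinate systems. We conclude that if $\theta_1, \theta_2, \theta_3$ and
$\bar{\theta}_1, \bar{\theta}_2, \bar{\theta}_3$ are two sets of general coordinates, then the
Euclidean metric tensors $g_{km}(\theta_1, \theta_2, \theta_3)$ and
$\bar{g}_{km}(\bar{\theta}_1, \bar{\theta}_2, \bar{\theta}_3)$ are related by means of the law of
transformation (12).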

The law of transformation of the components of a quantity with respect 
to coordinate transformation is an important property of that quantity. In 
the following section, we shall see that a quantity shall be called a tensor 
if and only if it follows certain specific laws of transformation. 

All the results above apply as well to the plane (a two-dimensional 
Euclidean space), as can be easily verified by changing the range of all 
indices to 1, 2. 
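
The definition $g_{km} = (\partial x_i / \partial \theta_k)(\partial x_i / \partial \theta_m)$ can be evaluated numerically for any coordinate system whose inverse transformation $x_i(\theta)$ is known. The sketch below assumes plane polar coordinates as the example and estimates the derivatives by central differences; the function names and the step size `h` are illustrative, not part of the text.

```python
import math

def cartesian(theta):
    """Assumed example: plane polar (theta_1, theta_2) -> Cartesian (x_1, x_2)."""
    r, phi = theta
    return [r * math.cos(phi), r * math.sin(phi)]

def metric(x_of_theta, theta, h=1e-6):
    """g[k][m] = sum over i of (d x_i / d theta_k) (d x_i / d theta_m)."""
    n = len(theta)
    dx = []                       # dx[k][i] approximates d x_i / d theta_k
    for k in range(n):
        tp, tm = list(theta), list(theta)
        tp[k] += h
        tm[k] -= h
        xp, xm = x_of_theta(tp), x_of_theta(tm)
        dx.append([(a - b) / (2.0 * h) for a, b in zip(xp, xm)])
    return [[sum(dx[k][i] * dx[m][i] for i in range(len(dx[k])))
             for m in range(n)] for k in range(n)]

g = metric(cartesian, [2.0, 0.5])
for row in g:
    print(row)
```

At $\theta_1 = 2$ the result is close to diag(1, 4), i.e., $g_{11} = 1$, $g_{22} = (\theta_1)^2$, and $g_{12} = g_{21} = 0$.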


PROBLEMS 

2.3. Find the components of the Euclidean metric tensor in plane polar
coordinates ($\theta_1 = r$, $\theta_2 = \theta$; see Fig. P2.3) and the corresponding
expression for the length of a line element.

Ans. Let $x_1, x_2$ be a set of rectangular Cartesian coordinates.
Then

$$x_1 = \theta_1 \cos \theta_2 , \qquad \theta_1 = \sqrt{(x_1)^2 + (x_2)^2} ,$$

$$x_2 = \theta_1 \sin \theta_2 , \qquad \theta_2 = \sin^{-1} \frac{x_2}{\sqrt{(x_1)^2 + (x_2)^2}} ,$$

[Figure P2.3]

$$g_{11} = \cos^2 \theta_2 + \sin^2 \theta_2 = 1 ,$$

$$g_{12} = (\cos \theta_2)(-\theta_1 \sin \theta_2) + (\sin \theta_2)(\theta_1 \cos \theta_2) = 0 = g_{21} ,$$

$$g_{22} = (-\theta_1 \sin \theta_2)^2 + (\theta_1 \cos \theta_2)^2 = (\theta_1)^2 .$$

The line element is

$$ds^2 = (d\theta_1)^2 + (\theta_1)^2 (d\theta_2)^2 .$$


2.4. Let $x_1, x_2, x_3$ be rectangular Cartesian coordinates and
$\theta_1, \theta_2, \theta_3$ be spherical polar coordinates. (See Fig. P2.4.) Then

$$\theta_1 = \sqrt{(x_1)^2 + (x_2)^2 + (x_3)^2} ,$$

$$\theta_2 = \cos^{-1} \frac{x_3}{\sqrt{(x_1)^2 + (x_2)^2 + (x_3)^2}} ,$$

$$\theta_3 = \tan^{-1} \frac{x_2}{x_1} ,$$

and the inverse transformation is

$$x_1 = \theta_1 \sin \theta_2 \cos \theta_3 ,$$
$$x_2 = \theta_1 \sin \theta_2 \sin \theta_3 ,$$
$$x_3 = \theta_1 \cos \theta_2 .$$

Show that the components of the Euclidean metric tensor in the spherical polar
coordinates are

$$g_{11} = 1 , \quad g_{22} = (\theta_1)^2 , \quad g_{33} = (\theta_1)^2 (\sin \theta_2)^2 ,$$

and all other $g_{ij} = 0$. The square of the line element is, therefore,

$$ds^2 = (d\theta_1)^2 + (\theta_1)^2 (d\theta_2)^2 + (\theta_1)^2 (\sin \theta_2)^2 (d\theta_3)^2 .$$

2.5. Show that the length of the line element $(d\theta^1, 0, 0)$ is
$\sqrt{g_{11}}\, |d\theta^1|$; that of the line element $(0, d\theta^2, 0)$ is
$\sqrt{g_{22}}\, |d\theta^2|$.

2.6. Let the angle between the line elements $(d\theta^1, 0, 0)$ and $(0, d\theta^2, 0)$ be
denoted by $\alpha_{12}$. Show that

$$\cos \alpha_{12} = \frac{g_{12}}{\sqrt{g_{11}\, g_{22}}} .$$

Hint: Find the components of the line elements with respect to a system of
rectangular Cartesian coordinates in which we know how to compute the angle
between two vectors.

2.4. SCALARS, CONTRAVARIANT VECTORS, COVARIANT
VECTORS†

In nonrelativistic physics there are quantities like mass and length which
are independent of reference coordinates, and there are quantities like
displacement and velocity whose components do depend on reference
coordinates. The former are the scalars; the latter, vectors. Mathematically,
we define them according to the way their components change under
admissible transformations. In the following discussions we consider a system
whose components are defined in the general set of variables $\theta^i$ and are
functions of $\theta^1, \theta^2, \theta^3$. If the variables $\theta^i$ can be changed to
$\bar{\theta}^i$ by an admissible and proper transformation, then we can define new
components of the system in the new variables $\bar{\theta}^i$. The system will be given
various names according to the way in which the new and the old components are
related.

A system is called a scalar if it has only a single component $\phi$ in the
variables $\theta^i$ and a single component $\bar{\phi}$ in the variables $\bar{\theta}^i$, and if
$\phi$ and $\bar{\phi}$ are numerically equal at the corresponding points,

(1)     $\phi(\theta^1, \theta^2, \theta^3) = \bar{\phi}(\bar{\theta}^1, \bar{\theta}^2, \bar{\theta}^3)$.

A system is called a contravariant vector field, or a contravariant tensor
field of rank one, if it has three components $\xi^i$ in the variables $\theta^i$ and
three components $\bar{\xi}^i$ in the variables $\bar{\theta}^i$, and if the components are
related by the law

(2) ▲   $\bar{\xi}^i(\bar{\theta}^1, \bar{\theta}^2, \bar{\theta}^3) = \xi^k(\theta^1, \theta^2, \theta^3)\, \dfrac{\partial \bar{\theta}^i}{\partial \theta^k}$.

†One may pass over the rest of this chapter and proceed directly to Chapter 3 on first
reading.

A contravariant vector is indicated by a superscript, called a contravariant
index.

A differential $d\theta^i$ is a prototype of a contravariant vector:

(3)     $d\bar{\theta}^i = \dfrac{\partial \bar{\theta}^i}{\partial \theta^k}\, d\theta^k$.

Hence, a vector is contravariant if it transforms like a differential.‡

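A system is called a covariant vector field, or a covariant tensor field of
rank one, if it has three components $\eta_i$ in the variables $\theta^1, \theta^2, \theta^3$ and
three components $\bar{\eta}_i$ in the variables $\bar{\theta}^1, \bar{\theta}^2, \bar{\theta}^3$, and if the
components in these two coordinate systems are related by the law

(4) ▲   $\bar{\eta}_i(\bar{\theta}^1, \bar{\theta}^2, \bar{\theta}^3) = \eta_k(\theta^1, \theta^2, \theta^3)\, \dfrac{\partial \theta^k}{\partial \bar{\theta}^i}$.

A covariant vector is indicated by a subscript, called a covariant index.

A gradient of a scalar potential $\phi$ transforms like (4),

(5)     $\dfrac{\partial \bar{\phi}}{\partial \bar{\theta}^i} = \dfrac{\partial \phi}{\partial \theta^k} \dfrac{\partial \theta^k}{\partial \bar{\theta}^i}$.

Hence, a vector is covariant if it transforms like a gradient of a scalar
potential.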
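
The two transformation laws can be compared side by side in a small numerical experiment. The sketch below assumes, for illustration only, that the old variables are plane Cartesian (x, y) and the new ones are plane polar (r, phi); the sample components and all names are hypothetical. It transforms one set of components by the contravariant law (2) and another by the covariant law (4), and then compares the sum $\eta_k \xi^k$ in the two systems.

```python
import math

# Point at which the components are evaluated (sample values).
x, y = 1.0, 2.0
r = math.hypot(x, y)
phi = math.atan2(y, x)

# d(theta_bar^i)/d(theta^k): rows i = (r, phi), columns k = (x, y).
dbar_dold = [[x / r, y / r],
             [-y / r**2, x / r**2]]

# d(theta^k)/d(theta_bar^i): rows k = (x, y), columns i = (r, phi).
dold_dbar = [[math.cos(phi), -r * math.sin(phi)],
             [math.sin(phi), r * math.cos(phi)]]

xi = [3.0, -1.0]     # contravariant components xi^k (sample values)
eta = [0.5, 2.0]     # covariant components eta_k (sample values)

# Eq. (2): xi_bar^i = xi^k d(theta_bar^i)/d(theta^k)
xi_bar = [sum(xi[k] * dbar_dold[i][k] for k in range(2)) for i in range(2)]

# Eq. (4): eta_bar_i = eta_k d(theta^k)/d(theta_bar^i)
eta_bar = [sum(eta[k] * dold_dbar[k][i] for k in range(2)) for i in range(2)]

inv_old = sum(eta[k] * xi[k] for k in range(2))
inv_new = sum(eta_bar[i] * xi_bar[i] for i in range(2))

print(inv_old, inv_new)
```

The two sums agree to rounding error, which reflects the fact that the product of a covariant and a contravariant vector, summed over the common index, is a scalar.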


2.5. TENSOR FIELDS OF HIGHER RANK 

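A system such as the three-dimensional Euclidean metric tensor $g_{ij}$ has
nine components when i and j range over 1, 2, 3. Such quantities are given
special names when their components in any two coordinate systems are
related by specific transformation laws.

Covariant tensor field of rank two, $t_{ij}$:

(1)     $\bar{t}_{ij}(\bar{\theta}^1, \bar{\theta}^2, \bar{\theta}^3) = t_{mn}(\theta^1, \theta^2, \theta^3)\, \dfrac{\partial \theta^m}{\partial \bar{\theta}^i} \dfrac{\partial \theta^n}{\partial \bar{\theta}^j}$.

Contravariant tensor field of rank two, $t^{ij}$:

(2)     $\bar{t}^{ij}(\bar{\theta}^1, \bar{\theta}^2, \bar{\theta}^3) = t^{mn}(\theta^1, \theta^2, \theta^3)\, \dfrac{\partial \bar{\theta}^i}{\partial \theta^m} \dfrac{\partial \bar{\theta}^j}{\partial \theta^n}$.

‡The variables $\theta^i$ and $\bar{\theta}^i$, in general, are not so related [see Eq. (2.3:9)]. Thus, although
the differential $d\theta^i$ is a contravariant vector, the set of variables $\theta^i$ itself does not
transform like a vector. Hence, in this instance, the position of the index of $\theta^i$ must be
regarded as without significance.

Mixed tensor field of rank two, $t^i_j$:

(3)     $\bar{t}^i_j(\bar{\theta}^1, \bar{\theta}^2, \bar{\theta}^3) = t^m_n(\theta^1, \theta^2, \theta^3)\, \dfrac{\partial \bar{\theta}^i}{\partial \theta^m} \dfrac{\partial \theta^n}{\partial \bar{\theta}^j}$.

Generalization to tensor fields of higher ranks is immediate. Thus, we call
a quantity $t^{\alpha_1 \cdots \alpha_p}_{\beta_1 \cdots \beta_q}$ a tensor field of rank r = p + q,
contravariant of rank p and covariant of rank q, if the components in any two
coordinate systems are related by

(4)     $\bar{t}^{\alpha_1 \cdots \alpha_p}_{\beta_1 \cdots \beta_q} = \dfrac{\partial \bar{\theta}^{\alpha_1}}{\partial \theta^{k_1}} \cdots \dfrac{\partial \bar{\theta}^{\alpha_p}}{\partial \theta^{k_p}} \dfrac{\partial \theta^{m_1}}{\partial \bar{\theta}^{\beta_1}} \cdots \dfrac{\partial \theta^{m_q}}{\partial \bar{\theta}^{\beta_q}}\; t^{k_1 \cdots k_p}_{m_1 \cdots m_q}$.

Thus, the location of an index is important in telling whether it is
contravariant or covariant. Again, if only rectangular Cartesian coordinates
are considered, the distinction disappears.

These definitions can be generalized in an obvious manner if the range
of the indices is 1, 2, ..., n.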

PROBLEMS 

2.7. Show that, if all components of a tensor vanish in one coordinate 
system, then they vanish in all other coordinate systems which are in one-to-one 
correspondence with the given system. 

This is perhaps the most important property of tensor fields. 

2.8. Prove the theorem: The sum or difference of two tensors of the same
type and rank (with the same number of covariant and the same number of
contravariant indices) is again a tensor of the same type and rank.

Thus, any linear combination of tensors of the same type and rank is again a
tensor of the same type and rank.

2.9. Theorem. Let $A^{\alpha_1 \cdots \alpha_r}_{\beta_1 \cdots \beta_s}$, $B^{\alpha_1 \cdots \alpha_r}_{\beta_1 \cdots \beta_s}$
be tensors. The equation

$$A^{\alpha_1 \cdots \alpha_r}_{\beta_1 \cdots \beta_s}(\theta^1, \theta^2, \ldots, \theta^n) = B^{\alpha_1 \cdots \alpha_r}_{\beta_1 \cdots \beta_s}(\theta^1, \theta^2, \ldots, \theta^n),$$

is a tensor equation; i.e., if this equation is true in some coordinate system, then
it is true in all coordinate systems which are in one-to-one correspondence with
each other.

Hint: Use the results of the previous problems. 

2.6. SOME IMPORTANT SPECIAL TENSORS 

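If we define the Kronecker delta and the permutation symbol introduced
in Sec. 2.1 as components of covariant, contravariant, and mixed tensors of
ranks 2 and 3 in rectangular Cartesian coordinates $x_1, x_2, x_3$,

(1)     $\delta_{ij} = \delta^{ij} = \delta^i_j = \begin{cases} 0 & \text{when } i \neq j , \\ 1 & \text{when } i = j,\ j \text{ not summed}, \end{cases}$

(2)     $e_{ijk} = e^{ijk} = \begin{cases} 0 & \text{when any two indices are equal}, \\ 1 & \text{when } i, j, k \text{ permute like } 1, 2, 3 , \\ -1 & \text{when } i, j, k \text{ permute like } 1, 3, 2 , \end{cases}$

what will be their components in general coordinates $\theta^i$? The answer is
provided immediately by the tensor transformation laws. Thus,

(3)     $g_{ij} = \dfrac{\partial x_m}{\partial \theta^i} \dfrac{\partial x_n}{\partial \theta^j}\, \delta_{mn} = \dfrac{\partial x_m}{\partial \theta^i} \dfrac{\partial x_m}{\partial \theta^j}$,

(4)     $g^{ij} = \dfrac{\partial \theta^i}{\partial x_m} \dfrac{\partial \theta^j}{\partial x_n}\, \delta^{mn} = \dfrac{\partial \theta^i}{\partial x_m} \dfrac{\partial \theta^j}{\partial x_m}$,

(5)     $g^i_j = \dfrac{\partial \theta^i}{\partial x_m} \dfrac{\partial x_n}{\partial \theta^j}\, \delta^m_n = \dfrac{\partial \theta^i}{\partial x_m} \dfrac{\partial x_m}{\partial \theta^j} = \delta^i_j$,

(6)     $\epsilon_{ijk} = \dfrac{\partial x_r}{\partial \theta^i} \dfrac{\partial x_s}{\partial \theta^j} \dfrac{\partial x_t}{\partial \theta^k}\, e_{rst} = e_{ijk} \left| \dfrac{\partial x_m}{\partial \theta^n} \right| = e_{ijk} \sqrt{g}$,

(7)     $\epsilon^{ijk} = \dfrac{\partial \theta^i}{\partial x_r} \dfrac{\partial \theta^j}{\partial x_s} \dfrac{\partial \theta^k}{\partial x_t}\, e^{rst} = e^{ijk} \left| \dfrac{\partial \theta^m}{\partial x_n} \right| = \dfrac{e^{ijk}}{\sqrt{g}}$,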

where g is the value of the determinant $|g_{ij}|$ and is positive for any proper
coordinate system, i.e.,

(8)     $g = |g_{ij}| > 0$.

We see that the proper generalizations of the Kronecker delta are the
Euclidean metric tensors, and those of the permutation symbol are $\epsilon_{ijk}$ and
$\epsilon^{ijk}$, which are called permutation tensors or alternators. As defined by
Eqs. (6) and (7), the components of $\epsilon_{ijk}$ and $\epsilon^{ijk}$ do not have values 1, -1,
or 0 in general coordinates; they are $(\sqrt{g}, -\sqrt{g}, 0)$ and
$(1/\sqrt{g}, -1/\sqrt{g}, 0)$, respectively.

Note that the mixed tensor $g^i_j$ is identical with $\delta^i_j$ and is constant in all
coordinate systems. From Eqs. (3) and (4) we see that

(9)     $g_{im}\, g^{mj} = \delta_i^j$.

Hence, the determinant

$$|g_{im}\, g^{mj}| = |g_{ij}| \cdot |g^{ij}| = |\delta_i^j| = 1 .$$

Using (3) and (4), we can write

(10)    $g = |g_{ij}| = \left| \dfrac{\partial x_i}{\partial \theta^j} \right|^2 , \qquad \dfrac{1}{g} = |g^{ij}| = \left| \dfrac{\partial \theta^i}{\partial x_j} \right|^2$.

Since g is positive, Eq. (9) may be solved to give

(11)    $g^{ij} = \dfrac{D^{ij}}{g}$,

where $D^{ij}$ is the cofactor of the term $g_{ij}$ in the determinant g. The tensor
$g^{ij}$ is called the associated metric tensor. It is as important as the metric
tensor itself in the further development of tensor analysis.
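
Equation (11) is a recipe that can be followed mechanically. The sketch below applies it to an assumed sample metric (symmetric, with positive determinant, and with a nonzero off-diagonal term such as would arise in a nonorthogonal coordinate system) and then checks Eq. (9); the numerical values and function names are illustrative only.

```python
def det3(m):
    """Determinant of a 3 x 3 matrix by expansion along the first row."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

def cofactor(m, i, j):
    """Signed 2 x 2 minor obtained by deleting row i and column j."""
    rows = [r for r in range(3) if r != i]
    cols = [c for c in range(3) if c != j]
    minor = (m[rows[0]][cols[0]] * m[rows[1]][cols[1]]
             - m[rows[0]][cols[1]] * m[rows[1]][cols[0]])
    return (-1) ** (i + j) * minor

def associated_metric(g):
    """g^{ij} = D^{ij} / g, with D^{ij} the cofactor of g_ij (g symmetric)."""
    d = det3(g)
    return [[cofactor(g, i, j) / d for j in range(3)] for i in range(3)]

# Assumed sample metric at one point.
g = [[2.0, 1.0, 0.0],
     [1.0, 2.0, 0.0],
     [0.0, 0.0, 1.0]]

g_up = associated_metric(g)

# Check Eq. (9): g_im g^{mj} should reproduce the Kronecker delta.
check = [[sum(g[i][m] * g_up[m][j] for m in range(3)) for j in range(3)]
         for i in range(3)]
for row in check:
    print(row)
```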


PROBLEM 

2.10. Prove Eqs. (6), (7) and (10). Write down explicitly $D^{ij}$ in Eq. (11) in
terms of $g_{11}, g_{12}, \ldots, g_{33}$.


2.7. THE SIGNIFICANCE OF TENSOR CHARACTERISTICS 

The importance of tensor analysis may be summarized in the following 
statement. The form of an equation can have general validity with respect 
to any frame of reference only if every term in the equation has the same 
tensor characteristics. If this condition is not satisfied, a simple change 
of the system of reference will destroy the form of the relationship. This 
form is, therefore, only fortuitous and accidental. 

Thus, tensor analysis is as important as dimensional analysis in any 
formulation of physical relations. Dimensional analysis considers how a 
physical quantity changes with the particular choice of fundamental units. 
Two physical quantities cannot be equal unless they have the same 
dimensions; any physical equation cannot be correct unless it is invariant 
with respect to change of fundamental units. 

Whether a physical quantity should be a tensor or not is a decision for 
the physicist to make. Why is a force a vector, a stress tensor a tensor? 
Because we say so! It is our judgement that endowing tensorial character 
to these quantities is in harmony with the world. 

Once we have decided upon the tensorial character of a physical quantity, we
may take as the components of a tensor field in a given frame of reference
any set of functions of the requisite number. A tensor field thus assigned
in a given frame of reference then transforms according to the tensor
transformation law when admissible transformations are considered. In other
words, once the values of the components of a tensor are assigned in one
particular coordinate system, the values of the components in any general
coordinate system are fixed.

Why are the tensor transformation laws in harmony with physics? Because
tensor analysis is designed so. For example, a tensor of rank one is
defined in accordance with the physical idea of a vector. The only point
new to the student is perhaps the distinction between contravariance and
covariance. In elementary physics, natural laws are studied usually only
in rectangular Cartesian coordinates of reference, in which the distinction
between the contravariance and covariance disappears. When curvilinear
coordinates are used in elementary physics, the vectorial components must
be defined specifically in each particular case, and mathematical
expressions of physical laws must be derived anew for each particular coordinate
system. These derivations are usually quite tedious. Now, what is achieved
by the definition of a tensor is a unified treatment, good for any curvilinear
coordinates, orthogonal or nonorthogonal. This simplicity is obtained,
however, at the expense of recognizing the distinction between contravariance
and covariance.

In Sec. 2.14 we discuss the geometric interpretation of the tensor
components of a vector in curvilinear coordinates. It will become clear that each
physical vector has two tensor images: one contravariant and one covariant,
depending on how the components are resolved.

2.8. RECTANGULAR CARTESIAN TENSORS 

We have seen that it is necessary to distinguish the contravariant and
covariant tensor transformation laws. However, if only transformations
between rectangular Cartesian coordinate systems are considered, the
distinction between contravariance and covariance disappears. To show this,
let $x_1, x_2, x_3$ and $\bar{x}_1, \bar{x}_2, \bar{x}_3$ be two sets of rectangular Cartesian
coordinates of reference. The transformation law must be simply

(1)     $\bar{x}_i = \beta_{ij} x_j + a_i$,

where $a_1, a_2, a_3$ are constants and $\beta_{ij}$ are the direction cosines of the angles
between unit vectors along the coordinate axes $\bar{x}_i$ and $x_j$. Thus,

(2)     $\beta_{12} = \cos(\bar{x}_1, x_2)$,

etc. The inverse transform is

(3)     $x_i = \cos(x_i, \bar{x}_j)\, \bar{x}_j + b_i = \cos(\bar{x}_j, x_i)\, \bar{x}_j + b_i = \beta_{ji} \bar{x}_j + b_i$,

where

$$b_i = -\beta_{ki} a_k .$$

Hence,

(4)     $\dfrac{\partial \bar{x}_k}{\partial x_i} = \beta_{ki} = \dfrac{\partial x_i}{\partial \bar{x}_k}$,

and the distinction between the transformation laws (2.4:2) and (2.4:4)
disappears.

The components of a vector in the two coordinate systems are related by

$$\bar{\eta}_i(\bar{x}_1, \bar{x}_2, \bar{x}_3) = \beta_{ij}\, \eta_j(x_1, x_2, x_3) .$$

The components of a tensor of rank two are related by

$$\bar{t}_{ij}(\bar{x}_1, \bar{x}_2, \bar{x}_3) = \beta_{ik} \beta_{js}\, t_{ks}(x_1, x_2, x_3) .$$

When only rectangular Cartesian coordinates are considered, we shall
write all indices as subscripts. This convenient practice will be followed
throughout this book.
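
A rotation of axes gives a concrete set of direction cosines on which these relations can be tried. The sketch below assumes a rotation of the axes through 0.3 rad about the $x_3$-axis and a sample symmetric tensor; the angle, the tensor components, and the function names are illustrative assumptions. It checks that $\beta_{ik} \beta_{jk} = \delta_{ij}$ and that applying the rank-two law with the transposed direction cosines recovers the original components.

```python
import math

c, s = math.cos(0.3), math.sin(0.3)

# beta[i][j] = cos(xbar_i, x_j) for axes rotated by 0.3 rad about x_3.
beta = [[c, s, 0.0],
        [-s, c, 0.0],
        [0.0, 0.0, 1.0]]

def transform2(b, t):
    """Rank-two Cartesian law: tbar_ij = b_ik b_js t_ks."""
    return [[sum(b[i][k] * b[j][m] * t[k][m]
                 for k in range(3) for m in range(3))
             for j in range(3)] for i in range(3)]

# Orthogonality of the direction cosines: beta_ik beta_jk = delta_ij.
orth = [[sum(beta[i][k] * beta[j][k] for k in range(3)) for j in range(3)]
        for i in range(3)]

t = [[1.0, 2.0, 0.0],
     [2.0, 3.0, 0.0],
     [0.0, 0.0, 4.0]]          # sample tensor components t_ks

t_bar = transform2(beta, t)

# The inverse transformation uses beta_ji, i.e., the transposed array.
beta_T = [[beta[j][i] for j in range(3)] for i in range(3)]
t_back = transform2(beta_T, t_bar)

print(orth)
print(t_back)
```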


2.9. CONTRACTION 


We shall now consider some operations on tensors that generate new 
tensors. 

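Let $A^i_{jkl}$ be a mixed tensor so that, in a transformation from the
coordinates $x^\alpha$ to $\bar{x}^\alpha$ ($\alpha = 1, 2, \ldots, n$), we obtain

$$\bar{A}^i_{jkl}(\bar{x}) = \frac{\partial \bar{x}^i}{\partial x^\alpha} \frac{\partial x^\beta}{\partial \bar{x}^j} \frac{\partial x^\gamma}{\partial \bar{x}^k} \frac{\partial x^\delta}{\partial \bar{x}^l}\, A^\alpha_{\beta\gamma\delta}(x) .$$

If we equate the indices i and k and sum, we obtain the set of quantities

$$\bar{A}^i_{jil}(\bar{x}) = \frac{\partial \bar{x}^i}{\partial x^\alpha} \frac{\partial x^\beta}{\partial \bar{x}^j} \frac{\partial x^\gamma}{\partial \bar{x}^i} \frac{\partial x^\delta}{\partial \bar{x}^l}\, A^\alpha_{\beta\gamma\delta}(x)
= \frac{\partial x^\beta}{\partial \bar{x}^j} \frac{\partial x^\delta}{\partial \bar{x}^l}\, \delta^\gamma_\alpha\, A^\alpha_{\beta\gamma\delta}(x)
= \frac{\partial x^\beta}{\partial \bar{x}^j} \frac{\partial x^\delta}{\partial \bar{x}^l}\, A^\alpha_{\beta\alpha\delta}(x) .$$

Let us write $A^\alpha_{\beta\alpha\delta}$ as $B_{\beta\delta}$. Then the equation above shows that

$$\bar{B}_{jl} = \frac{\partial x^\beta}{\partial \bar{x}^j} \frac{\partial x^\delta}{\partial \bar{x}^l}\, B_{\beta\delta} .$$

Hence, $B_{\beta\delta}$ satisfies the tensor transformation law and is therefore a tensor.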

The process of equating and summing a covariant and a contravariant 
index of a mixed tensor is called a contraction. It is easy to see that the 
example above can be generalized to mixed tensors of other ranks. The 
result of a contraction is another tensor. If, as a result of contraction, 
there is no free index left, the resulting quantity is a scalar. 

The following problem shows that, in general, equating and summing 
two covariant indices or two contravariant indices does not yield a tensor of 
lower order and is not a proper contraction. However, when only Cartesian 
coordinates are considered, we write all indices as subscripts and contract 
them by equating two subscripts and summing. 
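
The difference between a proper contraction and the mere summing of two indices of the same kind shows up already for a linear, non-orthogonal change of variables. The sketch below assumes the transformation $\bar{x}^i = M_{ia} x^a$ with a constant matrix M, so that the partial derivatives in the transformation laws are the entries of M and of its inverse; the matrices and the sample components are illustrative assumptions.

```python
def mat_mul(p, q):
    """Matrix product of two square arrays given as nested lists."""
    n = len(p)
    return [[sum(p[i][k] * q[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def transpose(p):
    return [list(row) for row in zip(*p)]

def trace(p):
    """Sum of the components with equal indices."""
    return sum(p[i][i] for i in range(len(p)))

M = [[2.0, 1.0], [0.0, 1.0]]         # M[i][a]     = d xbar^i / d x^a
M_inv = [[0.5, -0.5], [0.0, 1.0]]    # M_inv[b][j] = d x^b / d xbar^j
A = [[1.0, 2.0], [3.0, 4.0]]         # sample components in the x system

# Read A as a mixed tensor: Abar^i_j = M[i][a] A^a_b M_inv[b][j].
A_mixed_bar = mat_mul(mat_mul(M, A), M_inv)

# Read A as a contravariant tensor: Abar^{ij} = M[i][a] M[j][b] A^{ab}.
A_contra_bar = mat_mul(mat_mul(M, A), transpose(M))

print(trace(A), trace(A_mixed_bar), trace(A_contra_bar))
```

The sum over equal indices keeps its value when the components transform as a mixed tensor, but changes when they transform as a contravariant tensor.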


PROBLEM 

2.11. If $A^i_j$ is a mixed tensor of rank two, show that $A^i_i$ is a scalar. If $A^{ij}$ is
a contravariant tensor of rank two, show that in general $A^{ii}$ is not an invariant.
Similarly, if $A_{ij}$ is a covariant tensor of rank two, $A_{ii}$ is not, in general, an
invariant.


2.10. QUOTIENT RULE 

Consider a set of $n^3$ functions A(111), A(112), A(123), etc., or A(i, j, k)
for short, with the indices i, j, k each ranging over 1, 2, ..., n. Although the
set of functions A(i, j, k) has the right number of components, we do not
know whether it is a tensor or not. Now, suppose that we know something
about the nature of the product of A(i, j, k) with an arbitrary tensor. Then
there is a theorem which enables us to establish whether A(i, j, k) is a
tensor without going to the trouble of determining the law of transformation
directly.

For example, let $\xi^\alpha(x)$ be an arbitrary tensor of rank 1 (a vector). Let
us suppose that the product $A(\alpha, j, k)\, \xi^\alpha$ (summation convention used over
$\alpha$) is known to yield a tensor of the type $A^j_k(x)$,

$$A(\alpha, j, k)\, \xi^\alpha = A^j_k .$$

Then we can prove that A(i, j, k) is a tensor of the type $A^j_{ik}(x)$.

The proof is very simple. Since $A(\alpha, j, k)\, \xi^\alpha$ is of type $A^j_k$, it is
transformed into $\bar{x}$-coordinates as

$$\bar{A}(\alpha, j, k)\, \bar{\xi}^\alpha = \bar{A}^j_k = \frac{\partial \bar{x}^j}{\partial x^r} \frac{\partial x^s}{\partial \bar{x}^k}\, A^r_s = \frac{\partial \bar{x}^j}{\partial x^r} \frac{\partial x^s}{\partial \bar{x}^k}\, A(\beta, r, s)\, \xi^\beta .$$

But $\xi^\beta = (\partial x^\beta / \partial \bar{x}^\alpha)\, \bar{\xi}^\alpha$. Inserting this in the right-hand side of
the equation above and transposing all terms to one side of the equation, we obtain

$$\left[ \bar{A}(\alpha, j, k) - \frac{\partial \bar{x}^j}{\partial x^r} \frac{\partial x^s}{\partial \bar{x}^k} \frac{\partial x^\beta}{\partial \bar{x}^\alpha}\, A(\beta, r, s) \right] \bar{\xi}^\alpha = 0 .$$

Now $\bar{\xi}^\alpha$ is an arbitrary vector. Hence, the bracket must vanish and we have

$$\bar{A}(\alpha, j, k) = \frac{\partial \bar{x}^j}{\partial x^r} \frac{\partial x^s}{\partial \bar{x}^k} \frac{\partial x^\beta}{\partial \bar{x}^\alpha}\, A(\beta, r, s) ,$$

which is precisely the law of transformation of the tensor of the type $A^j_{ik}$.

The pattern of the example above can be generalized to prove the theorem
that, if $[A(i_1, i_2, \ldots, i_r)]$ is a set of functions of the variables $x^i$, and if
the product $A(\alpha, i_2, \ldots, i_r)\, \xi^\alpha$ with an arbitrary vector $\xi^\alpha$ be a tensor of the
type $A^{j_1 \cdots j_p}_{k_2 \cdots k_q}(x)$, where p + q = r, then the set
$A(i_1, i_2, \ldots, i_r)$ represents a tensor of the type $A^{j_1 \cdots j_p}_{k_1 k_2 \cdots k_q}(x)$.

Similarly, if the product of a set of $n^2$ functions $A(\alpha, j)$ with an arbitrary
tensor $B_{\alpha k}$ (and is summed over $\alpha$) is a covariant tensor of rank 2, then
A(i, j) represents a tensor of the type $A^i_j$.

These and similar theorems that can be derived are called quotient rules.
Numerous applications of these rules follow. See, for example, Sec. 3.3,
following Eq. (3.3:3); Sec. 3.12, following Eq. (3.12:11); Sec. 4.1, following
Eqs. (4.1:11) and (4.1:12).


2.11. PARTIAL DERIVATIVES IN CARTESIAN 
COORDINATES 

The generation of a tensor of rank one from a tensor of rank zero by
differentiation, such as the gradient of a scalar potential, indicates a way
of generating tensors of higher rank. But, in general, the set of partial
derivatives of a tensor does not behave like a tensor field. However, if
only Cartesian coordinates are considered, then the partial derivatives of
any tensor field behave like the components of a tensor field under a
transformation from Cartesian coordinates to Cartesian coordinates. To show
this, let us consider two Cartesian coordinates $(x_1, x_2, x_3)$ and
$(\bar{x}_1, \bar{x}_2, \bar{x}_3)$ related by

(1)     $\bar{x}_i = a_{ij} x_j + b_i$,

where $a_{ij}$ and $b_i$ are constants. From Eq. (1), we have

(2)     $\dfrac{\partial \bar{x}_i}{\partial x_j} = a_{ij}$,

(3)     $\dfrac{\partial^2 \bar{x}_i}{\partial x_j\, \partial x_k} = 0$.

Now, if $\xi^i(x_1, x_2, x_3)$ is a contravariant tensor, so that

(4)     $\bar{\xi}^i(\bar{x}_1, \bar{x}_2, \bar{x}_3) = \xi^\alpha(x_1, x_2, x_3)\, \dfrac{\partial \bar{x}_i}{\partial x_\alpha}$,

then, on differentiating both sides of the equation, one obtains

(5)     $\dfrac{\partial \bar{\xi}^i}{\partial \bar{x}_j} = \left( \dfrac{\partial \xi^\alpha}{\partial x_\beta} \dfrac{\partial x_\beta}{\partial \bar{x}_j} \right) \dfrac{\partial \bar{x}_i}{\partial x_\alpha} + \xi^\alpha \left( \dfrac{\partial^2 \bar{x}_i}{\partial x_\alpha\, \partial x_\beta} \right) \dfrac{\partial x_\beta}{\partial \bar{x}_j}$.

When $x_i$ and $\bar{x}_i$ are Cartesian coordinates, the last term vanishes according
to Eq. (3). Hence,

$$\frac{\partial \bar{\xi}^i}{\partial \bar{x}_j} = \frac{\partial \xi^\alpha}{\partial x_\beta} \frac{\partial x_\beta}{\partial \bar{x}_j} \frac{\partial \bar{x}_i}{\partial x_\alpha} .$$

Thus, the set of partial derivatives $\partial \xi^\alpha / \partial x_\beta$ follows the transformation
law for a mixed tensor of rank two under a transformation from Cartesian
coordinates to Cartesian coordinates. However, the presence of the second
derivative terms in Eq. (5), which do not vanish in curvilinear coordinates,
shows that $\partial \xi^\alpha / \partial x_\beta$ are not really the components of a tensor field in
general coordinates. A similar situation holds obviously also for tensor
fields of higher ranks. See Sec. 2.12 below.

When Cartesian coordinates are used, we shall use a comma to denote
partial differentiation. Thus,

$$\phi_{,i} \equiv \frac{\partial \phi}{\partial x_i} , \qquad \xi_{i,j} \equiv \frac{\partial \xi_i}{\partial x_j} , \qquad \sigma_{ij,k} \equiv \frac{\partial \sigma_{ij}}{\partial x_k} .$$

When we restrict ourselves to Cartesian coordinates, $\phi_{,i}$, $\xi_{i,j}$, $\sigma_{ij,k}$ are
tensors of rank one, two, three, respectively, provided that $\phi$, $\xi_i$, $\sigma_{ij}$ are
tensors.

Warning: Cartesian tensor equations derived through the use of
differentiation are in general not valid in curvilinear coordinates. This important
point is discussed in the following five sections.

2.12. COVARIANT DIFFERENTIATION OF VECTOR FIELDS

The generalization of the concept of partial derivatives to the concept
of covariant derivative, so that the covariant derivative of a tensor field is
another tensor field, is the most important milestone in the development of
tensor calculus. It is natural to search for such an extension in the form of
a correction term that depends on the vector itself. Thus, if $\xi^i$ is a vector,
we might expect the combination

$$\frac{\partial \xi^i}{\partial x^j} + \Gamma(i, j, \alpha)\, \xi^\alpha ,$$

to be a tensor. Here the suggested correction is a linear function of $\xi^i$, and
$\Gamma(i, j, \alpha)$ is some function with three indices. The success of this scheme
hinges on the Euclidean Christoffel symbols, which are certain linear
combinations of the derivatives of the metric tensor $g_{ij}$. This subject is beautiful,
and the results are powerful in handling curvilinear coordinates. However,
since the topic is not absolutely necessary for the development of solid
mechanics, we shall not discuss it in detail, but merely outline below some of
the salient results.

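We discussed in Sec. 2.3 the metric tensor $g_{ij}$ in a set of general coordinates
$(x^1, x^2, x^3)$, and in Sec. 2.6 the associated metric tensor $g^{ij}$. By means
of these metric tensors, the Euclidean Christoffel symbols $\Gamma^i_{\alpha\beta}(x^1, x^2, x^3)$
are defined as follows:

(1) $$\Gamma^i_{\alpha\beta} = \frac{1}{2}\, g^{i\sigma} \left( \frac{\partial g_{\sigma\beta}}{\partial x^\alpha} + \frac{\partial g_{\alpha\sigma}}{\partial x^\beta} - \frac{\partial g_{\alpha\beta}}{\partial x^\sigma} \right) .$$

The $\Gamma^i_{\alpha\beta}$ is not a tensor. It transforms under a coordinate transformation
$\bar{x}^i = f^i(x^1, x^2, x^3)$ as follows (see Prob. 2.24, p. 55):

(2) $$\bar{\Gamma}^i_{\alpha\beta}(\bar{x}^1, \bar{x}^2, \bar{x}^3) = \Gamma^\lambda_{\mu\nu}(x^1, x^2, x^3)\, \frac{\partial x^\mu}{\partial \bar{x}^\alpha} \frac{\partial x^\nu}{\partial \bar{x}^\beta} \frac{\partial \bar{x}^i}{\partial x^\lambda} + \frac{\partial^2 x^\lambda}{\partial \bar{x}^\alpha \, \partial \bar{x}^\beta} \frac{\partial \bar{x}^i}{\partial x^\lambda} .$$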

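This equation can be solved for $\partial^2 x^\lambda / \partial \bar{x}^\alpha \partial \bar{x}^\beta$ by multiplying (2) by
$\partial x^m / \partial \bar{x}^i$, summing over $i$, and renaming the free index, to obtain

(3) $$\frac{\partial^2 x^\lambda}{\partial \bar{x}^\alpha \, \partial \bar{x}^\beta} = \bar{\Gamma}^i_{\alpha\beta}(\bar{x})\, \frac{\partial x^\lambda}{\partial \bar{x}^i} - \Gamma^\lambda_{\mu\nu}(x)\, \frac{\partial x^\mu}{\partial \bar{x}^\alpha} \frac{\partial x^\nu}{\partial \bar{x}^\beta} .$$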

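Interchanging the roles of $x^i$ and $\bar{x}^i$, and with suitable changes in indices,
we can substitute (3) into Eq. (2.11:5) to obtain

$$\frac{\partial \bar{\xi}^i}{\partial \bar{x}^\alpha} = \frac{\partial \xi^\lambda}{\partial x^\mu} \frac{\partial x^\mu}{\partial \bar{x}^\alpha} \frac{\partial \bar{x}^i}{\partial x^\lambda} + \xi^\lambda \frac{\partial x^\mu}{\partial \bar{x}^\alpha} \left[ \Gamma^s_{\lambda\mu}(x)\, \frac{\partial \bar{x}^i}{\partial x^s} - \bar{\Gamma}^i_{mn}(\bar{x})\, \frac{\partial \bar{x}^m}{\partial x^\lambda} \frac{\partial \bar{x}^n}{\partial x^\mu} \right] ,$$

which can be reduced to

(4) $$\frac{\partial \bar{\xi}^i}{\partial \bar{x}^\alpha} + \bar{\Gamma}^i_{m\alpha}\, \bar{\xi}^m = \left( \frac{\partial \xi^\lambda}{\partial x^\mu} + \Gamma^\lambda_{\mu s}\, \xi^s \right) \frac{\partial x^\mu}{\partial \bar{x}^\alpha} \frac{\partial \bar{x}^i}{\partial x^\lambda} .$$

But this states that the functions $\partial \xi^\lambda / \partial x^\mu + \Gamma^\lambda_{\mu s}\, \xi^s$ are the components of
a mixed tensor of rank two. Hence, the functions

(5) ▲ $$\xi^i|_\alpha = \frac{\partial \xi^i}{\partial x^\alpha} + \Gamma^i_{\alpha\sigma}\, \xi^\sigma$$

are the components of a mixed tensor field of rank two, called the covariant
derivative of the contravariant vector $\xi^i$. We shall use the notation $\xi^i|_\alpha$ for
the covariant derivative of $\xi^i$.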

By a slight variation in the derivation, it can be shown that the functions

(6) ▲ $$\xi_i|_\alpha = \frac{\partial \xi_i}{\partial x^\alpha} - \Gamma^\sigma_{i\alpha}\, \xi_\sigma$$

are the components of a covariant tensor field of rank two whenever $\xi_i$ are
the components of a covariant vector field. This is called the covariant
derivative of $\xi_i$ and is denoted by $\xi_i|_\alpha$.

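More generally, a long but quite straightforward calculation analogous
to the above can be made to establish the covariant derivative of a tensor
$T^{\alpha_1 \cdots \alpha_p}_{\beta_1 \cdots \beta_q}$ of rank $p + q$, contravariant of rank $p$, covariant of rank $q$:

$$T^{\alpha_1 \cdots \alpha_p}_{\beta_1 \cdots \beta_q}\big|_\gamma = \frac{\partial T^{\alpha_1 \cdots \alpha_p}_{\beta_1 \cdots \beta_q}}{\partial x^\gamma} + \Gamma^{\alpha_1}_{\gamma\sigma}\, T^{\sigma \alpha_2 \cdots \alpha_p}_{\beta_1 \cdots \beta_q} + \cdots + \Gamma^{\alpha_p}_{\gamma\sigma}\, T^{\alpha_1 \cdots \alpha_{p-1} \sigma}_{\beta_1 \cdots \beta_q} - \Gamma^{\sigma}_{\beta_1 \gamma}\, T^{\alpha_1 \cdots \alpha_p}_{\sigma \beta_2 \cdots \beta_q} - \cdots - \Gamma^{\sigma}_{\beta_q \gamma}\, T^{\alpha_1 \cdots \alpha_p}_{\beta_1 \cdots \beta_{q-1} \sigma} .$$

This derivative is contravariant of rank $p$, and covariant of rank $q + 1$.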

Since the components of the metric tensor gij are constant in Cartesian 
coordinates, we see from Eq. (1) that the Euclidean Christoffel symbols are 
zero in Cartesian coordinates. The covariant derivative of a tensor field 
reduces to partial derivatives of the tensor field when the tensor field and 
the operations are evaluated in Cartesian coordinates. 
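
As an illustration of definition (1), the following minimal sketch evaluates the Euclidean Christoffel symbols numerically for plane polar coordinates $(x^1, x^2) = (r, \theta)$, whose metric is $g_{11} = 1$, $g_{22} = r^2$. The function names, the step size `h`, and the sample point are assumptions made for this example; they do not come from the text. The metric derivatives are estimated by central differences, which is adequate here because the metric is a polynomial in $r$.

```python
def metric(x):
    """Covariant metric g_ij of plane polar coordinates x = (r, theta)."""
    r, _ = x
    return [[1.0, 0.0], [0.0, r * r]]

def inverse_2x2(g):
    """Inverse of a 2x2 matrix, giving the associated metric g^ij."""
    det = g[0][0] * g[1][1] - g[0][1] * g[1][0]
    return [[g[1][1] / det, -g[0][1] / det],
            [-g[1][0] / det, g[0][0] / det]]

def metric_derivative(x, k, h=1e-5):
    """Central-difference estimate of d g_ij / d x^k."""
    xp, xm = list(x), list(x)
    xp[k] += h
    xm[k] -= h
    gp, gm = metric(xp), metric(xm)
    return [[(gp[i][j] - gm[i][j]) / (2 * h) for j in range(2)]
            for i in range(2)]

def christoffel(x):
    """gamma[i][a][b] = 1/2 g^{is} (d_a g_{sb} + d_b g_{as} - d_s g_{ab})."""
    n = 2
    ginv = inverse_2x2(metric(x))
    dg = [metric_derivative(x, k) for k in range(n)]  # dg[k][i][j] = d_k g_ij
    gamma = [[[0.0] * n for _ in range(n)] for _ in range(n)]
    for i in range(n):
        for a in range(n):
            for b in range(n):
                gamma[i][a][b] = 0.5 * sum(
                    ginv[i][s] * (dg[a][s][b] + dg[b][a][s] - dg[s][a][b])
                    for s in range(n))
    return gamma

# Sample point r = 2, theta = 0.7 (indices are zero-based in the code).
gamma = christoffel((2.0, 0.7))
```

At this sample point `gamma[0][1][1]` approximates $\Gamma^1_{22} = -r$ and `gamma[1][0][1]` approximates $\Gamma^2_{12} = 1/r$, which are the values quoted as the answer to Prob. 2.25.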


2.13. TENSOR EQUATIONS 

The theorems included in the problems at the end of Sec. 2.5 contain 
perhaps the most important property of tensor fields: if all the components 
of a tensor field vanish in one coordinate system, they vanish likewise in
all coordinate systems which can be obtained by admissible transformations.
Since the sum and difference of tensor fields of a given type are tensors of 
the same type, we deduce that if a tensor equation can be established in one 
coordinate system, then it must hold for all coordinate systems obtained by 
admissible transformations. 

The last statement affords a powerful method for establishing equations
in mathematical physics. For example, if a certain tensor relationship can
be shown to be true in rectangular Cartesian coordinates, then it is also true
in general curvilinear coordinates in Euclidean space. Thus, once an equation
is established in rectangular Cartesian coordinates, the corresponding
equation (stating a physical fact, such as a condition of equilibrium, of
conservation of energy, etc.) in any specific curvilinear coordinates in Euclidean
space can be obtained by a straightforward “translation” in the language of
tensors. The key word here is “tensor.” Every term in the equation must
be a tensor. Partial derivatives are tensors only in Cartesian coordinates.
They are not tensors in curvilinear coordinates. Covariant differentiation
must be used to generate new tensors in curvilinear coordinates.

As an example of the application of these remarks, let us consider the
successive covariant derivatives of a tensor field. Since the covariant derivative
of a tensor field is a tensor field, we can form the covariant derivative
of the latter, which is called the second covariant derivative of the original
tensor field. If $T^\alpha_\beta$ denotes the original tensor field, we can consider the
second covariant derivative $T^\alpha_\beta|_{\gamma\delta}$. Now, if the space is Euclidean, then it can
be described by a rectangular Cartesian coordinate system. In Cartesian
coordinates, the Euclidean Christoffel symbols are all zero and the covariant
derivatives of a tensor field reduce to partial derivatives of the tensor field.
But partial derivatives are commutative if they are continuous. Hence, we
see that the following equation is true in Cartesian coordinates if every
component is continuous,

(1) $$T^\alpha_\beta|_{\gamma\delta} = T^\alpha_\beta|_{\delta\gamma} ,$$

and, therefore, it is true in all coordinates that can be obtained by admissible
transformations from a Cartesian system.

We must remark that the commutativeness of the covariant differentiation
operation is established above only in Euclidean space. A coordinate
system in a more general Riemannian space may not be transformed into
Cartesian coordinates and the method of proof used above cannot be applied.
In fact, the theorem expressed in Eq. (1) is, in general, untrue
in Riemannian space: successive covariant derivatives are, in general, not
commutative in the Riemannian space. (See Prob. 2.31 below.)

As a second application we can prove the following theorem: the covariant
derivatives of the Euclidean metric tensor $g_{ij}$ and the associated
contravariant tensor $g^{ij}$ are zero:

(2) $$g_{ij}|_k = 0 , \qquad g^{ij}|_k = 0 .$$

Since $g_{ij}|_k$ and $g^{ij}|_k$ are tensors, the truth of the theorem can be established
if we can demonstrate Eq. (2) in one particular coordinate system. But
this is exactly the case in Cartesian coordinates, in which $g_{ij}$ and $g^{ij}$ are
constants and, hence, their derivatives vanish. Thus, the proof is completed.
In contrast to Eq. (1), however, it can be proved that Eqs. (2) remain true
in Riemannian space.
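
A short worked check of the statement $g_{ij}|_k = 0$, given here as an illustration rather than as part of the proof above: in plane polar coordinates $(x^1, x^2) = (r, \theta)$ the metric is $g_{11} = 1$, $g_{22} = r^2$, $g_{12} = 0$, and the nonvanishing Christoffel symbols are $\Gamma^1_{22} = -r$, $\Gamma^2_{12} = \Gamma^2_{21} = 1/r$ (the answer quoted in Prob. 2.25). Specializing the general covariant-derivative formula of Sec. 2.12 to a covariant tensor of rank two, two representative components work out as follows.

```latex
\begin{aligned}
g_{ij}|_k &= \frac{\partial g_{ij}}{\partial x^k}
            - \Gamma^\sigma_{ik}\, g_{\sigma j}
            - \Gamma^\sigma_{jk}\, g_{i\sigma}, \\
g_{22}|_1 &= 2r - \Gamma^2_{21}\, g_{22} - \Gamma^2_{21}\, g_{22}
           = 2r - 2 \cdot \frac{1}{r} \cdot r^2 = 0, \\
g_{12}|_2 &= 0 - \Gamma^2_{12}\, g_{22} - \Gamma^1_{22}\, g_{11}
           = -\frac{1}{r}\, r^2 + r = 0 .
\end{aligned}
```

The partial derivative $\partial g_{22} / \partial r = 2r$ does not vanish by itself; it is the Christoffel terms that cancel it, which is the point of using covariant rather than partial differentiation in curvilinear coordinates.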

Further examples are furnished in Probs. 2.17 to 2.23. 

To apply this powerful procedure, one must make sure that all quantities
involved are tensors. In particular, we must ascertain that all scalars are
“absolute” constants. This remark is very important because in physics we
also use quantities that transform like relative tensors. A relative tensor
of weight $w$ is an object with components whose transformation law differs
from the tensor transformation law by the appearance of the Jacobian to
the $w$th power as a factor. Thus,

(3) $$\bar{\phi}(\theta) = \left| \frac{\partial x^i}{\partial \theta^j} \right|^w \phi(x) ,$$

(4) $$\bar{\xi}^i(\theta) = \left| \frac{\partial x^j}{\partial \theta^p} \right|^w \xi^\alpha(x)\, \frac{\partial \theta^i}{\partial x^\alpha} ,$$

are the transformation laws for a relative scalar field of weight $w$ and a
relative contravariant vector field of weight $w$, respectively. If $w = 0$, we
have the previous notion of a tensor field. Whether an object is a tensor or
a relative tensor is often a matter of definition.

As an example, consider the total mass enclosed in a volume expressed in
terms of density. Let $x_i$ be rectangular coordinates which are transformed
into curvilinear coordinates $\theta_j$. We have

(5) $$M = \iiint_V \rho_0(x_1, x_2, x_3)\, dx_1\, dx_2\, dx_3 = \iiint_V \rho_0[x(\theta)] \left| \frac{\partial x_i}{\partial \theta_j} \right| d\theta_1\, d\theta_2\, d\theta_3 = \iiint_V \rho(\theta_1, \theta_2, \theta_3)\, d\theta_1\, d\theta_2\, d\theta_3 .$$

If $\rho(\theta)$ in the last term is defined as the density distribution in the
$\theta_i$-coordinates, then it is a relative scalar of weight one. On the other hand,
$\rho_0[x(\theta)] = \rho_0(x)$ is an absolute scalar which defines the (physical) density
of the medium.

As another example, consider the determinant $g$ of the Euclidean metric
tensor $g_{ij}$, whose transformation law is

(6) $$\bar{g}_{\alpha\beta}(\theta) = \frac{\partial x^i}{\partial \theta^\alpha} \frac{\partial x^j}{\partial \theta^\beta}\, g_{ij}(x) .$$

Let $g = |g_{\alpha\beta}|$, the determinant of $g_{\alpha\beta}$. By a double use of the formula for
the product of two determinants when applied to Eq. (6), it is easy to prove
that

(7) $$\bar{g}(\theta) = \left| \frac{\partial x^i}{\partial \theta^j} \right|^2 g(x) ,$$

which shows that $g$ is a relative scalar of weight two. It follows that $\sqrt{g}$ is a
relative scalar of weight one. We note that if $x_i$ are rectangular coordinates,
$g = 1$. Hence Eq. (7) shows that $\sqrt{\bar{g}(\theta)} = |\partial x^i / \partial \theta^j|$, the Jacobian of
the transformation. Thus the volume enclosed by a closed surface can be
written as

(8) $$V = \iiint dx_1\, dx_2\, dx_3 = \iiint \sqrt{g}\; d\theta_1\, d\theta_2\, d\theta_3 .$$

The last integral shows the importance of $\sqrt{g}$ in mechanics.

The method of tensor equations does not apply to relative tensors. 
Therefore it is important to properly define all quantities involved in an 
equation to be tensors. 
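
The role of $\sqrt{g}$ as the volume weight can be illustrated numerically. The sketch below is an example added under stated assumptions: spherical polar coordinates $(r, \phi, \theta)$ with $\phi$ the polar angle, for which $g_{ij} = \mathrm{diag}(1, r^2, r^2 \sin^2\phi)$ and hence $\sqrt{g} = r^2 \sin\phi$. The function names and the midpoint-rule resolution `n` are choices made for this example only.

```python
import math

def sqrt_g_spherical(r, phi):
    """sqrt(g) for spherical polar coordinates (r, phi, theta).

    With phi the polar angle, g_ij = diag(1, r^2, r^2 sin^2(phi)),
    so sqrt(g) = r^2 sin(phi).
    """
    return r * r * math.sin(phi)

def sphere_volume(radius, n=200):
    """Midpoint-rule estimate of the integral of sqrt(g) dr dphi dtheta."""
    dr = radius / n
    dphi = math.pi / n
    total = 0.0
    for i in range(n):
        r = (i + 0.5) * dr
        for j in range(n):
            phi = (j + 0.5) * dphi
            total += sqrt_g_spherical(r, phi) * dr * dphi
    # The integrand does not depend on theta, so that integral is 2*pi.
    return total * 2.0 * math.pi

volume = sphere_volume(1.0)
exact = 4.0 / 3.0 * math.pi
```

Integrating $\sqrt{g}$ over the coordinate box $0 \le r \le 1$, $0 \le \phi \le \pi$, $0 \le \theta \le 2\pi$ reproduces the volume of the unit sphere to within the quadrature error, whereas integrating $1$ over the same box would give the meaningless number $2\pi^2$.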

2.14. GEOMETRIC INTERPRETATION OF TENSOR 
COMPONENTS 

Before we conclude this chapter we shall consider briefly the geometric
interpretation of tensor components. For this purpose we must use the
concept of base vectors. We know that in a three-dimensional Euclidean
space any three linearly independent vectors form a basis with which any
other vector can be expanded as a linear combination of these three vectors.
When a rectangular Cartesian frame of reference is chosen, we can choose as
base vectors the unit vectors $\mathbf{i}_1, \mathbf{i}_2, \mathbf{i}_3$ parallel to the coordinate axes: thus,
if a vector $\mathbf{A}$ has three components $(A_1, A_2, A_3)$, we can write

$$\mathbf{A} = A_1 \mathbf{i}_1 + A_2 \mathbf{i}_2 + A_3 \mathbf{i}_3 .$$

In a curvilinear coordinate system in a Euclidean space, we shall introduce
the base vectors from the following consideration.

Let $d\mathbf{R}$ denote an infinitesimal vector $PQ$ joining a point $P = (x_1, x_2, x_3)$
to a point $Q = (x_1 + dx_1, x_2 + dx_2, x_3 + dx_3)$, where $x_1, x_2, x_3$ are
referred to a rectangular Cartesian frame of reference. (See Figure 2.14:1.)
Then, obviously,

(1) $$d\mathbf{R} = dx^r\, \mathbf{i}_r = dx_r\, \mathbf{i}^r ,$$

where $\mathbf{i}_1, \mathbf{i}_2, \mathbf{i}_3$, or $\mathbf{i}^1, \mathbf{i}^2, \mathbf{i}^3$, denote the base vectors along the coordinate axes.
Here, since a rectangular Cartesian coordinate system is used, we can assign
arbitrarily an index as contravariant or covariant. In the assignment
chosen above, $dx^r$ and $dx_r$ are, respectively, contravariant and covariant
differentials.

Now let us consider a transformation from the rectangular Cartesian
coordinates $x^i$ to general coordinates $\theta^i$,

(2) $$\theta^i = \theta^i(x_1, x_2, x_3) .$$

Fig. 2.14:1. Base vectors.


According to the tensor transformation law, when $dx^r$ and $dx_r$ are regarded
as tensors of order one, their components in the general coordinates become

(3) $$d\theta^j = \frac{\partial \theta^j}{\partial x^i}\, dx^i , \qquad dx^i = \frac{\partial x^i}{\partial \theta^j}\, d\theta^j ,$$

(4) $$d\theta_j = \frac{\partial x^i}{\partial \theta^j}\, dx_i , \qquad dx_i = \frac{\partial \theta^j}{\partial x^i}\, d\theta_j .$$

Now in (3), $d\theta^i$ can be identified as the usual differential of the variable
$\theta^i$ as specified in Eq. (2); hence, the superscript is justified. But in Eq. (4),
$d\theta_j$ is not to be identified with the usual differential; $d\theta_j$ is a covariant
differential that will have a different geometric meaning, as will be seen
later.

By Eqs. (3) and (4), we may write Eq. (1) as

(5) ▲ $$d\mathbf{R} = \mathbf{g}_r\, d\theta^r = \mathbf{g}^r\, d\theta_r ,$$

where

(6) $$\mathbf{g}_r = \frac{\partial x^s}{\partial \theta^r}\, \mathbf{i}_s , \qquad \mathbf{g}^r = \frac{\partial \theta^r}{\partial x^s}\, \mathbf{i}^s .$$

Since $\mathbf{g}_r$ and $\mathbf{g}^r$ are linear combinations of unit vectors, they are themselves
vectors; they are known as the covariant and contravariant base vectors,
respectively, or as the base vectors and reciprocal base vectors. Equation (5)
shows that

(7) ▲ $$\mathbf{g}_i = \frac{\partial \mathbf{R}}{\partial \theta^i} .$$

Hence $\mathbf{g}_i$ characterizes the change of the position vector $\mathbf{R}$ as $\theta^i$ varies. In
other words, $\mathbf{g}_i$ is directed tangentially along the coordinate curve $\theta^i$. These
vectors are illustrated in Fig. 2.14:2.



Fig. 2.14:2. Contravariant and covariant components of a vector v in two dimensions. 


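It is easily verified that

(8) ▲ $$\mathbf{g}_r \cdot \mathbf{g}_s = g_{rs} , \qquad \mathbf{g}^r \cdot \mathbf{g}^s = g^{rs} ,$$

(9) ▲ $$\mathbf{g}^r \cdot \mathbf{g}_s = \mathbf{g}_s \cdot \mathbf{g}^r = \delta^r_s ,$$

(10) ▲ $$\mathbf{g}^r = g^{rs}\, \mathbf{g}_s , \qquad \mathbf{g}_r = g_{rs}\, \mathbf{g}^s ,$$

where $g_{rs}$ is the Euclidean metric tensor of the coordinate system, and
$g^{rs}$ is the associated, or conjugate, metric tensor. From Eq. (9) it is clear
that the contravariant base vectors $\mathbf{g}^1, \mathbf{g}^2, \mathbf{g}^3$ are, respectively, perpendicular
to the planes of $\mathbf{g}_2 \mathbf{g}_3$, $\mathbf{g}_3 \mathbf{g}_1$, $\mathbf{g}_1 \mathbf{g}_2$. (See also Prob. 2.28.) For orthogonal
coordinates, that is, when $\mathbf{g}_i \cdot \mathbf{g}_j = 0$ for $i \neq j$, it can be shown that $\mathbf{g}^i$ is in
the same direction as $\mathbf{g}_i$ and that $\mathbf{g}^i \cdot \mathbf{g}^j = 0$ for $i \neq j$.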

We can now deal with the contravariant and covariant components of
a vector. Consider the expression $v^r \mathbf{g}_r$, where $v^r$ is a contravariant tensor
of rank one, and $\mathbf{g}_r$ are the covariant base vectors at a generic point. This
expression remains invariant under coordinate transformations, and, since
it is a linear combination of the base vectors $\mathbf{g}_r$, it is a vector, which may
be designated $\mathbf{v}$:

(11) $$\mathbf{v} = v^r\, \mathbf{g}_r .$$

By Eq. (10), replacing $\mathbf{g}_r$ by $g_{rs}\, \mathbf{g}^s$, we also have

(12) ▲ $$\mathbf{v} = v^r\, \mathbf{g}_r = v_s\, \mathbf{g}^s ,$$

where

(13) ▲ $$v_s = g_{rs}\, v^r , \qquad v^r = g^{rs}\, v_s .$$

According to Eq. (12), if we represent $\mathbf{v}$ by a directed line, then the
contravariant components $v^r$ are the components of $\mathbf{v}$ in the direction of the
covariant base vectors, while the covariant components $v_r$ are the components
of $\mathbf{v}$ in the direction of the contravariant base vectors. A two-dimensional
illustration is shown in Fig. 2.14:2.

Equation (12) justifies naming $v^r$ the contravariant components of the
vector $\mathbf{v}$, and $v_r$ the covariant components of $\mathbf{v}$. Thus, in our Euclidean
space, the tensors $v^r$ and $v_r$ are two different representations of the same
vector $\mathbf{v}$. Equations (13) establish the process of raising and lowering of
indices.
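
To make the two sets of components concrete, here is a minimal sketch for an oblique two-dimensional basis. The base vectors `g1`, `g2` and the vector `v` are hypothetical values chosen for this example; only the relations $g_{rs} = \mathbf{g}_r \cdot \mathbf{g}_s$, $\mathbf{g}^r = g^{rs} \mathbf{g}_s$, and $v_s = g_{rs} v^r$ are taken from the text.

```python
def dot(a, b):
    """Scalar product of two plane vectors given by Cartesian components."""
    return a[0] * b[0] + a[1] * b[1]

# Covariant base vectors of an oblique 2D system (hypothetical values).
g1 = (1.0, 0.0)
g2 = (1.0, 2.0)
cov = [g1, g2]

# Metric g_rs = g_r . g_s and its inverse g^rs.
g = [[dot(a, b) for b in cov] for a in cov]
det = g[0][0] * g[1][1] - g[0][1] * g[1][0]
ginv = [[g[1][1] / det, -g[0][1] / det],
        [-g[1][0] / det, g[0][0] / det]]

# Contravariant (reciprocal) base vectors g^r = g^rs g_s.
con = [tuple(sum(ginv[r][s] * cov[s][k] for s in range(2)) for k in range(2))
       for r in range(2)]

v = (3.0, 4.0)  # the vector itself, given by Cartesian components

# Since v = v^r g_r = v_s g^s and g^r . g_s = delta^r_s:
v_con = [dot(v, con[r]) for r in range(2)]   # contravariant components v^r
v_cov = [dot(v, cov[s]) for s in range(2)]   # covariant components v_s

# Lowering the index with the metric: v_s = g_rs v^r.
v_cov_from_metric = [sum(g[r][s] * v_con[r] for r in range(2))
                     for s in range(2)]
```

For these values the contravariant components are $(1, 2)$ and the covariant components are $(3, 11)$: two different sets of numbers describing the same arrow, related by the metric.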

Having recognized the covariant and contravariant base vectors of each
coordinate system, we can use them to derive the transformation laws of
vectors and tensors from one coordinate system to another. Let $\mathbf{g}^i$, $\bar{\mathbf{g}}^i$ and
$\mathbf{g}_i$, $\bar{\mathbf{g}}_i$ be, respectively, the contravariant and covariant base vectors of two
coordinate systems of reference. We define

(14) $$\beta^{r\cdot}_{\cdot m} = \bar{\mathbf{g}}_m \cdot \mathbf{g}^r = \sqrt{\bar{g}_{mm}\, g^{rr}}\, \cos(\bar{\mathbf{g}}_m, \mathbf{g}^r) , \qquad \beta^{\cdot m}_{r\cdot} = \bar{\mathbf{g}}^m \cdot \mathbf{g}_r = \sqrt{\bar{g}^{mm}\, g_{rr}}\, \cos(\bar{\mathbf{g}}^m, \mathbf{g}_r) ,$$

in which $m$ and $r$ are not summed. We use a dot in the superscript or subscript
to emphasize the difference between $\beta^{r\cdot}_{\cdot m}$ and $\beta^{\cdot r}_{m\cdot}$ $[= \sqrt{g_{mm}\, \bar{g}^{rr}}\, \cos(\mathbf{g}_m, \bar{\mathbf{g}}^r)]$
($m$, $r$ not summed). In general, $\beta^{r\cdot}_{\cdot m} \neq \beta^{\cdot r}_{m\cdot} \neq \beta^{m\cdot}_{\cdot r} \neq \beta^{\cdot m}_{r\cdot}$. Here the first
indices of the superscript and the subscript are associated with the coordinates
with base vectors $\mathbf{g}$'s and the second indices with $\bar{\mathbf{g}}$'s. The distinction
between $\beta^{r\cdot}_{\cdot m}$ and $\beta^{\cdot m}_{r\cdot}$ disappears in Cartesian coordinates.

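One can easily show that

(15) $$\mathbf{g}_r = (\mathbf{g}_r \cdot \bar{\mathbf{g}}^m)\, \bar{\mathbf{g}}_m = \beta^{\cdot m}_{r\cdot}\, \bar{\mathbf{g}}_m ,$$

(16) $$\bar{\mathbf{g}}_m = (\bar{\mathbf{g}}_m \cdot \mathbf{g}^k)\, \mathbf{g}_k = (\mathbf{g}^k \cdot \bar{\mathbf{g}}_m)\, \mathbf{g}_k = \beta^{k\cdot}_{\cdot m}\, \mathbf{g}_k ,$$

and that the $\beta$'s are related by the metric tensors of the two coordinate
systems:

$$\beta^{\cdot r}_{m\cdot} = g_{mp}\, \beta^{p\cdot}_{\cdot q}\, \bar{g}^{rq} , \qquad \beta^{r\cdot}_{\cdot m} = g^{rp}\, \beta^{\cdot q}_{p\cdot}\, \bar{g}_{qm} .$$

From Eqs. (6) and (14), it can be seen that

(17) $$\beta^{\cdot m}_{r\cdot} = \mathbf{g}_r \cdot \bar{\mathbf{g}}^m = \frac{\partial \bar{x}^m}{\partial x^r} , \qquad \beta^{k\cdot}_{\cdot m} = \mathbf{g}^k \cdot \bar{\mathbf{g}}_m = \frac{\partial x^k}{\partial \bar{x}^m} .$$

Thus, Eqs. (15) and (16) establish that the transformation between $\mathbf{g}$'s and
$\bar{\mathbf{g}}$'s follows the tensor transformation law. A substitution of Eq. (16) into
Eq. (15) and vice versa yields

$$\mathbf{g}_r = \beta^{\cdot k}_{r\cdot}\, \bar{\mathbf{g}}_k = \beta^{\cdot k}_{r\cdot}\, \beta^{m\cdot}_{\cdot k}\, \mathbf{g}_m , \qquad \bar{\mathbf{g}}_m = \beta^{k\cdot}_{\cdot m}\, \mathbf{g}_k = \beta^{k\cdot}_{\cdot m}\, \beta^{\cdot r}_{k\cdot}\, \bar{\mathbf{g}}_r ,$$

which implies that

(18) $$\beta^{\cdot k}_{r\cdot}\, \beta^{m\cdot}_{\cdot k} = \delta^m_r , \qquad \beta^{k\cdot}_{\cdot m}\, \beta^{\cdot r}_{k\cdot} = \delta^r_m .$$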

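We can now determine the transformation law for the components of $\mathbf{v}$
between different coordinate systems. Dotting both sides of the equation

$$v^r\, \mathbf{g}_r = \bar{v}^m\, \bar{\mathbf{g}}_m$$

with $\mathbf{g}^i$ gives the transformation law

(19) ▲ $$v^i = (\mathbf{g}^i \cdot \bar{\mathbf{g}}_m)\, \bar{v}^m = \beta^{i\cdot}_{\cdot m}\, \bar{v}^m .$$

Similarly, dotting the same equation with $\bar{\mathbf{g}}^j$ gives the inverse transformation

(20) ▲ $$\bar{v}^j = (\mathbf{g}_r \cdot \bar{\mathbf{g}}^j)\, v^r = \beta^{\cdot j}_{r\cdot}\, v^r .$$

Using the relations of the contravariant base vectors

(21) ▲ $$\mathbf{g}^r = (\mathbf{g}^r \cdot \bar{\mathbf{g}}_m)\, \bar{\mathbf{g}}^m = \beta^{r\cdot}_{\cdot m}\, \bar{\mathbf{g}}^m , \qquad \bar{\mathbf{g}}^m = (\mathbf{g}_r \cdot \bar{\mathbf{g}}^m)\, \mathbf{g}^r = \beta^{\cdot m}_{r\cdot}\, \mathbf{g}^r ,$$

together with $\mathbf{v} = v_r\, \mathbf{g}^r = \bar{v}_m\, \bar{\mathbf{g}}^m$, we obtain

(22) ▲ $$v_r = (\mathbf{g}_r \cdot \bar{\mathbf{g}}^m)\, \bar{v}_m = \beta^{\cdot m}_{r\cdot}\, \bar{v}_m , \qquad \bar{v}_m = (\mathbf{g}^r \cdot \bar{\mathbf{g}}_m)\, v_r = \beta^{r\cdot}_{\cdot m}\, v_r .$$

If both coordinate systems are rectangular Cartesian coordinates, $\mathbf{g}_i = \mathbf{g}^i$
and $\bar{\mathbf{g}}_i = \bar{\mathbf{g}}^i$. There is no need to distinguish the superscripts and the
subscripts of all quantities, and the $\mathbf{g}$'s and $\bar{\mathbf{g}}$'s are unit base vectors. Then Eq. (14)
reduces to

(23) $$\beta_{rm} = \cos(\mathbf{g}_r, \bar{\mathbf{g}}_m) = \mathbf{g}_r \cdot \bar{\mathbf{g}}_m = \bar{\mathbf{g}}_m \cdot \mathbf{g}_r ,$$

that is, $\beta_{rm}$ is the direction cosine of the angle between $\mathbf{g}_r$ and $\bar{\mathbf{g}}_m$.
Equation (18) becomes

(24) $$\beta_{rk}\, \beta_{mk} = \beta_{kr}\, \beta_{km} = \delta_{rm} ,$$

i.e., the transpose of the Cartesian tensor $\beta_{rm}$ is the inverse of $\beta_{rm}$.
Equations (19), (20) and (22) become

(25) $$v_r = (\mathbf{g}_r \cdot \bar{\mathbf{g}}_m)\, \bar{v}_m = \beta_{rm}\, \bar{v}_m , \qquad \bar{v}_m = (\mathbf{g}_r \cdot \bar{\mathbf{g}}_m)\, v_r = \beta_{rm}\, v_r .$$

Note that in general we still have $\beta_{rm} \neq \beta_{mr}$ for $r \neq m$.

If the $v$'s and $\bar{v}$'s are the coordinates of the two rectangular Cartesian
coordinate systems, Eq. (25) gives the transformation law between the two
coordinate systems as stated in Eqs. (2.8:2,3) for zero translation.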

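A similar concept can be generalized to deal with the contravariant and
covariant components of tensors of rank two or higher. Consider the
expression

$$\mathbf{A} = A^{rs}\, \mathbf{g}_r \mathbf{g}_s ,$$

where $A^{rs}$ is a contravariant tensor of rank 2. By Eq. (10), replacing $\mathbf{g}_r$ by
$g_{rm}\, \mathbf{g}^m$, we also have

$$\mathbf{A} = A^{rs}\, \mathbf{g}_r \mathbf{g}_s = A_{mn}\, \mathbf{g}^m \mathbf{g}^n .$$

The expressions above are invariant under coordinate transformation, i.e.,

(26) $$\mathbf{A} = A^{rs}\, \mathbf{g}_r \mathbf{g}_s = \bar{A}^{mn}\, \bar{\mathbf{g}}_m \bar{\mathbf{g}}_n = A_{rs}\, \mathbf{g}^r \mathbf{g}^s = \bar{A}_{mn}\, \bar{\mathbf{g}}^m \bar{\mathbf{g}}^n ,$$

in two different coordinate systems of reference. To examine the relations
between $A^{rs}$ and $\bar{A}^{rs}$, we substitute Eq. (16) into Eq. (26) and get

(27) $$\mathbf{A} = A^{rs}\, \mathbf{g}_r \mathbf{g}_s = \bar{A}^{mn}\, \beta^{r\cdot}_{\cdot m}\, \beta^{s\cdot}_{\cdot n}\, \mathbf{g}_r \mathbf{g}_s , \quad \text{or} \quad A^{rs} = \beta^{r\cdot}_{\cdot m}\, \beta^{s\cdot}_{\cdot n}\, \bar{A}^{mn} .$$

The substitution of Eq. (15) into Eq. (26) yields

(28) $$\mathbf{A} = \bar{A}^{mn}\, \bar{\mathbf{g}}_m \bar{\mathbf{g}}_n = A^{rs}\, \beta^{\cdot m}_{r\cdot}\, \beta^{\cdot n}_{s\cdot}\, \bar{\mathbf{g}}_m \bar{\mathbf{g}}_n , \quad \text{or} \quad \bar{A}^{mn} = \beta^{\cdot m}_{r\cdot}\, \beta^{\cdot n}_{s\cdot}\, A^{rs} .$$

Similarly, using the relations of the contravariant base vectors given in
Eq. (21), we obtain

(29) $$A_{rs} = \beta^{\cdot m}_{r\cdot}\, \beta^{\cdot n}_{s\cdot}\, \bar{A}_{mn} \quad \text{and} \quad \bar{A}_{mn} = \beta^{r\cdot}_{\cdot m}\, \beta^{s\cdot}_{\cdot n}\, A_{rs} .$$

If both coordinate systems are rectangular Cartesian, Eqs. (27), (28) and
(29) become

(30) $$A_{rs} = \beta_{rm}\, \beta_{sn}\, \bar{A}_{mn} \quad \text{and} \quad \bar{A}_{mn} = \beta_{rm}\, \beta_{sn}\, A_{rs} .$$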
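
The Cartesian case can be checked numerically. The following sketch is an example added here: it assumes a second frame rotated by 30 degrees about the third axis and an arbitrary symmetric test tensor `A`, builds the direction cosines $\beta_{rm} = \mathbf{g}_r \cdot \bar{\mathbf{g}}_m$, and verifies that $\beta_{rk} \beta_{mk} = \delta_{rm}$ and that transforming a rank-two tensor to the rotated frame and back recovers it.

```python
import math

def direction_cosines(angle):
    """beta[r][m] = g_r . gbar_m for a barred frame rotated by `angle` about x3."""
    c, s = math.cos(angle), math.sin(angle)
    g = [(1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)]
    gbar = [(c, s, 0.0), (-s, c, 0.0), (0.0, 0.0, 1.0)]
    return [[sum(g[r][k] * gbar[m][k] for k in range(3)) for m in range(3)]
            for r in range(3)]

beta = direction_cosines(math.radians(30.0))

# Orthogonality: beta_rk beta_mk = delta_rm.
identity = [[sum(beta[r][k] * beta[m][k] for k in range(3)) for m in range(3)]
            for r in range(3)]

# Rank-two tensor: Abar_mn = beta_rm beta_sn A_rs, then back again
# with A_rs = beta_rm beta_sn Abar_mn.
A = [[1.0, 2.0, 0.0], [2.0, 3.0, 0.0], [0.0, 0.0, 4.0]]
A_bar = [[sum(beta[r][m] * beta[s][n] * A[r][s]
              for r in range(3) for s in range(3))
          for n in range(3)] for m in range(3)]
A_back = [[sum(beta[r][m] * beta[s][n] * A_bar[m][n]
               for m in range(3) for n in range(3))
           for s in range(3)] for r in range(3)]
```

Note that `beta[0][1]` is $-\sin 30^\circ$ while `beta[1][0]` is $+\sin 30^\circ$, so the array of direction cosines is not symmetric even though its transpose is its inverse.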

These formulas tell us how base vectors and tensor components are
changed in coordinate transformation. Coordinate transformation occupies
a uniquely important position in mechanics for many reasons. One reason
is exemplified by Einstein’s using it to develop the theory of relativity.
Another reason is that the shape of a natural object of interest to science
and engineering often has a natural, preferred coordinate system for its
description, e.g., the earth is round, the rail is straight, and the egg is
egg-shaped. Whatever the reason, we often find it desirable to transform an
equation written in one coordinate system to one valid in another coordinate
system. For these tasks the method illustrated above can be helpful.

2.15. GEOMETRIC INTERPRETATION OF COVARIANT 
DERIVATIVES 

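Consider now a vector field $\mathbf{v}$ defined at every point of space in a region
$R$. Let the vector at the point $P(\theta^1, \theta^2, \theta^3)$ be

(1) $$\mathbf{v}(P) = v^i(P)\, \mathbf{g}_i(P) .$$

At a neighboring point $P'(\theta^1 + d\theta^1, \theta^2 + d\theta^2, \theta^3 + d\theta^3)$, the vector becomes

$$\mathbf{v}(P') = \mathbf{v}(P) + d\mathbf{v}(P) = \{ v^i(P) + dv^i(P) \} [\, \mathbf{g}_i(P) + d\mathbf{g}_i(P) \,] .$$

On passing to the limit $d\theta^i \to 0$, we obtain the principal part of the
difference

(2) $$d\mathbf{v} = (v^i + dv^i)(\mathbf{g}_i + d\mathbf{g}_i) - v^i\, \mathbf{g}_i = \mathbf{g}_i\, dv^i + v^i\, d\mathbf{g}_i ,$$

and the derivative

(3) ▲ $$\frac{\partial \mathbf{v}}{\partial \theta^j} = \mathbf{g}_i\, \frac{\partial v^i}{\partial \theta^j} + v^i\, \frac{\partial \mathbf{g}_i}{\partial \theta^j} .$$

Thus the derivative of the vector $\mathbf{v}$ is resolved into two parts: one arising
from the variation of the components $v^i$ as the coordinates $\theta^1, \theta^2, \theta^3$ are
changed, the other arising from the change of the base vector $\mathbf{g}_i$ as the
position of the point $\theta^i$ is changed. It is shown below that, in a Euclidean
space,

(4) ▲ $$\frac{\partial \mathbf{g}_i}{\partial \theta^j} = \Gamma^\alpha_{ij}\, \mathbf{g}_\alpha .$$

Hence

(5) ▲ $$\frac{\partial \mathbf{v}}{\partial \theta^j} = \mathbf{g}_\alpha\, \frac{\partial v^\alpha}{\partial \theta^j} + v^i\, \Gamma^\alpha_{ij}\, \mathbf{g}_\alpha = \left( \frac{\partial v^\alpha}{\partial \theta^j} + v^i\, \Gamma^\alpha_{ij} \right) \mathbf{g}_\alpha = v^\alpha|_j\, \mathbf{g}_\alpha .$$

Thus, the covariant derivative $v^\alpha|_j$ represents the components of $\partial \mathbf{v} / \partial \theta^j$
referred to the base vectors $\mathbf{g}_\alpha$.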
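
As an illustrative check of the relation $\partial \mathbf{g}_i / \partial \theta^j = \Gamma^\alpha_{ij}\, \mathbf{g}_\alpha$ (a sketch added here, assuming plane polar coordinates $\theta^1 = r$, $\theta^2 = \theta$ with position vector $\mathbf{R} = r\cos\theta\, \mathbf{i}_1 + r\sin\theta\, \mathbf{i}_2$), the base vectors can be differentiated directly:

```latex
\begin{aligned}
\mathbf{g}_1 &= \frac{\partial \mathbf{R}}{\partial r}
  = \cos\theta\, \mathbf{i}_1 + \sin\theta\, \mathbf{i}_2, \qquad
\mathbf{g}_2 = \frac{\partial \mathbf{R}}{\partial \theta}
  = -r\sin\theta\, \mathbf{i}_1 + r\cos\theta\, \mathbf{i}_2, \\
\frac{\partial \mathbf{g}_1}{\partial r} &= \mathbf{0}, \\
\frac{\partial \mathbf{g}_1}{\partial \theta}
  &= \frac{\partial \mathbf{g}_2}{\partial r}
  = -\sin\theta\, \mathbf{i}_1 + \cos\theta\, \mathbf{i}_2
  = \frac{1}{r}\, \mathbf{g}_2, \\
\frac{\partial \mathbf{g}_2}{\partial \theta}
  &= -r\cos\theta\, \mathbf{i}_1 - r\sin\theta\, \mathbf{i}_2
  = -r\, \mathbf{g}_1 .
\end{aligned}
```

Reading off the coefficients of $\mathbf{g}_\alpha$ gives $\Gamma^2_{12} = \Gamma^2_{21} = 1/r$ and $\Gamma^1_{22} = -r$, with all other symbols zero, in agreement with the answer quoted in Prob. 2.25.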

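To establish Eq. (4), we differentiate the equation $g_{ij} = \mathbf{g}_i \cdot \mathbf{g}_j$ to obtain

$$\frac{\partial g_{ij}}{\partial \theta^k} = \frac{\partial \mathbf{g}_i}{\partial \theta^k} \cdot \mathbf{g}_j + \frac{\partial \mathbf{g}_j}{\partial \theta^k} \cdot \mathbf{g}_i .$$

On permuting the indices $i, j, k$, we can obtain the derivatives $\partial g_{ik} / \partial \theta^j$,
$\partial g_{jk} / \partial \theta^i$. Furthermore, since $\mathbf{g}_i = \partial \mathbf{R} / \partial \theta^i$, we have

$$\frac{\partial \mathbf{g}_i}{\partial \theta^j} = \frac{\partial}{\partial \theta^j} \left( \frac{\partial \mathbf{R}}{\partial \theta^i} \right) = \frac{\partial}{\partial \theta^i} \left( \frac{\partial \mathbf{R}}{\partial \theta^j} \right) = \frac{\partial \mathbf{g}_j}{\partial \theta^i} .$$

Hence, by substitution, it is easy to show that

$$\frac{1}{2} \left( \frac{\partial g_{ik}}{\partial \theta^j} + \frac{\partial g_{jk}}{\partial \theta^i} - \frac{\partial g_{ij}}{\partial \theta^k} \right) = \frac{\partial \mathbf{g}_i}{\partial \theta^j} \cdot \mathbf{g}_k .$$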

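On multiplying the two sides of the equation by $g^{\alpha k}$ and summing over $k$,
the left-hand side becomes $\Gamma^\alpha_{ij}$, according to the definition of the Christoffel
symbols. We then multiply the scalar quantity on both sides of the equation
by the vectors $\mathbf{g}_\alpha$ and sum over $\alpha$ to obtain

(6) $$\Gamma^\alpha_{ij}\, \mathbf{g}_\alpha = \left( \frac{\partial \mathbf{g}_i}{\partial \theta^j} \cdot \mathbf{g}_k \right) \mathbf{g}^k .$$

The right-hand side, the sum of the vectors $\mathbf{g}^k$ multiplied by the scalar
quantities $(\partial \mathbf{g}_i / \partial \theta^j \cdot \mathbf{g}_k)$, is exactly equal to the vector $\partial \mathbf{g}_i / \partial \theta^j$ itself. To see
this, we note that since the vectors $\mathbf{g}^1, \mathbf{g}^2, \mathbf{g}^3$ are linearly independent
and form a basis of the vector space, $\partial \mathbf{g}_i / \partial \theta^j$ can be expressed as a linear
combination

(7) $$\frac{\partial \mathbf{g}_i}{\partial \theta^j} = \lambda_1\, \mathbf{g}^1 + \lambda_2\, \mathbf{g}^2 + \lambda_3\, \mathbf{g}^3 ,$$

where $\lambda_1, \lambda_2, \lambda_3$ are scalars. If we take the scalar product of this equation with $\mathbf{g}_1$, we obtain

(8) $$\lambda_1 = \frac{\partial \mathbf{g}_i}{\partial \theta^j} \cdot \mathbf{g}_1 .$$

Similarly, $\lambda_2, \lambda_3$ can be evaluated. A comparison of Eqs. (6), (8), and (7)
thus shows the truth of Eq. (2.15:4):

$$\Gamma^\alpha_{ij}\, \mathbf{g}_\alpha = \frac{\partial \mathbf{g}_i}{\partial \theta^j} . \qquad \text{Q.E.D.}$$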


2.16. PHYSICAL COMPONENTS OF A VECTOR 

The base vectors $\mathbf{g}_r$ and $\mathbf{g}^r$ are in general not unit vectors. In fact, their
lengths are

$$|\mathbf{g}_r| = \sqrt{g_{rr}} , \qquad |\mathbf{g}^r| = \sqrt{g^{rr}} , \qquad r \text{ not summed.}$$

Let us write Eq. (2.14:12) as

(1) $$\mathbf{v} = \sum_{r=1}^{3} v^r \sqrt{g_{rr}}\; \frac{\mathbf{g}_r}{\sqrt{g_{rr}}} = \sum_{r=1}^{3} v_r \sqrt{g^{rr}}\; \frac{\mathbf{g}^r}{\sqrt{g^{rr}}} .$$

Then, since $\mathbf{g}_r / \sqrt{g_{rr}}$ and $\mathbf{g}^r / \sqrt{g^{rr}}$ are unit vectors, all components $v^r \sqrt{g_{rr}}$
and $v_r \sqrt{g^{rr}}$ ($r$ not summed) will have the same physical dimensions. It is
seen that $v^r \sqrt{g_{rr}}$ are the components of $\mathbf{v}$ resolved in the direction of unit
vectors $\mathbf{g}_r / \sqrt{g_{rr}}$, which are tangent to the coordinate lines; and that $v_r \sqrt{g^{rr}}$
are the components of $\mathbf{v}$ resolved in the direction of unit vectors $\mathbf{g}^r / \sqrt{g^{rr}}$,
which are perpendicular to the coordinate planes. The components

$$v^r \sqrt{g_{rr}} , \qquad v_r \sqrt{g^{rr}} , \qquad r \text{ not summed,}$$

are called the physical components of the vector $\mathbf{v}$. They do not transform
according to the tensor transformation law and are not components of tensors.
Note that, in general, $v^r \sqrt{g_{rr}} \neq v_r \sqrt{g^{rr}}$ ($r$ not summed) except for
orthogonal coordinates.

We should remember that the tensor components of a physical quantity
which is referred to a particular curvilinear coordinate system may or may
not have the same physical dimensions. This difficulty (and it is also a great
convenience!) arises because we would like to keep our freedom in choosing
arbitrary curvilinear coordinates. Thus, in spherical polar coordinates for
a Euclidean space, the position of a point is expressed by a length and
two angles. In a four-dimensional space a point may be expressed in three
lengths and a time. For this reason, we must distinguish the tensor components
from the “physical components,” which must have uniform physical
dimensions.
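
A small numerical sketch may help here. It assumes plane polar coordinates, where $g_{11} = 1$ and $g_{22} = r^2$, and a hypothetical particle state; the function name and the numbers are illustrative choices, not taken from the text. The contravariant velocity components $(\dot{r}, \dot{\theta})$ carry different units, while the physical components $v^r \sqrt{g_{rr}}$ are both speeds.

```python
import math

def polar_physical_components(r, v_con):
    """Physical components v^r sqrt(g_rr) (r not summed) in plane polar coordinates."""
    g_diag = (1.0, r * r)  # g_11 = 1, g_22 = r^2
    return tuple(v_con[i] * math.sqrt(g_diag[i]) for i in range(2))

# Hypothetical state: r = 2 m, theta = 0.5 rad, dr/dt = 3 m/s, dtheta/dt = 4 rad/s.
r, theta = 2.0, 0.5
v_con = (3.0, 4.0)  # contravariant components, in m/s and rad/s respectively

v_phys = polar_physical_components(r, v_con)  # both entries are now in m/s

# Cross-check against the Cartesian velocity v = v^r g_r, with g_r = dR/dtheta^r.
g1 = (math.cos(theta), math.sin(theta))
g2 = (-r * math.sin(theta), r * math.cos(theta))
vx = v_con[0] * g1[0] + v_con[1] * g2[0]
vy = v_con[0] * g1[1] + v_con[1] * g2[1]
speed_cartesian = math.hypot(vx, vy)

# Valid as a Pythagorean sum only because polar coordinates are orthogonal.
speed_physical = math.hypot(v_phys[0], v_phys[1])
```

The physical components come out as $(3, 8)$ m/s, the familiar radial and transverse velocities $\dot{r}$ and $r\dot{\theta}$, and the two speed computations agree.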


PROBLEMS 

2.12. Show that in an orthogonal $n$-dimensional coordinate system, we
have for each component $i = j$, $g^{ij} = 1 / g_{ij}$.

2.13. If $a_{ij}$ is a tensor, and the components $a_{ij} = a_{ji}$, then the tensor $a_{ij}$
is called a symmetric tensor. If the components $a_{ij} = -a_{ji}$, then the tensor $a_{ij}$
is said to be skew-symmetric, or antisymmetric. Show that the symmetry of a
tensor with respect to two indices at the same level is conserved under coordinate
transformations. Since $a_{ij} = \frac{1}{2}(a_{ij} + a_{ji}) + \frac{1}{2}(a_{ij} - a_{ji})$, any covariant (or
contravariant) second-order tensor can be written as the sum of a symmetric and
a skew-symmetric tensor.

2.14. Show that the scalar product of a symmetric tensor $S^{ij}$ and a
skew-symmetric tensor $W_{ij}$ vanishes identically.

2.15. Show that the Cartesian tensor $\omega_{ik} = e_{ijk}\, u_j$ is skew-symmetric, where
$u_j$ is a vector.

2.16. Show that if $A_{jk}$ is a skew-symmetric Cartesian tensor, then the
unique solution of the equation $\omega_i = \frac{1}{2} e_{ijk}\, A_{jk}$ is $A_{mn} = e_{mni}\, \omega_i$.

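2.17. Let $\nabla$ be the operator

$$\nabla = \mathbf{g}^r\, \frac{\partial}{\partial \theta^r} .$$

Show that

$$\operatorname{grad} \phi = \nabla \phi = \mathbf{g}^r\, \frac{\partial \phi}{\partial \theta^r} ,$$
$$\operatorname{div} \mathbf{F} = \nabla \cdot \mathbf{F} = F^r|_r ,$$
$$\operatorname{curl} \mathbf{A} = \nabla \times \mathbf{A} = \epsilon^{rst}\, A_s|_r\, \mathbf{g}_t .$$

Show that these functions are invariant under coordinate transformations.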

2.18. Let $g^{\alpha\beta}$ be the associated contravariant Euclidean metric tensor and
$\psi(x^1, x^2, x^3)$ be a scalar field. Show that

(a) $g^{\alpha\beta}\, \psi|_{\alpha\beta}$ is a scalar field.

(b) In rectangular Cartesian coordinates $g_{\alpha\beta} = \delta_{\alpha\beta}$, the scalar $g^{\alpha\beta}\, \psi|_{\alpha\beta}$ reduces
to the form (writing $x^1 = x$, $x^2 = y$, $x^3 = z$)

$$\frac{\partial^2 \psi}{\partial x^2} + \frac{\partial^2 \psi}{\partial y^2} + \frac{\partial^2 \psi}{\partial z^2} .$$

(c) Hence, the Laplace equation in curvilinear coordinates with the scalar field
$\psi(x^1, x^2, x^3)$ is given by

$$g^{\alpha\beta}(x^1, x^2, x^3)\, \psi(x^1, x^2, x^3)|_{\alpha\beta} = 0 .$$

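2.19. Let $y^1, y^2, y^3$ (or $x, y, z$) be rectangular Cartesian coordinates and
$x^1, x^2, x^3$ (or $r, \phi, \theta$) be the spherical polar coordinates. Show that the Laplace
equation in spherical polar coordinates, when the unknown function is a scalar
field $\psi(r, \phi, \theta)$, is

$$\frac{\partial^2 \psi}{\partial r^2} + \frac{1}{r^2} \frac{\partial^2 \psi}{\partial \phi^2} + \frac{1}{r^2 \sin^2 \phi} \frac{\partial^2 \psi}{\partial \theta^2} + \frac{2}{r} \frac{\partial \psi}{\partial r} + \frac{\cot \phi}{r^2} \frac{\partial \psi}{\partial \phi} = 0 .$$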

2.20. Let $\xi^i(y^1, y^2, y^3)$ be the components of an unknown vector in rectangular
Cartesian coordinates. Let each component satisfy Laplace’s equation in
rectangular coordinates,

$$\frac{\partial^2 \xi^i}{(\partial y^1)^2} + \frac{\partial^2 \xi^i}{(\partial y^2)^2} + \frac{\partial^2 \xi^i}{(\partial y^3)^2} = 0 .$$

Show that a generalization of this equation is that $\xi^i(x^1, x^2, x^3)$ in curvilinear
coordinates $x^1, x^2, x^3$ will satisfy the system of three differential equations

$$g^{\alpha\beta}\, \xi^i|_{\alpha\beta} = 0 ,$$

where $\xi^i|_{\alpha\beta}$ is the second covariant derivative of $\xi^i(x^1, x^2, x^3)$.

2.21. Prove that

$$F^r|_r = \frac{1}{\sqrt{g}}\, \frac{\partial (\sqrt{g}\, F^r)}{\partial x^r} .$$

Hint: Since $g = g_{i\alpha}\, G^{i\alpha}$ ($i$ not summed), where $G^{ij}$ is the cofactor of the element $g_{ij}$ in
$g = |g_{ij}|$, show that


and that 


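Hence,

$$\Gamma^i_{i\alpha} = \frac{\partial}{\partial x^\alpha} \log \sqrt{g} .$$

2.22. Prove that the Laplacian of Prob. 2.18 can be written

$$g^{\alpha\beta}\, \psi|_{\alpha\beta} = \frac{1}{\sqrt{g}}\, \frac{\partial \left[ \sqrt{g}\, g^{\alpha\beta}\, (\partial \psi / \partial x^\beta) \right]}{\partial x^\alpha} .$$

Hint: Use the results of Prob. 2.21.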

2.23. Show that the covariant differentiation of sums and products follows
the usual rules for partial differentiation. Thus,

$$(\phi\, v^r)|_i = \phi_{,i}\, v^r + \phi\, v^r|_i ,$$
$$(v_r\, v^r)|_i = (v_r\, v^r)_{,i} = v_r|_i\, v^r + v_r\, v^r|_i ,$$
$$(A_{ij}\, B^{mn})|_r = A_{ij}|_r\, B^{mn} + A_{ij}\, B^{mn}|_r ,$$

where a comma indicates partial differentiation. Remember that the covariant
derivatives of a scalar are the same as the partial derivatives.

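2.24. Derive the transformation law for the Euclidean Christoffel symbols
$\Gamma^i_{\alpha\beta}(x^1, x^2, x^3)$. Ans. Under a transformation of coordinates from $x^i$ to $\bar{x}^i$, the
Euclidean metric tensor transforms as

$$\bar{g}_{\alpha\beta}(\bar{x}^1, \bar{x}^2, \bar{x}^3) = g_{\mu\nu}(x^1, x^2, x^3)\, \frac{\partial x^\mu}{\partial \bar{x}^\alpha} \frac{\partial x^\nu}{\partial \bar{x}^\beta} .$$

Differentiating both sides of the equation with respect to $\bar{x}^\sigma$, considering $\bar{x}^\sigma$ as
an independent variable, we obtain

$$\frac{\partial \bar{g}_{\alpha\beta}}{\partial \bar{x}^\sigma} = \left( \frac{\partial g_{\mu\nu}}{\partial x^\rho} \frac{\partial x^\rho}{\partial \bar{x}^\sigma} \right) \frac{\partial x^\mu}{\partial \bar{x}^\alpha} \frac{\partial x^\nu}{\partial \bar{x}^\beta} + g_{\mu\nu}\, \frac{\partial^2 x^\mu}{\partial \bar{x}^\alpha \, \partial \bar{x}^\sigma} \frac{\partial x^\nu}{\partial \bar{x}^\beta} + g_{\mu\nu}\, \frac{\partial x^\mu}{\partial \bar{x}^\alpha} \frac{\partial^2 x^\nu}{\partial \bar{x}^\beta \, \partial \bar{x}^\sigma} .$$

On first interchanging $\sigma$ and $\alpha$ in this formula and then interchanging $\beta$ and $\sigma$,
two similar expressions for $\partial \bar{g}_{\sigma\beta} / \partial \bar{x}^\alpha$ and $\partial \bar{g}_{\alpha\sigma} / \partial \bar{x}^\beta$ are obtained. Adding and
subtracting, with several further interchanges of dummy indices, and recalling
$g_{\nu\mu} = g_{\mu\nu}$, we obtain

$$\frac{1}{2} \left( \frac{\partial \bar{g}_{\sigma\beta}}{\partial \bar{x}^\alpha} + \frac{\partial \bar{g}_{\alpha\sigma}}{\partial \bar{x}^\beta} - \frac{\partial \bar{g}_{\alpha\beta}}{\partial \bar{x}^\sigma} \right) = \frac{1}{2} \left( \frac{\partial g_{\rho\nu}}{\partial x^\mu} + \frac{\partial g_{\mu\rho}}{\partial x^\nu} - \frac{\partial g_{\mu\nu}}{\partial x^\rho} \right) \frac{\partial x^\mu}{\partial \bar{x}^\alpha} \frac{\partial x^\nu}{\partial \bar{x}^\beta} \frac{\partial x^\rho}{\partial \bar{x}^\sigma} + g_{\mu\nu}\, \frac{\partial x^\mu}{\partial \bar{x}^\sigma} \frac{\partial^2 x^\nu}{\partial \bar{x}^\alpha \, \partial \bar{x}^\beta} .$$

Now,

$$\bar{g}^{i\sigma} = g^{\lambda\kappa}\, \frac{\partial \bar{x}^i}{\partial x^\lambda} \frac{\partial \bar{x}^\sigma}{\partial x^\kappa} .$$

A multiplication of the corresponding sides of the last two equations leads to the
desired result.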


2.25. All the results obtained above can be applied to two-dimensional
spaces by assigning the range of indices to be 1 to 2 instead of 1 to 3. Compute
the Euclidean Christoffel symbols for the plane polar coordinates ($x^1 = r$, $x^2 = \theta$).
Ans. $\Gamma^1_{22} = -x^1$, $\Gamma^2_{12} = \Gamma^2_{21} = 1 / x^1$, all other components = 0.

2.26. Show that $\Gamma^i_{mn}$ is symmetric in $m$ and $n$; i.e., $\Gamma^i_{mn} = \Gamma^i_{nm}$.

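2.27. Show that the necessary and sufficient condition that a given curvilinear
coordinate system be orthogonal is that $g_{ij} = 0$, if $i \neq j$, throughout the
domain.

2.28. Prove that $\mathbf{g}_r \times \mathbf{g}_s = \epsilon_{rst}\, \mathbf{g}^t$, $\mathbf{g}^r \times \mathbf{g}^s = \epsilon^{rst}\, \mathbf{g}_t$, where $\epsilon_{rst}$, $\epsilon^{rst}$ are
the permutation tensors of Sec. 2.6. Hence, if we denote the scalar product of the
vectors $\mathbf{g}_1$ and $\mathbf{g}_2 \times \mathbf{g}_3$ by $[\mathbf{g}_1 \mathbf{g}_2 \mathbf{g}_3]$ or $(\mathbf{g}_1, \mathbf{g}_2 \times \mathbf{g}_3)$, we have

$$[\mathbf{g}_1 \mathbf{g}_2 \mathbf{g}_3] = (\mathbf{g}_1, \mathbf{g}_2 \times \mathbf{g}_3) = \sqrt{g} , \qquad [\mathbf{g}^1 \mathbf{g}^2 \mathbf{g}^3] = (\mathbf{g}^1, \mathbf{g}^2 \times \mathbf{g}^3) = 1 / \sqrt{g} .$$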


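2.29. The element of area of a parallelogram with two adjacent edges $d\mathbf{s}_2 = \mathbf{g}_2\, d\theta^2$
and $d\mathbf{s}_3 = \mathbf{g}_3\, d\theta^3$ is

$$dS_1 = |d\mathbf{s}_2 \times d\mathbf{s}_3| = |\mathbf{g}_2 \times \mathbf{g}_3|\, d\theta^2\, d\theta^3 .$$

Show that $dS_1 = \sqrt{g\, g^{11}}\; d\theta^2\, d\theta^3$. In general, the element of area $dS_i$ of a
parallelogram formed by the elements $\mathbf{g}_j\, d\theta^j$ and $\mathbf{g}_k\, d\theta^k$ ($j$, $k$ not summed), on the
$\theta^i$-surface is $dS_i = \sqrt{g\, g^{ii}}\; d\theta^j\, d\theta^k$ ($i$ not summed, $i \neq j \neq k$).

2.30. With reference to Prob. 2.28, show that the volume element

$$dV = d\mathbf{s}_1 \cdot (d\mathbf{s}_2 \times d\mathbf{s}_3) = [\mathbf{g}_1 \mathbf{g}_2 \mathbf{g}_3]\, d\theta^1\, d\theta^2\, d\theta^3 = \sqrt{g}\; d\theta^1\, d\theta^2\, d\theta^3 .$$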


2.31. If the space is non-Euclidean, we cannot find a coordinate system in
which the metric tensor $g_{ij}$ has constant components everywhere throughout the
space. (In a Euclidean space there do exist just such coordinate systems, namely,
Cartesian coordinate systems.) In this case, we must compute the successive
derivatives $\xi_r|_{st}$ and $\xi_r|_{ts}$ according to the covariant differentiation rules. Show
that

$$\xi_r|_{st} - \xi_r|_{ts} = R^p_{rst}\, \xi_p ,$$

where

$$R^p_{rst} = \frac{\partial \Gamma^p_{rt}}{\partial x^s} - \frac{\partial \Gamma^p_{rs}}{\partial x^t} + \Gamma^\alpha_{rt}\, \Gamma^p_{\alpha s} - \Gamma^\alpha_{rs}\, \Gamma^p_{\alpha t} .$$

Show that $R^p_{rst}$ is a tensor of rank 4, which is the famous Riemann-Christoffel
curvature tensor. It is not a zero tensor in a general Riemannian space. Hence,
in general,

$$\xi_r|_{st} \neq \xi_r|_{ts}$$

in a Riemannian space.

Note: The results obtained above hold true also for two-dimensional spaces,
provided all indices range over 1 and 2 only. A curved shell in a three-dimensional
Euclidean space appears, in general, to be a two-dimensional non-Euclidean space
to a “two-dimensional” animal who has to measure distances right on the shell
surface and is never allowed to leave the shell surface to view the third dimension.
For a spherical surface, the nonvanishing components of the two-dimensional
Riemann-Christoffel curvature tensor are all equal to a constant, which may be
written as 1. For a flat plate, they are zero. For certain hyperboloidal surfaces all
the nonvanishing components of curvature may take on the value $-1$. Since the
spirit of the theory of thin elastic shells is to reduce all the properties of the shells
into differential equations describing the middle surface of the shell, an engineer
deals with non-Euclidean geometry rather frequently.

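2.32. In cylindrical coordinates with unit base vectors $(\mathbf{e}_r, \mathbf{e}_\theta, \mathbf{e}_z)$, a vector
$\mathbf{u}$ and its gradient, gradient transpose, and divergence are, respectively,

$$\mathbf{u} = u_r\, \mathbf{e}_r + u_\theta\, \mathbf{e}_\theta + u_z\, \mathbf{e}_z ,$$
$$\nabla \mathbf{u} = \frac{\partial \mathbf{u}}{\partial r}\, \mathbf{e}_r + \frac{1}{r} \frac{\partial \mathbf{u}}{\partial \theta}\, \mathbf{e}_\theta + \frac{\partial \mathbf{u}}{\partial z}\, \mathbf{e}_z ,$$
$$(\nabla \mathbf{u})^T = \mathbf{e}_r\, \frac{\partial \mathbf{u}}{\partial r} + \frac{\mathbf{e}_\theta}{r} \frac{\partial \mathbf{u}}{\partial \theta} + \mathbf{e}_z\, \frac{\partial \mathbf{u}}{\partial z} ,$$
$$\nabla \cdot \mathbf{u} = u^i|_i = \mathbf{e}_r \cdot \frac{\partial \mathbf{u}}{\partial r} + \frac{\mathbf{e}_\theta}{r} \cdot \frac{\partial \mathbf{u}}{\partial \theta} + \mathbf{e}_z \cdot \frac{\partial \mathbf{u}}{\partial z} = \frac{1}{r} \frac{\partial (r u_r)}{\partial r} + \frac{1}{r} \frac{\partial u_\theta}{\partial \theta} + \frac{\partial u_z}{\partial z} .$$

Show that $\nabla \cdot (\nabla \mathbf{u}) = \nabla (\nabla \cdot \mathbf{u}) = u^i|_{ij}\, \mathbf{g}^j$ in general, and for cylindrical coordinates,

$$\nabla \cdot (\nabla \mathbf{u}) = u^i|_{ij}\, \mathbf{g}^j = \frac{\partial (\nabla \cdot \mathbf{u})}{\partial r}\, \mathbf{e}_r + \frac{1}{r} \frac{\partial (\nabla \cdot \mathbf{u})}{\partial \theta}\, \mathbf{e}_\theta + \frac{\partial (\nabla \cdot \mathbf{u})}{\partial z}\, \mathbf{e}_z ,$$
$$\nabla \cdot (\nabla \mathbf{u})^T = u^i|_{kj}\, g^{kj}\, \mathbf{g}_i = \frac{1}{r} \frac{\partial}{\partial r} \left( r \frac{\partial \mathbf{u}}{\partial r} \right) + \frac{1}{r^2} \frac{\partial^2 \mathbf{u}}{\partial \theta^2} + \frac{\partial^2 \mathbf{u}}{\partial z^2} .$$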


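2.33. In spherical coordinates with unit base vectors $(\mathbf{e}_r, \mathbf{e}_\phi, \mathbf{e}_\theta)$, we have

$\mathbf{u} = u_r\mathbf{e}_r + u_\phi\mathbf{e}_\phi + u_\theta\mathbf{e}_\theta$ ,

$\nabla\mathbf{u} = \dfrac{\partial\mathbf{u}}{\partial r}\mathbf{e}_r + \dfrac{1}{r}\dfrac{\partial\mathbf{u}}{\partial\phi}\mathbf{e}_\phi + \dfrac{1}{r\sin\phi}\dfrac{\partial\mathbf{u}}{\partial\theta}\mathbf{e}_\theta$ ,

$(\nabla\mathbf{u})^T = \mathbf{e}_r\dfrac{\partial\mathbf{u}}{\partial r} + \dfrac{\mathbf{e}_\phi}{r}\dfrac{\partial\mathbf{u}}{\partial\phi} + \dfrac{\mathbf{e}_\theta}{r\sin\phi}\dfrac{\partial\mathbf{u}}{\partial\theta}$ ,

$\nabla\cdot\mathbf{u} = \mathbf{e}_r\cdot\dfrac{\partial\mathbf{u}}{\partial r} + \dfrac{\mathbf{e}_\phi}{r}\cdot\dfrac{\partial\mathbf{u}}{\partial\phi} + \dfrac{\mathbf{e}_\theta}{r\sin\phi}\cdot\dfrac{\partial\mathbf{u}}{\partial\theta}
= \dfrac{1}{r^2}\dfrac{\partial (r^2 u_r)}{\partial r} + \dfrac{1}{r\sin\phi}\dfrac{\partial(\sin\phi\, u_\phi)}{\partial\phi} + \dfrac{1}{r\sin\phi}\dfrac{\partial u_\theta}{\partial\theta}$ .

Show that

$\nabla\cdot(\nabla\mathbf{u}) = u^i|_{ij}\,\mathbf{g}^j = \dfrac{\partial(\nabla\cdot\mathbf{u})}{\partial r}\mathbf{e}_r + \dfrac{1}{r}\dfrac{\partial(\nabla\cdot\mathbf{u})}{\partial\phi}\mathbf{e}_\phi + \dfrac{1}{r\sin\phi}\dfrac{\partial(\nabla\cdot\mathbf{u})}{\partial\theta}\mathbf{e}_\theta$ ,

$\nabla\cdot(\nabla\mathbf{u})^T = u^i|^k_{\;k}\,\mathbf{g}_i = \dfrac{1}{r^2}\dfrac{\partial}{\partial r}\left(r^2\dfrac{\partial\mathbf{u}}{\partial r}\right) + \dfrac{1}{r^2\sin\phi}\dfrac{\partial}{\partial\phi}\left(\sin\phi\,\dfrac{\partial\mathbf{u}}{\partial\phi}\right) + \dfrac{1}{r^2\sin^2\phi}\dfrac{\partial^2\mathbf{u}}{\partial\theta^2}$ .

These equations are needed in the Navier equations in later chapters.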
STRESS TENSOR


The definitions of stress vector and stress components will be given and
the equations of equilibrium will be derived. We shall then show how the
stress components change when the frames of reference are changed from
one rectangular Cartesian frame of reference to another, and in this way
we will prove from a physical standpoint that the stress components transform
according to the tensor transformation rules. The symmetry of the
stress tensor will then be discussed, and the consequences of the symmetry
property will be derived. The principal stresses, the stress deviations,
the octahedral stresses, and finally, the stress tensor in general curvilinear
coordinates, form the material for the remainder of the chapter.

Except for Sec. 3.14 et seq., we shall use only rectangular Cartesian
frames of reference, whose coordinate axes will be denoted by $x_1, x_2, x_3$
and are rectilinear and orthogonal to each other. We shall use subscripts
for all components unless stated otherwise.

3.1. STRESSES 

Consider a configuration occupied by a body $B$ at some time (Fig. 3.1:1).
Imagine a closed surface $S$ within $B$. We would like to know the interaction
between the material exterior to this surface and that in the interior. In this
consideration, there arises the basic defining concept of continuum
mechanics: the stress principle of Euler and Cauchy.

Fig. 3.1:1. Stress principle.

Consider a small surface element of area $\Delta S$ on our imagined surface $S$.
Let us draw a unit vector $\boldsymbol{\nu}$ normal to $\Delta S$, with its direction outward from
the interior of $S$. Then we can distinguish the two sides of $\Delta S$ according
to the direction of $\boldsymbol{\nu}$. Consider the part of material lying on the positive
side of the normal. This part exerts a force $\Delta\mathbf{F}$ on the other part, which is
situated on the negative side of the normal. The force $\Delta\mathbf{F}$ is a function of
the area and the orientation of the surface. We introduce the assumption
that as $\Delta S$ tends to zero, the ratio $\Delta\mathbf{F}/\Delta S$ tends to a definite limit $d\mathbf{F}/dS$
and that the moment of the forces acting on the surface $\Delta S$ about any point
within the area vanishes in the limit. The limiting vector will be written as

(1)  $\overset{\nu}{\mathbf{T}} = \dfrac{d\mathbf{F}}{dS}$ ,

where an overhead $\nu$ is introduced to denote the direction of the normal
$\boldsymbol{\nu}$ of the surface $\Delta S$. The limiting vector $\overset{\nu}{\mathbf{T}}$ is called the stress vector, or
traction, and represents the force per unit area acting on the surface.

The assertion that there is defined upon any imagined closed surface
$S$ in the interior of a continuum a stress vector field whose action on the
material occupying the space interior to $S$ is equipollent to the action of the
exterior material upon it, is the stress principle of Euler and Cauchy.

Consider now a special case in which the surface $\Delta S_k$ is parallel to one of the
coordinate planes. Let the normal of $\Delta S_k$ be in the positive direction of the
$x_k$-axis. Let the stress vector acting on $\Delta S_k$ be denoted by $\overset{k}{\mathbf{T}}$, with three
components $\overset{k}{T}_1, \overset{k}{T}_2, \overset{k}{T}_3$ along the direction of the coordinate axes $x_1, x_2, x_3$,
respectively. The index $i$ of $\overset{k}{T}_i$ denotes the components of the force, and the
symbol $k$ indicates the surface on which the force acts. In this special case, we
introduce a new set of symbols for the stress components,

(2)  $\overset{k}{T}_1 = \tau_{k1}$ ,  $\overset{k}{T}_2 = \tau_{k2}$ ,  $\overset{k}{T}_3 = \tau_{k3}$ .

Fig. 3.1:2. Notations of stress components.


If we arrange the components of the tractions acting on the surfaces $k = 1$,
$k = 2$, $k = 3$ in a square matrix, we obtain

                               Components of Stresses
                                 1            2            3

    Surface normal to $x_1$    $\tau_{11}$    $\tau_{12}$    $\tau_{13}$

    Surface normal to $x_2$    $\tau_{21}$    $\tau_{22}$    $\tau_{23}$

    Surface normal to $x_3$    $\tau_{31}$    $\tau_{32}$    $\tau_{33}$

This is illustrated in Fig. 3.1:2. The components $\tau_{11}, \tau_{22}, \tau_{33}$ are called
normal stresses, and the remaining components $\tau_{12}, \tau_{13}$, etc., are called
shearing stresses. Each of these components has the dimension of force per
unit area, or that of [Mass] [Length]$^{-1}$ [Time]$^{-2}$.

A great diversity in notations for stress components exists in the literature.
The most widely used notations in American literature are, in
reference to a system of rectangular Cartesian coordinates $x, y, z$,

(3)  $\begin{pmatrix} \sigma_x & \tau_{xy} & \tau_{xz} \\ \tau_{yx} & \sigma_y & \tau_{yz} \\ \tau_{zx} & \tau_{zy} & \sigma_z \end{pmatrix}$ .

Love writes $X_x$, $Y_x$ for $\sigma_x$ and $\tau_{xy}$, and Todhunter and Pearson use $xx$,
$xy$. In this book we shall use both $\tau_{ij}$ and $\sigma_{ij}$. We use $\tau_{ij}$ to denote
stress tensors in general, and we use $\sigma_{ij}$ to denote the physical components
of stress tensors in curvilinear coordinates (see p. 86). In rectangular
Cartesian coordinates, the tensor components and the physical components
coincide. Hence, we use $\sigma_{ij}$ for Cartesian stress tensors. Although the lack
of uniformity may seem awkward, little confusion will arise, and in many
instances different notations actually result in clarity.

It is important to emphasize again that a stress will always be understood
to be the force (per unit area) which the part lying on the positive side of a
surface element (the side on the positive side of the outer normal) exerts on the
part lying on the negative side. Thus, if the outer normal of a surface element
points in the positive direction of the $x_1$-axis and $\tau_{11}$ is positive, the vector
representing the component of normal stress acting on the surface element will
point in the positive $x_1$-direction. But if $\tau_{11}$ is positive while the outer normal
points in the negative $x_1$-axis direction, then the stress vector acting on the
element also points to the negative $x_1$-axis direction (see Fig. 3.1:3).

Fig. 3.1:3. Senses of positive stresses.

Similarly, positive values of $\tau_{12}, \tau_{13}$ will imply shearing stress vectors
pointing to positive $x_2$, $x_3$-axes if the outer normal agrees in sense with the
$x_1$-axis, whereas they point to the negative $x_2$, $x_3$-direction if the outer
normal disagrees in sense with the $x_1$-axis, as illustrated in Fig. 3.1:3. A
careful study of the figure is essential. Naturally, these rules agree with the
usual notions of tension, compression, and shear.

3.2. LAWS OF MOTION 

The fundamental laws of mechanics for bodies of all kinds are Euler’s
equations, which extend Newton’s laws of motion for particles. Let the
coordinate system $x_1, x_2, x_3$ be an inertial frame of reference. Let the
space occupied by a material body at any time $t$ be denoted by $B(t)$. Let
$\mathbf{r}$ be the position vector of a particle with respect to the origin of the
coordinate system. Let $\mathbf{V}$ be the velocity vector of a particle at the point
$x_1, x_2, x_3$. Then

(1)  $\boldsymbol{\mathcal{P}} = \int_{B(t)} \mathbf{V}\rho\, dv$

is called the linear momentum of the body in the configuration $B(t)$, and

(2)  $\boldsymbol{\mathcal{H}} = \int_{B(t)} \mathbf{r}\times\mathbf{V}\rho\, dv$

is called the moment of momentum. In these formulas $\rho$ is the density of
the material and the integration is over the volume $B(t)$. Newton’s laws,
as stated by Euler for a continuum, assert that the rate of change of linear
momentum is equal to the total applied force $\boldsymbol{\mathcal{F}}$ acting on the body,

(3)  $\dot{\boldsymbol{\mathcal{P}}} = \boldsymbol{\mathcal{F}}$ ,

and that the rate of change of moment of momentum is equal to the total
applied torque $\boldsymbol{\mathcal{L}}$,

(4)  $\dot{\boldsymbol{\mathcal{H}}} = \boldsymbol{\mathcal{L}}$ .

The torque $\boldsymbol{\mathcal{L}}$ is taken with respect to the same point as the origin of the
position vector $\mathbf{r}$. It is easy to verify that if (3) holds, then if (4) holds for
one choice of origin, it holds for all choices of origin.†

†The derivatives $\dot{\boldsymbol{\mathcal{P}}}$ and $\dot{\boldsymbol{\mathcal{H}}}$ are material derivatives; i.e., the time rates of change of $\boldsymbol{\mathcal{P}}$ and
$\boldsymbol{\mathcal{H}}$ of a fixed set of particles (cf. Secs. 5.2 and 5.3).

It is assumed that force and torque are quantities about which we have
a priori information in certain frames of reference.

On material bodies considered in mechanics of continuous media, two
types of external forces act:

1. Body forces, acting on elements of volume of the body.

2. Surface forces or stresses, acting on surface elements. 


Examples of body forces are gravitational forces and electromagnetic 
forces. Examples of surface forces are aerodynamic pressure acting on a 
body and pressure due to mechanical contact of two bodies. 



Fig. 3.2:1. Body forces.

To specify a body force, we consider a volume bounded by an arbitrary
surface $S$ (Fig. 3.2:1). The resultant force vector contributed by the body
force is assumed to be representable in the form of a volume integral

$\int_B \mathbf{X}\, dv$ .

The three components of $\mathbf{X}$, namely, $X_1, X_2, X_3$, all of dimensions force per
unit volume, $M(LT)^{-2}$, are called the body force per unit volume. For example,
in a gravitational field,

$X_i = \rho g_i$ ,

where $g_i$ are components of a gravitational acceleration field and $\rho$ is the
density (mass per unit volume) at a given point of the body.

The surface force acting on an imagined surface in the interior of a body
is the stress vector conceived in Euler and Cauchy’s stress principle. According
to this concept, the total force acting upon the material occupying
the region $B$ interior to a closed surface $S$ is

(5)  $\boldsymbol{\mathcal{F}} = \oint_S \overset{\nu}{\mathbf{T}}\, dS + \int_B \mathbf{X}\, dv$ ,

where $\overset{\nu}{\mathbf{T}}$ is the stress vector acting on $dS$ whose outer normal vector is $\boldsymbol{\nu}$.
Similarly, the torque about the origin is given by the expression

(6)  $\boldsymbol{\mathcal{L}} = \oint_S \mathbf{r}\times\overset{\nu}{\mathbf{T}}\, dS + \int_B \mathbf{r}\times\mathbf{X}\, dv$ .

In the following section we shall make some elementary applications of 
these equations to obtain the fundamental properties of the stress tensor. 
3.3. CAUCHY’S FORMULA

With the equations of motion, we shall first derive a simple result which
states that the stress vector $\mathbf{T}^{(+)}$ representing the action of material exterior
to a surface element on the interior is equal in magnitude and opposite
in direction to the stress vector $\mathbf{T}^{(-)}$ which represents the action of the
interior material on the exterior across the same surface element:

(1) ▲  $\mathbf{T}^{(-)} = -\mathbf{T}^{(+)}$ .

To prove this, we consider a small “pill box” with two parallel surfaces of
area $\Delta S$ and thickness $\delta$, as shown in Fig. 3.3:1. When $\delta$ shrinks to zero,
while $\Delta S$ remains small but finite, the volume forces and the linear momentum
and its rate of change with time vanish, as well as the contribution of
surface forces on the sides of the pill box. The equation of motion (3.2:3)
implies, therefore, for small $\Delta S$,

$\mathbf{T}^{(+)}\Delta S + \mathbf{T}^{(-)}\Delta S = 0$ .

Equation (1) then follows.

Fig. 3.3:1. Equilibrium of a “pill box” across a surface $S$.

Another way of stating this result is that the stress vector is a function of
the normal vector to a surface. When the sense of direction of the normal
vector reverses, the stress vector reverses also.

Now we shall show that knowing the components $\tau_{ij}$, we can write down
at once the stress vector acting on any surface with unit outer normal vector
$\boldsymbol{\nu}$ whose components are $\nu_1, \nu_2, \nu_3$. This stress vector is denoted by $\overset{\nu}{\mathbf{T}}$, with
components $\overset{\nu}{T}_1, \overset{\nu}{T}_2, \overset{\nu}{T}_3$ given by Cauchy’s formula

(2) ▲  $\overset{\nu}{T}_i = \nu_j\tau_{ji}$ ,

which can be derived in several ways. We shall give first an elementary
derivation.

Fig. 3.3:2. Surface tractions on a tetrahedron.

Let us consider an infinitesimal tetrahedron formed by three surfaces
parallel to the coordinate planes and one normal to the unit vector $\boldsymbol{\nu}$ (see
Fig. 3.3:2). Let the area of the surface normal to $\boldsymbol{\nu}$ be $dS$. Then the areas
of the other three surfaces are

$dS_1 = dS\cos(\boldsymbol{\nu}, x_1) = \nu_1\, dS$ = area of surface ∥ to the $x_2x_3$-plane,

$dS_2 = \nu_2\, dS$ = area of surface ∥ to the $x_3x_1$-plane,

$dS_3 = \nu_3\, dS$ = area of surface ∥ to the $x_1x_2$-plane,

and the volume of the tetrahedron is

$dv = \frac{1}{3}h\, dS$ ,

where $h$ is the height of the vertex $P$ from the base $dS$. The forces in
the positive direction of $x_1$ acting on the three coordinate surfaces can be
written as


$(-\tau_{11} + \varepsilon_1)\,dS_1$ ,  $(-\tau_{21} + \varepsilon_2)\,dS_2$ ,  $(-\tau_{31} + \varepsilon_3)\,dS_3$ ,

where $\tau_{11}, \tau_{21}, \tau_{31}$ are the stresses at the point $P$. The negative sign is
obtained because the outer normals to the three surfaces are opposite in
sense with respect to the coordinate axes, and the $\varepsilon$’s are inserted because
the tractions act at points slightly different from $P$. If we assume that
the stress field is continuous, then $\varepsilon_1, \varepsilon_2, \varepsilon_3$ are infinitesimal quantities.
On the other hand, the force acting on the triangle normal to $\boldsymbol{\nu}$ has a
component $(\overset{\nu}{T}_1 + \varepsilon)\,dS$ in the $x_1$-axis direction, the body force has an
$x_1$-component equal to $(X_1 + \varepsilon')\,dv$, and the rate of change of linear momentum
has a component $\rho\dot{V}_1\,dv$. Here $\overset{\nu}{T}_1$ and $X_1$ refer to the point $P$ and $\varepsilon$, $\varepsilon'$
are again infinitesimal. The first equation of motion is thus

$(-\tau_{11} + \varepsilon_1)\nu_1\,dS + (-\tau_{21} + \varepsilon_2)\nu_2\,dS + (-\tau_{31} + \varepsilon_3)\nu_3\,dS
+ (\overset{\nu}{T}_1 + \varepsilon)\,dS + (X_1 + \varepsilon')\tfrac{1}{3}h\,dS = \rho\dot{V}_1\tfrac{1}{3}h\,dS$ .

Dividing through by $dS$ and taking the limit $h \to 0$, one obtains

(3)  $\overset{\nu}{T}_1 = \tau_{11}\nu_1 + \tau_{21}\nu_2 + \tau_{31}\nu_3$ ,

which is the first component of Eq. (2). Other components follow similarly.
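
As a small numerical illustration of Cauchy's formula, the following Python sketch evaluates the traction components $T_i = \nu_j\tau_{ji}$ for a given stress matrix and unit normal. The function name `traction` and the numerical stress values are hypothetical, chosen only for illustration; they do not come from the text.

```python
def traction(tau, nu):
    """Cauchy's formula: T_i = nu_j * tau[j][i] for a 3x3 stress matrix."""
    return [sum(nu[j] * tau[j][i] for j in range(3)) for i in range(3)]


# Hypothetical stress state in arbitrary units; tau[j][i] stores tau_ji.
tau = [[10.0, 2.0, 0.0],
       [2.0, 5.0, -1.0],
       [0.0, -1.0, 3.0]]

# On a surface whose outer normal is the positive x_1-axis, the traction
# is simply the first row of the stress matrix.
t_plus = traction(tau, [1.0, 0.0, 0.0])

# Reversing the normal reverses the traction, in agreement with Eq. (1).
t_minus = traction(tau, [-1.0, 0.0, 0.0])
```

The same function applies to any unit normal; only the three direction cosines change.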

Cauchy’s formula assures us that the nine components of stresses $\tau_{ij}$ are
necessary and sufficient to define the traction across any surface element
in a body. Hence the stress state in a body is characterized completely by
the set of quantities $\tau_{ij}$. Since $\overset{\nu}{T}_i$ is a vector and Eq. (2) is valid for an
arbitrary vector $\nu_j$, it follows from the quotient rule (Sec. 2.10) that $\tau_{ij}$ is
a tensor. Henceforth $\tau_{ij}$ will be called a stress tensor.

We note again that in the theoretical development up to this point we
have assumed, first, that stress can be defined everywhere in a body, and,
second, that the stress field is continuous. The same assumption will be
made later with respect to strain. These are characteristic assumptions of
continuum mechanics. Without these assumptions we can do very little
indeed. However, in the further development of the theory, certain mathematical
discontinuities will be permitted (often they are very useful tools),
but one should always view these discontinuities with great care against
the general basic assumptions of continuity of the stress and strain fields.

3.4. EQUATIONS OF EQUILIBRIUM 

We shall now transform the equations of motion (3.2:3), (3.2:4) into differential
equations. This can be done elegantly by means of Gauss’ theorem
and Cauchy’s formula. But we shall pursue here an elementary course to
assure physical clarity.

Consider the static equilibrium state of an infinitesimal parallelepiped
with surfaces parallel to the coordinate planes. The stresses acting on the
various surfaces are shown in Fig. 3.4:1. The force $\tau_{11}\,dx_2\,dx_3$ acts on the
left-hand side, the force $\left(\tau_{11} + \dfrac{\partial\tau_{11}}{\partial x_1}dx_1\right)dx_2\,dx_3$ acts on the right-hand side,
etc. These expressions are based on the assumption of continuity of the
stresses. The body force is $X_i\,dx_1\,dx_2\,dx_3$.

Fig. 3.4:1. Equilibrating stress components on an infinitesimal parallelepiped.


Now, the equilibrium of the body demands that the resultant forces
vanish. Consider the forces in the $x_1$-direction. As shown in Fig. 3.4:2, we
have six components of surface forces and one component of body force.
The sum is

$\left(\tau_{11} + \dfrac{\partial\tau_{11}}{\partial x_1}dx_1\right)dx_2\,dx_3 - \tau_{11}\,dx_2\,dx_3
+ \left(\tau_{21} + \dfrac{\partial\tau_{21}}{\partial x_2}dx_2\right)dx_3\,dx_1 - \tau_{21}\,dx_3\,dx_1$
$\quad + \left(\tau_{31} + \dfrac{\partial\tau_{31}}{\partial x_3}dx_3\right)dx_1\,dx_2 - \tau_{31}\,dx_1\,dx_2 + X_1\,dx_1\,dx_2\,dx_3 = 0$ .

Dividing by $dx_1\,dx_2\,dx_3$, we obtain

$\dfrac{\partial\tau_{11}}{\partial x_1} + \dfrac{\partial\tau_{21}}{\partial x_2} + \dfrac{\partial\tau_{31}}{\partial x_3} + X_1 = 0$ .

A cyclic permutation of subscripts 1, 2, 3 leads to similar equations of
equilibrium of forces in $x_2$, $x_3$-directions. The whole set, written concisely, is

$\dfrac{\partial\tau_{ji}}{\partial x_j} + X_i = 0$ .

This is an important result. A shorter derivation will be given later in
Sec. 5.5.
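
The equilibrium equations $\partial\tau_{ji}/\partial x_j + X_i = 0$ can be checked numerically for any proposed stress field. The sketch below is only an illustration: the polynomial stress field, the body force, and the names `stress`, `BODY_FORCE`, and `equilibrium_residual` are assumptions made for this example, and the derivatives are approximated by central differences rather than evaluated analytically.

```python
def stress(x):
    """Hypothetical stress field tau[j][i] = tau_ji at the point x = (x1, x2, x3)."""
    x1, x2, x3 = x
    s12 = -2.0 * x1 * x2
    return [[x1 * x1, s12, 0.0],
            [s12, x2 * x2, 0.0],
            [0.0, 0.0, 3.0 * x3]]


# Assumed body force per unit volume that balances the field above.
BODY_FORCE = [0.0, 0.0, -3.0]


def equilibrium_residual(stress_fn, body_force, x, h=1e-3):
    """Return d(tau_ji)/dx_j + X_i for i = 1, 2, 3 using central differences."""
    residual = []
    for i in range(3):
        total = body_force[i]
        for j in range(3):
            x_plus = list(x)
            x_minus = list(x)
            x_plus[j] += h
            x_minus[j] -= h
            total += (stress_fn(x_plus)[j][i] - stress_fn(x_minus)[j][i]) / (2.0 * h)
        residual.append(total)
    return residual


residual = equilibrium_residual(stress, BODY_FORCE, [0.3, -0.7, 1.2])
```

For a field in equilibrium the three residuals vanish to within rounding error; a nonzero residual indicates an unbalanced force per unit volume.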
Fig. 3.4:2. Components of tractions in $x_1$ direction.


The equilibrium of an element requires also that the resultant moment
vanishes. If there do not exist external moments proportional to a volume,
the consideration of moments will lead to the important conclusion that the
stress tensor is symmetric,

(3) ▲  $\tau_{ij} = \tau_{ji}$ .

Fig. 3.4:3. Components of tractions that contribute moment about $Ox_3$-axis.

This is demonstrated as follows. Referring to Fig. 3.4:3 and considering the
moment of all the forces about the axis $Ox_3$, we see that those components
of forces parallel to $Ox_3$ or lying in planes containing $Ox_3$ do not contribute
any moment. The components that do contribute a moment about the $Ox_3$-axis
are shown in Fig. 3.4:3. Therefore, properly taking care of the moment
arm, we have


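$-\left(\tau_{11} + \dfrac{\partial\tau_{11}}{\partial x_1}dx_1\right)dx_2\,dx_3\cdot\dfrac{dx_2}{2} + \tau_{11}\,dx_2\,dx_3\cdot\dfrac{dx_2}{2}$

$\quad + \left(\tau_{12} + \dfrac{\partial\tau_{12}}{\partial x_1}dx_1\right)dx_2\,dx_3\cdot dx_1 - \left(\tau_{21} + \dfrac{\partial\tau_{21}}{\partial x_2}dx_2\right)dx_1\,dx_3\cdot dx_2$

$\quad + \left(\tau_{22} + \dfrac{\partial\tau_{22}}{\partial x_2}dx_2\right)dx_1\,dx_3\cdot\dfrac{dx_1}{2} - \tau_{22}\,dx_1\,dx_3\cdot\dfrac{dx_1}{2}$

$\quad + \left(\tau_{32} + \dfrac{\partial\tau_{32}}{\partial x_3}dx_3\right)dx_1\,dx_2\cdot\dfrac{dx_1}{2} - \tau_{32}\,dx_1\,dx_2\cdot\dfrac{dx_1}{2}$

$\quad - \left(\tau_{31} + \dfrac{\partial\tau_{31}}{\partial x_3}dx_3\right)dx_1\,dx_2\cdot\dfrac{dx_2}{2} + \tau_{31}\,dx_1\,dx_2\cdot\dfrac{dx_2}{2}$

$\quad - X_1\,dx_1\,dx_2\,dx_3\cdot\dfrac{dx_2}{2} + X_2\,dx_1\,dx_2\,dx_3\cdot\dfrac{dx_1}{2} = 0$ .

On dividing through by $dx_1\,dx_2\,dx_3$ and passing to the limit $dx_1 \to 0$,
$dx_2 \to 0$, $dx_3 \to 0$, we obtain

(4)  $\tau_{12} = \tau_{21}$ .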

Similar considerations of resultant moments about $Ox_1$, $Ox_2$ lead to the
general result given by Eq. (3). Again a shorter derivation will be given
later in Sec. 5.5.

It should be noted that if an external moment proportional to the volume
does exist, then the symmetry condition does not hold. For example,
if there is a moment $c_3\,dx_1\,dx_2\,dx_3$ about the axis $Ox_3$, then we obtain in
place of Eq. (4) the result

(5)  $\tau_{12} - \tau_{21} + c_3 = 0$ .

Maxwell pointed out that nonvanishing body moments exist in a magnet
in a magnetic field and in a dielectric material in an electric field with
different planes of polarization. If the electromagnetic field is so intense
and the stress level is so low that $\tau_{12}$ and $c_3$ are of the same order of
magnitude, then, according to (5), $\tau_{12}$ cannot be equated to $\tau_{21}$. In this
case, we have to admit the stress tensor $\tau_{ij}$ as asymmetric. If $c_3$ is very much
smaller in comparison with $\tau_{12}$, then we can omit $c_3$ in (5) and consider (4)
as valid approximately.

In developing a physical theory, particularly for the purpose of engineering,
one of the most important objectives is to obtain the simplest formulation
consistent with the desired degree of accuracy. A decision of whether
or not we shall treat the stress tensor as symmetric must be based on the
purpose of the theory. Since electromagnetic fields pervade the universe,
the stress tensor is in general unsymmetric. But, if the theory is formulated
for the purpose of a structural or mechanical engineer who studies the stress
distribution in a structure or a machine with a view towards assessing its
strength, stability, or rigidity, then a stress is important when it is of the
order of the yielding stress of the material. Even for a structure which is
designed primarily on the basis of stability, such as a column, an arch, or
a thin-walled shell, a good design should produce a critical stress of the
order of the yielding stress under the critical conditions, for otherwise the
material is not economically used. When $\tau_{ij}$ is comparable to the yielding
stress in magnitude (of order 10,000 to 100,000 lb/sq in. or 70 to 700 MPa
for a steel, or 50 to 5,000 lb/sq in. or 0.35 to 35 MPa for a concrete), there
are few circumstances in which the assumption of symmetry in stress tensor
should cause concern.

However, if one wants to study the influence of a strong electromagnetic 
field on the propagation of elastic waves, or such influence on some high- 
frequency phenomenon in the material, then the stress level may be very 
low and the body moment may be significant. In such problems the stress 
tensor may not be assumed symmetric. 

In the rest of this book the stress tensor will be assumed to be symmetric 
unless stated otherwise. 

Notes on Couple-stresses 

If, following Voigt, we assume that across any infinitesimal surface element
in a solid the action of the exterior material upon the interior is
equipollent to a force and a couple (in contrast to the assumption made in
Sec. 3.1) then in addition to the traction $\overset{\nu}{\mathbf{T}}$ that acts on the surface we must
have also a couple-stress vector $\overset{\nu}{\mathbf{M}}$. These two vectors $\overset{\nu}{\mathbf{T}}$ and $\overset{\nu}{\mathbf{M}}$, together,
are now equipollent to the action of the exterior upon the interior. Similarly,
one might have body couples as pointed out by Maxwell, i.e., couple
per unit mass, $\mathbf{c}$, with components $c_i$ ($i = 1, 2, 3$). If we accept these
possibilities, then we must define a couple-stress tensor, $\mathcal{M}_{ij}$, in addition to
the stress tensor $\tau_{ij}$. The tensor $\mathcal{M}_{ij}$ is related to the couple-stress vector
by a linear transformation like Eq. (3.3:2):

$\overset{\nu}{M}_i = \mathcal{M}_{ji}\nu_j$ .

An analysis of the angular momentum then leads to the equation

$\dfrac{\partial\mathcal{M}_{ji}}{\partial x_j} + \rho c_i = e_{ijk}\tau_{jk}$ ,

i.e.,

$\dfrac{\partial\mathcal{M}_{xx}}{\partial x} + \dfrac{\partial\mathcal{M}_{yx}}{\partial y} + \dfrac{\partial\mathcal{M}_{zx}}{\partial z} + \rho c_x = \tau_{yz} - \tau_{zy}$ , etc.

Thus, the antisymmetric part of the stress tensor is determined by the
body couples and the divergence of the couple-stress tensor. When couples
of both kinds are absent, the stress tensor must be symmetric.

Couple-stresses and body couples are useful concepts in dealing with
submicron structures and molecular mechanics of materials, and in the
dislocation theory of metals.


3.5. TRANSFORMATION OF COORDINATES 

In the previous section, the components of stress $\tau_{ij}$ are defined with
respect to a rectangular Cartesian system $x_1, x_2, x_3$. Let us now take a
second set of rectangular Cartesian coordinates $x'_1, x'_2, x'_3$ with the same
origin but oriented differently, and consider the stress components in the
new reference system (Fig. 3.5:1). Let these coordinates be connected by
the linear relations

(1)  $x'_k = \beta_{ki}x_i = \dfrac{\partial x'_k}{\partial x_i}x_i$ ,  $k = 1, 2, 3$ ,

where $\beta_{ki} = \cos(x'_k, x_i)$ are the direction cosines of the $x'_k$-axis with respect
to the $x_i$-axis. Since $\tau_{ij}$ is a tensor (Sec. 3.3) we can write down the
transformation law at once. However, in order to emphasize the importance
of the result we shall insert an elementary derivation based on Cauchy’s
formula derived in Sec. 3.3, which states that if $dS$ is a surface element
whose unit outer normal vector $\boldsymbol{\nu}$ has components $\nu_i$, then the force per
unit area acting on $dS$ is a vector $\overset{\nu}{\mathbf{T}}$ with components

$\overset{\nu}{T}_i = \tau_{ji}\nu_j$ .

Fig. 3.5:1. Transformation of stress components under rotation of coordinates system
(left panel: given stresses; right panel: desired stresses).

If the normal $\boldsymbol{\nu}$ is chosen to be parallel to the axis $x'_k$, so that

$\nu_1 = \beta_{k1}$ ,  $\nu_2 = \beta_{k2}$ ,  $\nu_3 = \beta_{k3}$ ,

then denoting $\overset{\nu}{T}_i$ as $\overset{k}{T}_i$, we have

$\overset{k}{T}_i = \overset{\nu}{T}_i = \tau_{ji}\beta_{kj}$ .

Since the component of the vector $\overset{k}{\mathbf{T}}$ $(= \overset{\nu}{\mathbf{T}})$ in the direction $x'_m$ is $\tau'_{km}$, i.e.,

$\overset{k}{\mathbf{T}} = \tau'_{km}\mathbf{g}'_m$ ,

then

$\overset{k}{\mathbf{T}} = \tau'_{km}\mathbf{g}'_m = \overset{k}{T}_i\mathbf{g}_i = \tau_{ji}\beta_{kj}\mathbf{g}_i$ ,

where $\mathbf{g}'_m$ and $\mathbf{g}_i$ are the unit base vectors of the two coordinate systems.
Using the relation $\mathbf{g}_i = \beta_{mi}\mathbf{g}'_m$ [Eq. (2.14:16)] and Eq. (1) for rectangular
Cartesian coordinates, we obtain

$\overset{k}{\mathbf{T}} = \tau'_{km}\mathbf{g}'_m = \tau_{ji}\beta_{kj}\beta_{mi}\mathbf{g}'_m$ ,

i.e.,

(3) ▲  $\tau'_{km} = \tau_{ji}\beta_{kj}\beta_{mi} = \tau_{ji}\dfrac{\partial x'_k}{\partial x_j}\dfrac{\partial x'_m}{\partial x_i}$ .

If we compare Eq. (3) and Eq. (2.5:2) we see that the stress components
transform like a Cartesian tensor of rank two. Thus, the physical concept
of stress which is described by $\tau_{ij}$ agrees with the mathematical definition
of a tensor of rank two in a Euclidean space.
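
The transformation law $\tau'_{km} = \tau_{ji}\beta_{kj}\beta_{mi}$ can be tried out directly. The following Python sketch applies it for a rotation of the axes about $x_3$; the function name `rotate_stress`, the 30° angle, and the stress values are hypothetical choices made for illustration only.

```python
import math


def rotate_stress(tau, beta):
    """Return tau'_km = tau_ji * beta_kj * beta_mi for 3x3 nested lists."""
    return [[sum(tau[j][i] * beta[k][j] * beta[m][i]
                 for j in range(3) for i in range(3))
             for m in range(3)]
            for k in range(3)]


# Direction cosines beta[k][i] = cos(x'_k, x_i) for a rotation about x_3.
theta = math.radians(30.0)
c, s = math.cos(theta), math.sin(theta)
beta = [[c, s, 0.0],
        [-s, c, 0.0],
        [0.0, 0.0, 1.0]]

# Hypothetical symmetric stress state in arbitrary units.
tau = [[10.0, 2.0, 0.0],
       [2.0, 5.0, -1.0],
       [0.0, -1.0, 3.0]]

tau_new = rotate_stress(tau, beta)
trace_old = sum(tau[i][i] for i in range(3))
trace_new = sum(tau_new[i][i] for i in range(3))
```

Two simple checks on the result are that the transformed matrix remains symmetric and that the sum of the normal stresses is unchanged by the rotation.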

3.6. PLANE STATE OF STRESS 

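A state of stress in which

(1)  $\tau_{33} = \tau_{31} = \tau_{32} = 0$ ,

is called a plane state of stress in the $x_1x_2$-plane. In this case, the direction
cosines between two systems of rectangular Cartesian coordinates can be
expressed in terms of a single angle $\theta$, as shown in Fig. 3.6:1. We have,

$(\beta_{ij}) = \begin{pmatrix} \cos\theta & \sin\theta & 0 \\ -\sin\theta & \cos\theta & 0 \\ 0 & 0 & 1 \end{pmatrix}$ .

Fig. 3.6:1. Change of coordinates in plane state of stress.

Writing $x$, $y$ and $x'$, $y'$ in place of $x_1$, $x_2$ and $x'_1$, $x'_2$; $\sigma_x$ for $\tau_{11}$; $\tau_{xy}$ for $\tau_{12}$,
etc., we have

$\sigma'_x = \sigma_x\cos^2\theta + \sigma_y\sin^2\theta + 2\tau_{xy}\sin\theta\cos\theta$ ,

$\sigma'_y = \sigma_x\sin^2\theta + \sigma_y\cos^2\theta - 2\tau_{xy}\sin\theta\cos\theta$ ,

$\tau'_{xy} = (-\sigma_x + \sigma_y)\sin\theta\cos\theta + \tau_{xy}(\cos^2\theta - \sin^2\theta)$ .

Since

$\sin^2\theta = \frac{1}{2}(1 - \cos 2\theta)$ ,  $\cos^2\theta = \frac{1}{2}(1 + \cos 2\theta)$ ,

we may also write

$\sigma'_x = \dfrac{\sigma_x + \sigma_y}{2} + \dfrac{\sigma_x - \sigma_y}{2}\cos 2\theta + \tau_{xy}\sin 2\theta$ ,

(3)  $\sigma'_y = \dfrac{\sigma_x + \sigma_y}{2} - \dfrac{\sigma_x - \sigma_y}{2}\cos 2\theta - \tau_{xy}\sin 2\theta$ ,

$\tau'_{xy} = -\dfrac{\sigma_x - \sigma_y}{2}\sin 2\theta + \tau_{xy}\cos 2\theta$ .

Note that

(4)  $\sigma'_x + \sigma'_y = \sigma_x + \sigma_y$ ,

(5)  $\dfrac{\partial\sigma'_x}{\partial\theta} = 2\tau'_{xy}$ ,

(6)  $\tau'_{xy} = 0$  when  $\tan 2\theta = \dfrac{2\tau_{xy}}{\sigma_x - \sigma_y}$ .

The directions given by the particular values of $\theta$ given by (6) are called
the principal directions; the corresponding normal stresses are called the
principal stresses (see Sec. 3.7). Following (5) and (6), the principal stresses
are extreme values of the normal stresses,

$\left.\begin{matrix}\sigma_{\max}\\ \sigma_{\min}\end{matrix}\right\} = \dfrac{\sigma_x + \sigma_y}{2} \pm \sqrt{\left(\dfrac{\sigma_x - \sigma_y}{2}\right)^2 + \tau_{xy}^2}$ .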
Fig. 3.6:2. Mohr’s circle in plane state of stress (axes: $\sigma$ and $\tau$).

Differentiating $\tau'_{xy}$ with respect to $\theta$ and setting the derivative to zero, we
can find the angle $\theta$ at which $\tau'_{xy}$ attains its extreme value. This angle is
easily seen to be ±45° from the principal directions given by (6), and the
maximum value of $\tau'_{xy}$ is

$(\tau'_{xy})_{\max} = \sqrt{\left(\dfrac{\sigma_x - \sigma_y}{2}\right)^2 + \tau_{xy}^2}$ .

Figure 3.6:2 is a geometric representation of the relations above. It is the
well-known Mohr’s circle.
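
The plane-stress relations of this section lend themselves to a quick numerical check. The sketch below evaluates the double-angle form of the transformation (Eq. 3), locates a principal direction by setting $\tau'_{xy} = 0$ in that equation, and compares the shear stress at 45° from it with the radius of Mohr's circle. The function name `plane_stress` and the stress values are assumptions made for this example.

```python
import math


def plane_stress(sx, sy, txy, theta):
    """Return (sigma'_x, sigma'_y, tau'_xy) for axes rotated by theta (radians)."""
    mean = 0.5 * (sx + sy)
    half = 0.5 * (sx - sy)
    c2, s2 = math.cos(2.0 * theta), math.sin(2.0 * theta)
    return (mean + half * c2 + txy * s2,
            mean - half * c2 - txy * s2,
            -half * s2 + txy * c2)


# Hypothetical plane stress state in arbitrary units.
sx, sy, txy = 80.0, 20.0, 40.0

# A principal direction: tan(2*theta) = 2*txy / (sx - sy).
theta_p = 0.5 * math.atan2(2.0 * txy, sx - sy)
s_max, s_min, t_principal = plane_stress(sx, sy, txy, theta_p)

# Radius of Mohr's circle, i.e., the extreme value of the shear stress.
radius = math.hypot(0.5 * (sx - sy), txy)

# Shear stress on axes turned a further 45 degrees from the principal axes.
_, _, t_45 = plane_stress(sx, sy, txy, theta_p + math.pi / 4.0)
```

With these numbers the centre of the circle is at 50 and its radius is 50, so the principal stresses come out as 100 and 0, and the shear stress at 45° has magnitude 50.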

Problem 3.1. Find the transformation law for the moments of inertia and 
products of inertia of an area about a set of rectangular Cartesian coordinates in 
a plane, 



with respect to rotation of the coordinate axes about the origin. Mohr’s circle 
was invented in 1887 for the transformation of the inertia tensor. 

Problem 3.2. Let $v_i$, $i = 1, 2, 3$, be the velocity vector field of a continuum,
and let $\overline{v_iv_j}$ be the average value of the product $v_iv_j$ over a period of time.
Show that the correlation function $\overline{v_iv_j}$, with components $\overline{u^2}$, $\overline{uv}$, $\overline{uw}$, $\overline{v^2}$, etc., in
unabridged notations, is a symmetric tensor of the second order.

Problem 3.3. Show that the mass moment of inertia of a set of particles,

$I_{ij} = e_{ipq}e_{jkq}\int x_px_k\, dm$ ,  $i, j = 1, 2, 3$ ,

is a tensor, where $dm$ is an element of mass and the integration is extended over
the entire set of particles. Write out the matrix of the inertia tensor $I_{ij}$. Show
that $I_{ii}$ ($i$ not summed) is the moment of inertia about the $x_i$ axis, whereas $I_{ij}$
($i \neq j$) is equal to the negative of the product of inertia about the axes $x_i$ and
$x_j$. Show that for a rigid body rotating at an angular velocity $\omega_j$, the angular
momentum vector of the body is $I_{ij}\omega_j$.

3.7. PRINCIPAL STRESSES 

The results of the previous section are restricted to the plane state of 
stress. Let us now generalize them to general three dimensional problems. 

In a general state of stress, the stress vector acting on a surface with
outer normal $\boldsymbol{\nu}$ depends on the direction of $\boldsymbol{\nu}$. Let us ask in what direction
$\boldsymbol{\nu}$ the stress vector becomes normal to the surface, on which the shearing
stress vanishes. Such a surface is called a principal plane, its normal a
principal axis, and the value of the normal stress acting on the principal
plane is called a principal stress.

Let $\boldsymbol{\nu}$ define a principal axis and let $\sigma$ be the corresponding principal
stress. Then the stress vector acting on the surface normal to $\boldsymbol{\nu}$ has components
$\sigma\nu_i$. On the other hand, this same vector is given by the expression
$\tau_{ji}\nu_j$. Hence, writing $\nu_i = \delta_{ji}\nu_j$, we have, on equating these two expressions
and transposing them to the same side,

(1)  $(\tau_{ji} - \sigma\delta_{ji})\nu_j = 0$ .

The three equations, $i = 1, 2, 3$, are to be solved for $\nu_1, \nu_2, \nu_3$. Since
$\boldsymbol{\nu}$ is a unit vector, we must find a set of nontrivial solutions for which
$\nu_1^2 + \nu_2^2 + \nu_3^2 = 1$. Thus, Eq. (1) poses an eigenvalue problem. Since $\tau_{ij}$ as a
matrix is real and symmetric, we need only to recall a result in the theory
of matrices to assert that there exist three real-valued principal stresses and
a set of orthonormal principal axes. Because of the importance of these
results, we shall give the details of the reasoning below.

Equation (1) has a set of nonvanishing solutions $\nu_1, \nu_2, \nu_3$ if and only
if the determinant of the coefficients vanishes, i.e.,

(2)  $|\tau_{ij} - \sigma\delta_{ij}| = 0$ .

Equation (2) is a cubic equation in $\sigma$; its roots are the principal stresses.
For each value of the principal stress, a unit normal vector $\boldsymbol{\nu}$ can be
determined. On expanding Eq. (2), we have

(3)  $|\tau_{ij} - \sigma\delta_{ij}| = \begin{vmatrix} \tau_{11} - \sigma & \tau_{12} & \tau_{13} \\ \tau_{21} & \tau_{22} - \sigma & \tau_{23} \\ \tau_{31} & \tau_{32} & \tau_{33} - \sigma \end{vmatrix} = -\sigma^3 + I_1\sigma^2 - I_2\sigma + I_3 = 0$ ,

where

$I_1 = \tau_{11} + \tau_{22} + \tau_{33}$ ,

(4)  $I_2 = \begin{vmatrix} \tau_{22} & \tau_{23} \\ \tau_{32} & \tau_{33} \end{vmatrix} + \begin{vmatrix} \tau_{11} & \tau_{13} \\ \tau_{31} & \tau_{33} \end{vmatrix} + \begin{vmatrix} \tau_{11} & \tau_{12} \\ \tau_{21} & \tau_{22} \end{vmatrix}$ ,

$I_3 = \begin{vmatrix} \tau_{11} & \tau_{12} & \tau_{13} \\ \tau_{21} & \tau_{22} & \tau_{23} \\ \tau_{31} & \tau_{32} & \tau_{33} \end{vmatrix}$ .

On the other hand, if $\sigma_1, \sigma_2, \sigma_3$ are the roots of Eq. (3), which can be
written as

$(\sigma - \sigma_1)(\sigma - \sigma_2)(\sigma - \sigma_3) = 0$ ,

it can be seen that the following relations between the roots and the coefficients
must hold:

$I_1 = \sigma_1 + \sigma_2 + \sigma_3$ ,

(5)  $I_2 = \sigma_1\sigma_2 + \sigma_2\sigma_3 + \sigma_3\sigma_1$ ,

$I_3 = \sigma_1\sigma_2\sigma_3$ .

Since the principal stresses characterize the physical state of stress at a
point, they are independent of any coordinates of reference. Hence, Eq. (3)
and the coefficients $I_1, I_2, I_3$ are invariant with respect to the coordinate
transformation; $I_1, I_2, I_3$ are the invariants of the stress tensor. The importance
of invariants will become evident when physical laws are formulated
(see, for example, Chapter 6).
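
The invariants and the roots of the cubic can be computed for a concrete stress state. The Python sketch below evaluates $I_1$, $I_2$, $I_3$ from the stress components and finds the three principal stresses of a symmetric stress matrix with the standard trigonometric solution of a cubic with three real roots. This is one possible numerical route, not a method prescribed by the text; the function names and the sample stress values are assumptions for illustration.

```python
import math


def det3(a):
    """Determinant of a 3x3 matrix given as nested lists."""
    return (a[0][0] * (a[1][1] * a[2][2] - a[1][2] * a[2][1])
            - a[0][1] * (a[1][0] * a[2][2] - a[1][2] * a[2][0])
            + a[0][2] * (a[1][0] * a[2][1] - a[1][1] * a[2][0]))


def invariants(tau):
    """Return (I1, I2, I3) computed from the stress components."""
    i1 = tau[0][0] + tau[1][1] + tau[2][2]
    i2 = (tau[1][1] * tau[2][2] - tau[1][2] * tau[2][1]
          + tau[0][0] * tau[2][2] - tau[0][2] * tau[2][0]
          + tau[0][0] * tau[1][1] - tau[0][1] * tau[1][0])
    return i1, i2, det3(tau)


def principal_stresses(tau):
    """Principal stresses of a symmetric tau, largest first."""
    i1 = tau[0][0] + tau[1][1] + tau[2][2]
    q = i1 / 3.0
    # Subtract the mean normal stress from the diagonal.
    dev = [[tau[i][j] - (q if i == j else 0.0) for j in range(3)]
           for i in range(3)]
    p2 = sum(dev[i][j] ** 2 for i in range(3) for j in range(3))
    if p2 == 0.0:
        return [q, q, q]  # all three principal stresses coincide
    p = math.sqrt(p2 / 6.0)
    r = det3(dev) / (2.0 * p ** 3)
    r = max(-1.0, min(1.0, r))  # guard against rounding outside [-1, 1]
    phi = math.acos(r) / 3.0
    s1 = q + 2.0 * p * math.cos(phi)
    s3 = q + 2.0 * p * math.cos(phi + 2.0 * math.pi / 3.0)
    s2 = i1 - s1 - s3  # the roots sum to I1
    return [s1, s2, s3]


# Hypothetical symmetric stress state in arbitrary units.
tau = [[2.0, 0.0, 0.0],
       [0.0, 3.0, 4.0],
       [0.0, 4.0, -3.0]]

i1, i2, i3 = invariants(tau)
s1, s2, s3 = principal_stresses(tau)
```

For this example the principal stresses are 5, 2, and −5, and the invariants computed from the components agree with the symmetric functions of the roots in Eq. (5).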

We shall show now that for a symmetric stress tensor the three principal
stresses are all real and that the three principal planes are mutually
orthogonal. These important properties can be established when the stress
tensor is symmetric,

(6)  $\tau_{ij} = \tau_{ji}$ .

The proof is as follows. Let $\overset{1}{\boldsymbol{\nu}}, \overset{2}{\boldsymbol{\nu}}, \overset{3}{\boldsymbol{\nu}}$ be unit vectors in the direction of
the principal axes, with components $\overset{1}{\nu}_i, \overset{2}{\nu}_i, \overset{3}{\nu}_i$ ($i = 1, 2, 3$) which are the
solutions of Eq. (1) corresponding to the roots $\sigma_1, \sigma_2, \sigma_3$, respectively;

$(\tau_{ij} - \sigma_1\delta_{ij})\overset{1}{\nu}_j = 0$ ,

(7)  $(\tau_{ij} - \sigma_2\delta_{ij})\overset{2}{\nu}_j = 0$ ,

$(\tau_{ij} - \sigma_3\delta_{ij})\overset{3}{\nu}_j = 0$ .

Multiplying the first equation by $\overset{2}{\nu}_i$, the second by $\overset{1}{\nu}_i$, summing over $i$ and
subtracting the resulting equations, we obtain

(8)  $(\sigma_2 - \sigma_1)\overset{1}{\nu}_i\overset{2}{\nu}_i = 0$ ,

on account of the symmetry condition (6), which implies that

(9)  $\tau_{ij}\overset{1}{\nu}_j\overset{2}{\nu}_i = \tau_{ji}\overset{1}{\nu}_j\overset{2}{\nu}_i = \tau_{ij}\overset{2}{\nu}_j\overset{1}{\nu}_i$ .

The last equality is obtained by interchanging the dummy indices $i$ and $j$.

Now, if we assume tentatively that Eq. (3) has a complex root, then,
since the coefficients in Eq. (3) are real-valued, a complex conjugate root
must also exist and the set of roots may be written as

$\sigma_1 = \alpha + i\beta$ ,  $\sigma_2 = \alpha - i\beta$ ,  $\sigma_3$ ,

where $\alpha$, $\beta$, $\sigma_3$ are real numbers and $i$ stands for the imaginary number
$\sqrt{-1}$. In this case, Eqs. (7) show that $\overset{1}{\nu}_j$ and $\overset{2}{\nu}_j$ are complex conjugate to
each other and can be written as

$\overset{1}{\nu}_j = a_j + ib_j$ ,  $\overset{2}{\nu}_j = a_j - ib_j$ ,

in which $a_j$ and $b_j$ are real numbers and at least one of them is not zero.
Therefore,

$\overset{1}{\nu}_j\overset{2}{\nu}_j = (a_j + ib_j)(a_j - ib_j) = a_1^2 + a_2^2 + a_3^2 + b_1^2 + b_2^2 + b_3^2 \neq 0$ .

It follows from (8) that $\sigma_1 - \sigma_2 = 2i\beta = 0$ or $\beta = 0$. This contradicts the
original assumption that the roots are complex. Thus, the assumption of
the existence of complex roots is untenable, and the roots $\sigma_1, \sigma_2, \sigma_3$ are all
real.

When $\sigma_1 \neq \sigma_2 \neq \sigma_3$, Eq. (8) implies

(10)    $\overset{1}{\nu}_i\,\overset{2}{\nu}_i = 0 , \quad \overset{2}{\nu}_i\,\overset{3}{\nu}_i = 0 , \quad \overset{3}{\nu}_i\,\overset{1}{\nu}_i = 0 ;$

i.e., the principal vectors are mutually orthogonal to each other. If
$\sigma_1 = \sigma_2 \neq \sigma_3$, we can determine an infinite number of pairs of
orthogonal vectors $\overset{1}{\nu}_i$ and $\overset{2}{\nu}_i$ and define
$\overset{3}{\nu}_i$ as a vector orthogonal to $\overset{1}{\nu}_i$ and $\overset{2}{\nu}_i$.
If $\sigma_1 = \sigma_2 = \sigma_3$, then any set of orthogonal axes may be taken
as the principal axes.

If the reference axes $x_1$, $x_2$, $x_3$ are chosen to coincide with the principal
axes, then the matrix of stress components becomes

(11)    $(\tau_{ij}) = \begin{pmatrix} \sigma_1 & 0 & 0 \\ 0 & \sigma_2 & 0 \\ 0 & 0 & \sigma_3 \end{pmatrix} .$
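
The relations (5) between the roots and the coefficients can be checked numerically. The following is a minimal sketch, not part of the original text: it assumes the characteristic equation has the form $\sigma^3 - I_1\sigma^2 + I_2\sigma - I_3 = 0$, uses the standard trigonometric solution of a cubic with three real roots, and works on an arbitrary illustrative stress matrix `tau`. The function names are hypothetical.

```python
import math

def invariants(t):
    """Return (I1, I2, I3) for a symmetric 3x3 stress matrix t (nested lists)."""
    i1 = t[0][0] + t[1][1] + t[2][2]
    i2 = (t[0][0]*t[1][1] + t[1][1]*t[2][2] + t[2][2]*t[0][0]
          - t[0][1]**2 - t[1][2]**2 - t[2][0]**2)
    i3 = (t[0][0]*(t[1][1]*t[2][2] - t[1][2]*t[2][1])
          - t[0][1]*(t[1][0]*t[2][2] - t[1][2]*t[2][0])
          + t[0][2]*(t[1][0]*t[2][1] - t[1][1]*t[2][0]))
    return i1, i2, i3

def principal_stresses(t):
    """Real roots of s^3 - I1 s^2 + I2 s - I3 = 0, largest first."""
    i1, i2, i3 = invariants(t)
    mean = i1 / 3.0
    j2 = i1*i1/3.0 - i2                      # non-negative for symmetric t
    if j2 <= 1e-14 * max(1.0, mean*mean):    # hydrostatic state: triple root
        return [mean, mean, mean]
    j3 = i3 - i2*mean + 2.0*mean**3
    radius = 2.0*math.sqrt(j2/3.0)
    cos3a = max(-1.0, min(1.0, 4.0*j3/radius**3))   # clamp round-off
    a = math.acos(cos3a)/3.0
    roots = [mean + radius*math.cos(a + 2.0*math.pi*k/3.0) for k in range(3)]
    return sorted(roots, reverse=True)

# Illustrative (made-up) symmetric stress matrix.
tau = [[3.0, 1.0, 0.0],
       [1.0, 3.0, 0.0],
       [0.0, 0.0, 1.0]]
I1, I2, I3 = invariants(tau)
s1, s2, s3 = principal_stresses(tau)
print(s1, s2, s3)                      # approximately 4, 2, 1
print(I1 - (s1 + s2 + s3),             # each difference should be near zero
      I2 - (s1*s2 + s2*s3 + s3*s1),
      I3 - s1*s2*s3)
```

All three roots come out real, as the proof above requires for a symmetric matrix.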

3.8. SHEARING STRESSES 

We have seen that on an element of surface with a unit outer normal
$\boldsymbol{\nu}$ (with components $\nu_i$), there acts a traction
$\overset{\nu}{\mathbf{T}}$ ($\overset{\nu}{T}_i = \tau_{ji}\nu_j$). The component of
$\overset{\nu}{\mathbf{T}}$ in the direction of $\boldsymbol{\nu}$ is the normal stress
acting on the surface element. Let this normal stress be denoted by
$\sigma_{(n)}$. Since the component of a vector in the direction of another
vector is given by the scalar product of the two vectors, we obtain

(1)     $\sigma_{(n)} = \tau_{ij}\nu_i\nu_j .$

The magnitude of the resultant shearing stress on a surface element
having the normal $\nu_i$ is given by the equation

(2)     $\tau^2 = |\overset{\nu}{T}_i|^2 - \sigma_{(n)}^2 ,$

(see Fig. 3.8:1). Let the principal axes be chosen as the coordinate axes,
and let $\sigma_1$, $\sigma_2$, $\sigma_3$ be the principal stresses. Then

        $|\overset{\nu}{T}_i|^2 = (\sigma_1\nu_1)^2 + (\sigma_2\nu_2)^2 + (\sigma_3\nu_3)^2 ,$

and, from Eq. (1)

        $\sigma_{(n)}^2 = [\sigma_1(\nu_1)^2 + \sigma_2(\nu_2)^2 + \sigma_3(\nu_3)^2]^2 .$





On substituting into Eq. (2) and noting that

        $(\nu_1)^2 - (\nu_1)^4 = (\nu_1)^2[1 - (\nu_1)^2] = (\nu_1)^2[(\nu_2)^2 + (\nu_3)^2] ,$

we see that

(3)     $\tau^2 = (\nu_1)^2(\nu_2)^2(\sigma_1 - \sigma_2)^2 + (\nu_2)^2(\nu_3)^2(\sigma_2 - \sigma_3)^2$

                $+ (\nu_3)^2(\nu_1)^2(\sigma_3 - \sigma_1)^2 .$

If $\nu_1 = \nu_2 = 1/\sqrt{2}$ and $\nu_3 = 0$, then $\tau = \pm\tfrac{1}{2}(\sigma_1 - \sigma_2)$
and $\sigma_{(n)} = \tfrac{1}{2}(\sigma_1 + \sigma_2)$.
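
As a quick numerical check of Eq. (3), the sketch below evaluates the squared shearing stress twice in principal axes, once from Eq. (2) and once from the closed form (3). The principal stresses and the normals are illustrative values chosen here, not data from the text, and the function names are hypothetical.

```python
import math

def shear_sq_direct(s, n):
    """Eq. (2) in principal axes: tau^2 = |T|^2 - sigma_n^2."""
    t_sq = sum((si*ni)**2 for si, ni in zip(s, n))
    sigma_n = sum(si*ni*ni for si, ni in zip(s, n))
    return t_sq - sigma_n**2

def shear_sq_closed(s, n):
    """Eq. (3): sum of nu_i^2 nu_j^2 (sigma_i - sigma_j)^2 over the three pairs."""
    s1, s2, s3 = s
    n1, n2, n3 = n
    return (n1*n1*n2*n2*(s1 - s2)**2
            + n2*n2*n3*n3*(s2 - s3)**2
            + n3*n3*n1*n1*(s3 - s1)**2)

principal = (4.0, 2.0, 1.0)                 # assumed principal stresses
c = 1.0/math.sqrt(2.0)
for normal in ((c, c, 0.0), (1.0/3.0, 2.0/3.0, 2.0/3.0)):
    print(shear_sq_direct(principal, normal), shear_sq_closed(principal, normal))
```

For the first normal both expressions give $\tau^2 = 1$, i.e. $\tau = \tfrac{1}{2}(\sigma_1 - \sigma_2)$, in agreement with the special case quoted above.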

Problem 3.4. Show that $\tau_{\max} = \tfrac{1}{2}(\sigma_{\max} - \sigma_{\min})$ and that the
plane on which $\tau_{\max}$ acts makes an angle of 45° with the directions of the
largest and the smallest principal stresses.

3.9. MOHR’S CIRCLES 

Let $\sigma_1$, $\sigma_2$, $\sigma_3$ be the principal stresses at a point. The stress
components acting on any other surface elements can be obtained by the tensor
transformation laws, Eq. (3.5:3). Otto Mohr, in papers published in 1882
and 1900, has shown the interesting result that if the normal stress
$\sigma_{(n)}$ and the shearing stress $\tau$ acting on any surface element be plotted
on a plane, with $\sigma$ and $\tau$ as coordinates as shown in Fig. 3.9:1, the locus
necessarily falls in a closed domain represented by the shaded area bounded
by the three semicircles with centers on the $\sigma$-axis. A detailed proof can be
found in Westergaard,$^{1.2}$ Elasticity and Plasticity, pp. 61-64; or
Sokolnikoff,$^{1.2}$ Elasticity, p. 52. The practical problem of graphical
construction of Mohr’s circle from strain-gage data is discussed in Biezeno
and Grammel,$^{1.2}$ Engineering Dynamics, Vol. 1, p. 245; Pearson,$^{1.4}$
Theoretical Elasticity, p. 64. See Bibliography on pp. 873-876.

Fig. 3.9:1. Mohr’s circles.

3.10. STRESS DEVIATIONS
The tensor 

(1) ▲   $\tau_{ij}' = \tau_{ij} - \sigma_0\delta_{ij} ,$

is called the stress deviation tensor, where $\delta_{ij}$ is the Kronecker delta
and $\sigma_0$ is the mean stress

(2)     $\sigma_0 = \tfrac{1}{3}(\sigma_1 + \sigma_2 + \sigma_3) = \tfrac{1}{3}(\tau_{11} + \tau_{22} + \tau_{33}) = \tfrac{1}{3} I_1 ,$

where $I_1$ is the first invariant of Sec. 3.7 and $\tau_{ij}'$ specifies the deviation
of the state of stress from the mean stress.

The first invariant of the stress deviation tensor always vanishes: 

(3)     $I_1' = \tau_{11}' + \tau_{22}' + \tau_{33}' = 0 .$

To determine the principal stress deviations, the procedure of Sec. 3.7 may
be followed. The determinantal equation

(4)     $|\tau_{ij}' - \sigma'\delta_{ij}| = 0$

may be expanded in the form

(5)     $\sigma'^3 - J_2\sigma' - J_3 = 0 .$

It is easy to verify the following equations relating $J_2$, $J_3$ to the
invariants $I_2$, $I_3$ defined in Sec. 3.7,

(6)     $J_2 = 3\sigma_0^2 - I_2 ,$

(7)     $J_3 = I_3 - I_2\sigma_0 + 2\sigma_0^3 = I_3 + J_2\sigma_0 - \sigma_0^3 ,$

and the alternative expressions below on account of Eq. (3),

(8)     $J_2 = -\tau_{11}'\tau_{22}' - \tau_{22}'\tau_{33}' - \tau_{33}'\tau_{11}' + (\tau_{12})^2 + (\tau_{23})^2 + (\tau_{31})^2$

            $= \tfrac{1}{2}[(\tau_{11}')^2 + (\tau_{22}')^2 + (\tau_{33}')^2] + (\tau_{12})^2 + (\tau_{23})^2 + (\tau_{31})^2$

            $= \tfrac{1}{6}[(\tau_{11} - \tau_{22})^2 + (\tau_{22} - \tau_{33})^2 + (\tau_{33} - \tau_{11})^2]$

                $+ (\tau_{12})^2 + (\tau_{23})^2 + (\tau_{31})^2$

            $= \tfrac{3}{2}\tau_0^2 .$

The $\tau_0$ in the last equation is the octahedral stress, which will be defined
in the next section.

We note also the simple expressions 

(9) ▲   $J_2 = \tfrac{1}{2}\,\tau_{ij}'\tau_{ij}' ,$

(10) ▲  $J_3 = \tfrac{1}{3}\,\tau_{ij}'\tau_{jk}'\tau_{ki}' .$

It can be easily shown that the principal stress deviations are

(11)    $\sigma_i' = \sigma_i - \sigma_0 .$
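
The expressions for $J_2$ and $J_3$ can be cross-checked on a concrete matrix. The sketch below is illustrative only: it forms the stress deviation, evaluates $J_2$ and $J_3$ as half the double contraction and one third of the triple contraction of the deviation with itself, and compares $J_2$ with $3\sigma_0^2 - I_2$. The matrix `tau` is an arbitrary example and the helper names are hypothetical.

```python
R = range(3)

def deviator(t):
    """Return (stress deviation matrix, mean stress) of a 3x3 stress matrix."""
    mean = (t[0][0] + t[1][1] + t[2][2]) / 3.0
    dev = [[t[i][j] - (mean if i == j else 0.0) for j in R] for i in R]
    return dev, mean

def j2_j3(t):
    """J2 = (1/2) s_ij s_ij and J3 = (1/3) s_ij s_jk s_ki for the deviation s."""
    s, _ = deviator(t)
    j2 = 0.5*sum(s[i][j]*s[i][j] for i in R for j in R)
    j3 = sum(s[i][j]*s[j][k]*s[k][i] for i in R for j in R for k in R)/3.0
    return j2, j3

tau = [[3.0, 1.0, 0.0],
       [1.0, 3.0, 0.0],
       [0.0, 0.0, 1.0]]
dev, sigma0 = deviator(tau)
J2, J3 = j2_j3(tau)

# Second invariant I2 of the full tensor (sum of principal 2x2 minors).
I2 = (tau[0][0]*tau[1][1] + tau[1][1]*tau[2][2] + tau[2][2]*tau[0][0]
      - tau[0][1]**2 - tau[1][2]**2 - tau[2][0]**2)

print(dev[0][0] + dev[1][1] + dev[2][2])   # first invariant of the deviation, near 0
print(J2, 3.0*sigma0**2 - I2)              # both near 7/3
print(J3)                                  # near 20/27
```

For this example the principal stresses are 4, 2, 1, so the principal deviations are 5/3, -1/3, -4/3, whose product reproduces the printed $J_3$.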


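Problem 3.5. Show that the principal stress deviations as given by the
three roots of Eq. (5) can be written as

        $\sigma_1' = \sqrt{2}\,\tau_0\cos\alpha , \quad \sigma_2' = \sqrt{2}\,\tau_0\cos\left(\alpha + \tfrac{2\pi}{3}\right) , \quad \sigma_3' = \sqrt{2}\,\tau_0\cos\left(\alpha - \tfrac{2\pi}{3}\right) ,$

where $\cos 3\alpha = \sqrt{2}\,J_3/\tau_0^3$, and $J_2 = 3\tau_0^2/2$.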

3.11. OCTAHEDRAL SHEARING STRESS 

The octahedral shearing stress $\tau_0$ is the resultant shearing stress on a
plane that makes the same angle with the three principal directions. Such
a plane is called an octahedral plane; eight such planes can form an
octahedron. See Fig. 3.11:1. The direction cosines $\nu_i$ of a normal to the
octahedral plane relative to the principal axes are such that

        $(\nu_1)^2 = (\nu_2)^2 = (\nu_3)^2 = \tfrac{1}{3} .$

Fig. 3.11:1. Octahedral planes.

Hence, Eq. (3.8:3) gives

        $9\tau_0^2 = (\sigma_1 - \sigma_2)^2 + (\sigma_2 - \sigma_3)^2 + (\sigma_3 - \sigma_1)^2 ,$

which is proportional to the sum of the areas of Mohr’s three semicircles.
From Eqs. (3.10:6) and (3.10:8), it can be easily verified that the octahedral
stress can be expressed in terms of the two invariants $I_1$ and $I_2$ of Sec. 3.7,

        $9\tau_0^2 = 2I_1^2 - 6I_2 .$

The square of the octahedral stress happens to be proportional to the
second invariant $J_2$ of the stress deviation, Eqs. (3.10:6) and (3.10:8). In
1913, Richard von Mises proposed the hypothesis that yielding of some of
the most important materials occurs at a constant value of the quantity $J_2$.
Nadai then introduced the interpretation of $J_2$ as proportional to the
octahedral shearing stress. In this way, $J_2$ or $\tau_0$ enters into the basic
equations of plasticity. Note that the normal stress on the octahedral plane
equals the mean stress.
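
The statements of this section can be verified directly from the traction on an octahedral plane. The following sketch assumes principal axes as coordinate axes and uses made-up principal stresses; `octahedral` is a hypothetical helper name, not notation from the text.

```python
import math

def octahedral(s1, s2, s3):
    """Normal and shearing stress on the plane with nu_1^2 = nu_2^2 = nu_3^2 = 1/3."""
    n = 1.0/math.sqrt(3.0)
    traction = (s1*n, s2*n, s3*n)                 # T_i = sigma_i nu_i in principal axes
    sigma_n = sum(t*n for t in traction)          # component along the normal
    tau_sq = sum(t*t for t in traction) - sigma_n**2
    return sigma_n, math.sqrt(max(tau_sq, 0.0))

s1, s2, s3 = 4.0, 2.0, 1.0                        # assumed principal stresses
sigma_n, tau0 = octahedral(s1, s2, s3)

I1 = s1 + s2 + s3
I2 = s1*s2 + s2*s3 + s3*s1
print(sigma_n, I1/3.0)                            # normal stress equals the mean stress
print(9.0*tau0**2,                                # three expressions for 9 tau_0^2
      (s1 - s2)**2 + (s2 - s3)**2 + (s3 - s1)**2,
      2.0*I1**2 - 6.0*I2)
```

With these values all three expressions for $9\tau_0^2$ equal 14, so $J_2 = \tfrac{3}{2}\tau_0^2 = 7/3$.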

Problem 3.6. If $\sigma_1 > \sigma_2 > \sigma_3$ and $\sigma_1$, $\sigma_3$ are given, at what values
of $\sigma_2$ does $\tau_0$ attain its extreme values?

Problem 3.7. Let $\sigma_x = -5c$, $\sigma_y = c$, $\sigma_z = c$, $\tau_{xy} = -c$, $\tau_{yz} = \tau_{zx} = 0$,
where $c$ = 1,000 lb/sq in. Determine the principal stresses, the principal stress
deviations, the direction cosines of the principal directions, the greatest shearing
stress, and the octahedral stress.

Problem 3.8. Consider a horizontal beam as shown in Fig. P3.8. According
to the usual elementary theory of bending, the “fiber stress” is
$\sigma_{xx} = -12My/bh^3$, where $M$ is the bending moment which is a function of $x$.
Assume this value of $\sigma_{xx}$, and assume further that
$\sigma_{zz} = \sigma_{zx} = \sigma_{zy} = 0$, that the body force is absent, that $\sigma_{xy} = 0$ at
the top and bottom of the beam ($y = \pm h/2$), and that $\sigma_{yy} = 0$ at the bottom.
Derive $\sigma_{xy}$ and $\sigma_{yy}$ from the equations of equilibrium. Compare the results
with those derived in elementary mechanics of materials.

3.12. STRESS TENSOR IN GENERAL COORDINATES 

So far we have discussed the stress tensor in rectangular Cartesian
coordinates, in which there is no necessity to distinguish the contravariant
and covariant transformations. The necessary distinction arises in curvilinear
coordinates.

Just as a vector in three-dimensional Euclidean space may assume either
a contravariant or a covariant form, a stress tensor can be either
contravariant, $\tau^{ij}$, or mixed, $\tau^i_j$, or covariant, $\tau_{ij}$. The tensors
$\tau^{ij}$, $\tau^i_j$, and $\tau_{ij}$ are related to each other by raising or lowering of
the indices by forming inner products with the metric tensors $g_{ij}$ and $g^{ij}$:

        $\tau^i_j = g_{\alpha j}\tau^{i\alpha} = g_{\alpha j}\tau^{\alpha i} ,$

(1)     $\tau_{ij} = g_{i\alpha}\tau^{\alpha}_j ,$

        $\tau^{ij} = g^{i\alpha}\tau^{j}_{\alpha} .$

The correctness of these tensor relations is again seen by specializing them
into rectangular Cartesian coordinates. We have seen that $\tau_{ij}$ is symmetric
in rectangular Cartesian coordinates. It is easy to see from the tensor
transformation law that the symmetry property remains invariant in coordinate
transformation. Hence, $\tau_{ij} = \tau_{ji}$ in all admissible coordinates and we are
allowed to write the mixed tensor as $\tau^i_j$ and not $\tau^i{}_j$ or $\tau_j{}^i$. But what
is the physical meaning of each of these components?

To clarify the meaning of the stress components in an arbitrary
curvilinear coordinate system, let us first consider a geometric relationship.
Let us form an infinitesimal tetrahedron whose edges are formed by the
coordinate curves $PP_1$, $PP_2$, $PP_3$ and the curves $P_1P_2$, $P_2P_3$, $P_3P_1$, as
shown in Fig. 3.12:1. Let us write, vectorially,

        $PP_1 = \mathbf{r}_1 , \quad PP_2 = \mathbf{r}_2 , \quad PP_3 = \mathbf{r}_3 .$

Then

        $P_1P_2 = \mathbf{r}_2 - \mathbf{r}_1 , \quad P_1P_3 = \mathbf{r}_3 - \mathbf{r}_1 , \quad P_2P_3 = \mathbf{r}_3 - \mathbf{r}_2 ,$

and we have

(2)     $P_1P_2 \times P_1P_3 = (\mathbf{r}_2 - \mathbf{r}_1) \times (\mathbf{r}_3 - \mathbf{r}_1)$

            $= -\mathbf{r}_1 \times \mathbf{r}_3 - \mathbf{r}_2 \times \mathbf{r}_1 + \mathbf{r}_2 \times \mathbf{r}_3$

            $= \mathbf{r}_2 \times \mathbf{r}_3 + \mathbf{r}_3 \times \mathbf{r}_1 + \mathbf{r}_1 \times \mathbf{r}_2 .$

Fig. 3.12:1. Geometric relationship.

† Similarly, the contravariant stress tensor $\tau^{ij}$ is symmetric, $\tau^{ij} = \tau^{ji}$. It
does not make sense, however, to say that the mixed tensor $\tau^i_j$ is symmetric,
since an equation like $\tau^i_j = \tau^j_i$, with the indices switching roles on the two
sides of the equation, is not a tensor equation.

Now the vector product $\mathbf{A} \times \mathbf{B}$ of any two vectors $\mathbf{A}$ and $\mathbf{B}$ is a
vector perpendicular to $\mathbf{A}$ and $\mathbf{B}$, whose positive sense is determined by
the right-hand screw rule from $\mathbf{A}$ to $\mathbf{B}$, and whose length is equal to the
area of a parallelogram formed by $\mathbf{A}$, $\mathbf{B}$ as two sides. Hence, if we denote
by $\boldsymbol{\nu}$, $\boldsymbol{\nu}_1$, $\boldsymbol{\nu}_2$, $\boldsymbol{\nu}_3$ the unit vectors normal to the surfaces
$P_1P_2P_3$, $PP_2P_3$, $PP_3P_1$, $PP_1P_2$, respectively, and by $dS$, $dS_1$, $dS_2$,
$dS_3$ their respective areas, Eq. (2) may be written as

(3)     $\boldsymbol{\nu}\,dS = \boldsymbol{\nu}_1\,dS_1 + \boldsymbol{\nu}_2\,dS_2 + \boldsymbol{\nu}_3\,dS_3 .$

Now let us recall that in Sec. 2.14, we defined the reciprocal base vectors
$\mathbf{g}^1$, $\mathbf{g}^2$, $\mathbf{g}^3$ which are perpendicular to the coordinate planes and are
of length $\sqrt{g^{11}}$, $\sqrt{g^{22}}$, $\sqrt{g^{33}}$, respectively. We see that the unit
vectors $\boldsymbol{\nu}_1$, $\boldsymbol{\nu}_2$, $\boldsymbol{\nu}_3$ are exactly $\mathbf{g}^1/\sqrt{g^{11}}$, $\mathbf{g}^2/\sqrt{g^{22}}$,
$\mathbf{g}^3/\sqrt{g^{33}}$, respectively. Hence,

(4)     $\boldsymbol{\nu}\,dS = \sum_{i=1}^{3} \dfrac{\mathbf{g}^i}{\sqrt{g^{ii}}}\,dS_i .$

If the unit normal vector $\boldsymbol{\nu}$ is resolved into its covariant components with
respect to the reciprocal base vectors, then

(5)     $\boldsymbol{\nu} = \nu_i\,\mathbf{g}^i .$

We see from the last two equations that

(6) ▲   $\nu_i\sqrt{g^{ii}}\,dS = dS_i ,$     $i$ not summed.

This is the desired result.

Let us now consider the forces acting on the external surface of the
infinitesimal tetrahedron. In Sec. 3.1, the inside and outside of a volume
are distinguished by drawing an outward pointing normal vector. The stress
vector $\overset{\nu}{\mathbf{T}}$ is defined as the limit of force acting on the outside
surface with a normal $\boldsymbol{\nu}$ divided by the area of the surface. On the other
side, the stress vector is $-\overset{\nu}{\mathbf{T}}$, see Sec. 3.3, Fig. 3.3:1. Now, for the
tetrahedron shown in Fig. 3.12:1, the normal vector $\boldsymbol{\nu}$ of the triangle
$P_1P_2P_3$ is outward, the stress vector is $\overset{\nu}{\mathbf{T}}$, the area is $dS$, the
force is $\overset{\nu}{\mathbf{T}}\,dS$. On the triangle $PP_2P_3$, the normal vector
$\boldsymbol{\nu}_1$ points inward, the stress vector is $\overset{1}{\mathbf{T}}$, the area is $dS_1$,
the force is, therefore, $-\overset{1}{\mathbf{T}}\,dS_1$ as shown in Fig. 3.12:1. The forces
on the other surfaces are determined similarly. The equation of motion of this
infinitesimal tetrahedron is, in the limit,

(7)     $\overset{\nu}{\mathbf{T}}\,dS = \overset{i}{\mathbf{T}}\,dS_i .$

Volume forces and inertia (mass × acceleration) forces acting on the
tetrahedron do not enter into this equation, because they are of higher order
of smallness than the surface forces. On substituting (6) into (7) and
canceling the nonvanishing factor $dS$, we have

(8)     $\overset{\nu}{\mathbf{T}} = \sum_{i=1}^{3} \overset{i}{\mathbf{T}}\,\nu_i\sqrt{g^{ii}} .$

If the coordinates $x^i$ are changed to a new set $\bar{x}^i$ while the surface
$P_1P_2P_3$ and the unit outer normal $\boldsymbol{\nu}$ remain unchanged, the stress vector
$\overset{\nu}{\mathbf{T}}$ is invariant, but the vectors $\overset{i}{\mathbf{T}}$ will change because
they will be associated with the new coordinate surfaces. (See Fig. 3.12:2, in
which only one pair of vectors $\overset{2}{\mathbf{T}}$, $\overset{2}{\mathbf{T}}{}'$ are shown.) The
covariant components $\nu_i$ will change also. Inasmuch as $\overset{\nu}{\mathbf{T}}$ is
invariant and $\nu_i$ is a covariant tensor, Eq. (8) shows that
$\overset{i}{\mathbf{T}}\sqrt{g^{ii}}$ transforms according to a contravariant type of
transformation.

Fig. 3.12:2. Tractions referred to two different elementary tetrahedra with a
common surface.

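Resolving the vectors $\overset{i}{\mathbf{T}}\sqrt{g^{ii}}$ into their components with respect
to the base vectors $\mathbf{g}_j$ and the reciprocal base vectors $\mathbf{g}^j$, we have

(9) ▲   $\sqrt{g^{ii}}\;\overset{i}{\mathbf{T}} = \tau^{ij}\mathbf{g}_j = \tau^i_j\,\mathbf{g}^j ,$     $i$ not summed.

On the other hand, the components of $\overset{\nu}{\mathbf{T}}$ may be written as

(10)    $\overset{\nu}{\mathbf{T}} = \overset{\nu}{T}{}^j\,\mathbf{g}_j = \overset{\nu}{T}_j\,\mathbf{g}^j .$

Substitution of (9) and (10) into (8) shows that

(11) ▲  $\overset{\nu}{T}{}^j = \tau^{ij}\nu_i , \quad \overset{\nu}{T}_j = \tau^i_j\,\nu_i .$

The tensorial character of $\tau^{ij}$ and $\tau^i_j$ is demonstrated both by (9) and by
(11) according to the quotient rule. Following the tensor transformation
rules for $\tau^{ij}$, $\tau^i_j$ and $\nu_i$, we can rewrite Eq. (11) in the form

(12)    $\overset{\nu}{T}{}^j = \tau^j_i\,\nu^i , \quad \overset{\nu}{T}_j = \tau_{ij}\,\nu^i ,$

where $\nu^i$ are the contravariant components of $\boldsymbol{\nu}$.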

Let us recapitulate the important points. With the scaling factor
$\sqrt{g^{ii}}$, the vectors $\overset{i}{\mathbf{T}}\sqrt{g^{ii}}$ ($i = 1, 2, 3$, not summed)
transform according to the contravariant rule. When the vectors
$\overset{i}{\mathbf{T}}\sqrt{g^{ii}}$ are resolved into components along the base vectors
$\mathbf{g}_j$, the tensor components $\tau^{ij}$ are obtained. When the resolution is made
with respect to the reciprocal base vectors $\mathbf{g}^j$, then the mixed tensor
$\tau^i_j$ is obtained. On the other hand, if the stress vector $\overset{\nu}{\mathbf{T}}$
(acting on the triangle $P_1P_2P_3$ with outer normal $\boldsymbol{\nu}$) is resolved into
the contravariant components $\overset{\nu}{T}{}^j$ along the base vectors $\mathbf{g}_j$ and
the covariant components $\overset{\nu}{T}_j$ along the reciprocal base vectors
$\mathbf{g}^j$, then we obtain Eq. (11).

The contravariant stress tensor $\tau^{ij}$ and the mixed stress tensor $\tau^i_j$ are
related to the stress vectors $\overset{i}{\mathbf{T}}$ by Eq. (9). The covariant stress
tensor $\tau_{ij}$ defined in Eq. (1) cannot be so simply related to the vectors
$\overset{i}{\mathbf{T}}$ and is therefore of less importance.
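
The relations of this section can be tried out on a simple oblique (non-orthogonal) frame. The sketch below is an illustration under stated assumptions, not the book's procedure: the base vectors `g_lo`, the Cartesian stress matrix `sigma`, and the normal `nu` are made-up values, and the components are obtained from $\tau^{ij} = \mathbf{g}^i \cdot \boldsymbol{\sigma}\,\mathbf{g}^j$ and $\tau^i_j = \mathbf{g}^i \cdot \boldsymbol{\sigma}\,\mathbf{g}_j$, which follow from Eq. (9) when the traction on a coordinate surface is written in Cartesian form.

```python
def dot(a, b):
    return sum(x*y for x, y in zip(a, b))

def cross(a, b):
    return (a[1]*b[2] - a[2]*b[1], a[2]*b[0] - a[0]*b[2], a[0]*b[1] - a[1]*b[0])

def matvec(m, v):
    return tuple(dot(row, v) for row in m)

R = range(3)

# Hypothetical oblique base vectors g_1, g_2, g_3 (Cartesian components).
g_lo = [(1.0, 0.0, 0.0), (1.0, 2.0, 0.0), (0.0, 1.0, 1.0)]
vol = dot(g_lo[0], cross(g_lo[1], g_lo[2]))
# Reciprocal base vectors g^i, defined so that g^i . g_j = delta^i_j.
g_up = [tuple(c/vol for c in cross(g_lo[(i + 1) % 3], g_lo[(i + 2) % 3])) for i in R]

sigma = [[3.0, 1.0, 0.0], [1.0, 3.0, 0.0], [0.0, 0.0, 1.0]]   # symmetric Cartesian stress

tau_up = [[dot(g_up[i], matvec(sigma, g_up[j])) for j in R] for i in R]      # tau^{ij}
tau_mixed = [[dot(g_up[i], matvec(sigma, g_lo[j])) for j in R] for i in R]   # tau^i_j
g_metric = [[dot(g_lo[i], g_lo[j]) for j in R] for i in R]                   # g_ij

# Eq. (1): lowering an index, tau^i_j = g_{alpha j} tau^{i alpha}.
lowered = [[sum(g_metric[a][j]*tau_up[i][a] for a in R) for j in R] for i in R]

# Eq. (11): T^j = tau^{ij} nu_i, with covariant components nu_i = nu . g_i.
nu = (0.0, 0.6, 0.8)                              # unit normal, Cartesian components
nu_lo = [dot(nu, g_lo[i]) for i in R]
T_up = [sum(tau_up[i][j]*nu_lo[i] for i in R) for j in R]
T_rebuilt = tuple(sum(T_up[j]*g_lo[j][c] for j in R) for c in R)   # T^j g_j
T_cart = matvec(sigma, nu)                        # same traction computed directly

print(lowered)
print(tau_mixed)      # should agree with the line above
print(T_rebuilt)
print(T_cart)         # should agree with the line above
```

The agreement of the last two lines shows that the contravariant components, although numerically different from the Cartesian ones, describe the same traction vector.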


3.13. PHYSICAL COMPONENTS OF A STRESS TENSOR IN 
GENERAL COORDINATES 

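If we write Eq. (3.12:9) as

(1)     $\overset{i}{\mathbf{T}} = \sum_{j=1}^{3} \tau^{ij}\sqrt{\dfrac{g_{jj}}{g^{ii}}}\;\dfrac{\mathbf{g}_j}{\sqrt{g_{jj}}} = \sum_{j=1}^{3} \sigma^{ij}\,\dfrac{\mathbf{g}_j}{\sqrt{g_{jj}}} ,$     $i$ not summed.

Then, since $\mathbf{g}_j/\sqrt{g_{jj}}$ ($j$ not summed) are unit vectors along the
coordinate curves, the components $\sigma^{ij}$ are uniform in physical dimensions
and represent the physical components of the stress vector $\overset{i}{\mathbf{T}}$ in the
direction of the unit vectors $\mathbf{g}_j/\sqrt{g_{jj}}$,

(2)     $\sigma^{ij} = \sqrt{\dfrac{g_{jj}}{g^{ii}}}\;\tau^{ij} ,$     $i$, $j$ not summed.

But $\sigma^{ij}$ is not a tensor.

On the other hand, if we use the mixed tensor $\tau^i_j$ in (3.12:1), we have

(3)     $\overset{i}{\mathbf{T}} = \sum_{j=1}^{3} \tau^i_j\sqrt{\dfrac{g^{jj}}{g^{ii}}}\;\dfrac{\mathbf{g}^j}{\sqrt{g^{jj}}} ,$     $i$ not summed.

Thus

(4)     $\sigma^i_j = \sqrt{\dfrac{g^{jj}}{g^{ii}}}\;\tau^i_j ,$     $i$, $j$ not summed,

are the physical components of the tensor $\tau^i_j$, of uniform physical
dimensions, representing the components of the stress vector $\overset{i}{\mathbf{T}}$
resolved in the directions of the reciprocal base vectors. Note that, in
general, $\sigma^{ij} \neq \sigma^i_j \neq \sigma^j_i$, except for orthogonal coordinates. If the
coordinates are orthogonal, $\mathbf{g}^i$ and $\mathbf{g}_i$ are in the same direction.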

When curvilinear coordinates are used, we like to retain the liberty of
choosing coordinates without regard to dimensions. Thus, in cylindrical
polar coordinates $(r, \theta, z)$, $r$ and $z$ have the dimensions of length and
$\theta$ is an angle. The corresponding tensor components of a vector referred to
polar coordinates will have different dimensions. For physical understanding
it is desirable to employ the physical components, but for the convenience of
analysis it is far more expedient to use the tensor components.
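
The difference between tensor components and physical components shows up clearly in cylindrical polar coordinates. The sketch below is illustrative: it assumes the scaling $\sigma^{ij} = \sqrt{g_{jj}/g^{ii}}\,\tau^{ij}$ (no summation), which follows from Eq. (3.12:9) when the stress vector is resolved along unit vectors, and it uses a made-up Cartesian stress matrix and evaluation point.

```python
import math

def dot(a, b):
    return sum(x*y for x, y in zip(a, b))

def matvec(m, v):
    return tuple(dot(row, v) for row in m)

R = range(3)
r, theta = 2.0, math.radians(30.0)          # hypothetical point (r, theta, z)
c, s = math.cos(theta), math.sin(theta)

# Covariant base vectors g_j = d(position)/d(coordinate) for (r, theta, z).
g_lo = [(c, s, 0.0), (-r*s, r*c, 0.0), (0.0, 0.0, 1.0)]
g_diag = [dot(g, g) for g in g_lo]          # g_jj = 1, r^2, 1
# Orthogonal coordinates: g^j = g_j / g_jj, hence g^{jj} = 1 / g_jj.
g_up = [tuple(x/gjj for x in g) for g, gjj in zip(g_lo, g_diag)]

sigma = [[3.0, 1.0, 0.0], [1.0, 3.0, 0.0], [0.0, 0.0, 1.0]]   # Cartesian stress

tau_up = [[dot(g_up[i], matvec(sigma, g_up[j])) for j in R] for i in R]   # tau^{ij}
# Physical components: sqrt(g_jj / g^{ii}) = sqrt(g_jj * g_ii) here.
phys = [[math.sqrt(g_diag[j]*g_diag[i])*tau_up[i][j] for j in R] for i in R]

# Reference: the Cartesian stress resolved on the unit vectors e_r, e_theta, e_z.
unit = [tuple(x/math.sqrt(gjj) for x in g) for g, gjj in zip(g_lo, g_diag)]
ref = [[dot(unit[i], matvec(sigma, unit[j])) for j in R] for i in R]

print(tau_up[1][1], phys[1][1])   # tau^{theta theta} differs from sigma^{theta theta} by r^2
print(phys)
print(ref)                        # should agree with the line above
```

Here $\tau^{\theta\theta}$ carries the dimension of stress divided by length squared, while the physical component has the dimension of stress, which is the point made in the paragraph above.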

3.14. EQUATIONS OF EQUILIBRIUM IN CURVILINEAR 
COORDINATES 

In Sec. 3.4 we discussed the equations of equilibrium in terms of
Cartesian tensors in rectangular Cartesian coordinates. To obtain these
equations in any curvilinear coordinates, it is only necessary to observe that
the equilibrium conditions must be expressed in a tensor equation. Thus the
equations of equilibrium must be

(1)     $\tau^{ij}|_j + X^i = 0 ,$     in volume,

(2)     $\tau^{ji}\nu_j = \overset{\nu}{T}{}^i ,$     on surface.

The truth is at once proved by observing that these are truly tensor
equations and that they hold in the special case of rectangular Cartesian
coordinates. Hence, they hold in any coordinates that can be derived from the
Cartesian coordinates through admissible transformations.

The practical application of tensor analysis in the derivation of the
equations of equilibrium in particular curvilinear coordinates will be
illustrated in Secs. 4.11 and 4.12. It will be seen that these lengthy
equations can be obtained in a routine manner without too much effort. Because
the manipulation is routine, chances of error are minimized. This practical
application may be regarded as the first dividend to be paid for the long
process of learning the tensor analysis.

Problem 3.9. Let us recast the principal results obtained above into
tensor equations in general coordinates of reference. In rectangular Cartesian
coordinates, there is no difference in contravariant and covariant
transformations. Hence, the Cartesian stress tensor may be written as
$\tau_{ij}$, or $\tau^{ij}$, or $\tau^i_j$. In general frames of reference, $\tau_{ij}$, $\tau^{ij}$,
$\tau^i_j$ are different. Their components may have different values. They are
different versions of the same physical entity. Now prove the following
results in general coordinates.

(a) The tensors $\tau^{ij}$, $\tau_{ij}$ are symmetric if there is no body moment acting
on the medium; i.e.,

        $\tau^{ij} = \tau^{ji} , \quad \tau_{ij} = \tau_{ji} .$

(b) Principal planes are planes on which the stress vector $\overset{\nu}{\mathbf{T}}$ is
parallel to the normal vector $\boldsymbol{\nu}$. If we use contravariant components, we
have, on a principal plane, $\overset{\nu}{T}{}^j = \sigma\nu^j = \sigma g^{ij}\nu_i = \tau^{ij}\nu_i$,
where $\sigma$ is a scalar. If we use covariant components, we have
correspondingly $\overset{\nu}{T}_j = \sigma\nu_j = \sigma g^i_j\nu_i = \tau^i_j\nu_i$. Show that
$\sigma$ must satisfy the characteristic determinantal equation

        $|\tau^i_j - \sigma\delta^i_j| = 0$   or its equivalent   $|\tau^{ij} - \sigma g^{ij}| = 0 .$

(c) The first invariant of the stress tensor is $\tau^i_i$, or $\tau^{ij}g_{ij}$. However,
$\tau_{ii}$ and $\tau^{ii}$ are in general not invariants.

(d) The stress deviation tensor $s^i_j$ is defined as

        $s^i_j = \tau^i_j - \tfrac{1}{3}\tau^{\alpha}_{\alpha}\delta^i_j .$

The first invariant of $s^i_j$ is zero. The second invariant has the convenient
form

        $J_2 = \tfrac{1}{2}\,s^i_k s^k_i .$

(e) The octahedral shearing stress has the same value in any coordinate
system.

In this chapter we shall consider the deformation of a body as a “mapping”
of the body from the original state to the deformed state. Strain tensors
which are useful for finite strains as well as infinitesimal strains are then
defined.

4.1. DEFORMATION 

In the formulation of continuum mechanics the configuration of a solid 
body is described by a continuous mathematical model whose geometrical 
points are identified with the place of the material particles of the body. 
When such a continuous body changes its configuration under some phys¬ 
ical action, we impose the assumption that the change is continuous; i.e., 
neighborhoods are changed into neighborhoods. Any introduction of new 
boundary surfaces, such as caused by tearing of a membrane or fracture of a 
test specimen, must be regarded as an extraordinary circumstance requiring 
special attention and explanation. 

Let a system of coordinates $a_1$, $a_2$, $a_3$ be chosen so that a point $P$ of
a body at a certain instant of time is described by the coordinates
$a_i$ ($i = 1, 2, 3$). At a later instant of time, the body is moved (deformed)
to a new configuration; the point $P$ is moved to $Q$ with coordinates
$x_i$ ($i = 1, 2, 3$) with respect to a new coordinate system $x_1$, $x_2$, $x_3$. The
coordinate systems $a_1$, $a_2$, $a_3$ and $x_1$, $x_2$, $x_3$ may be curvilinear and
need not be the same (Fig. 4.1:1), but they both describe a Euclidean space.

The change of configuration of the body will be assumed to be continuous,
and the point transformation (mapping) from $P$ to $Q$ is assumed to be
one-to-one. The equation of transformation can be written as

(1)     $x_i = \hat{x}_i(a_1, a_2, a_3) ,$

which has a unique inverse

(2)     $a_i = \hat{a}_i(x_1, x_2, x_3) ,$

for every point in the body. The functions $\hat{x}_i(a_1, a_2, a_3)$ and
$\hat{a}_i(x_1, x_2, x_3)$ are assumed to be continuous and differentiable.

Fig. 4.1:1. Deformation of a body.

We shall be concerned with the description of the strain of the body, 
i.e., with the stretching and distortion of the body. If $P$, $P'$, $P''$ are three
neighboring points forming a triangle in the original configuration, and if
they are transformed to points $Q$, $Q'$, $Q''$ in the deformed configuration,
the change in area and angles of the triangle is completely determined if we 
know the change in length of the sides. But the “location” of the triangle 
is undetermined by the change of the sides. Similarly, if the change of 
length between any two arbitrary points of the body is known, the new 
configuration of the body will be completely defined except for the location 
of the body in space. In the following discussions our attention will be 
focused on the strain of the body, because it is the strain that is related 
to the stress. The description of the change in distance between any two 
points of the body is the key to the analysis of deformation. 

Consider an infinitesimal line element connecting the point
$P(a_1, a_2, a_3)$ to a neighboring point $P'(a_1 + da^1, a_2 + da^2, a_3 + da^3)$.*
The square of the length $ds_0$ of $PP'$ in the original configuration is given
by

(3)     $ds_0^2 = a_{ij}\,da^i\,da^j ,$

where $a_{ij}$, evaluated at the point $P$, is the Euclidean metric tensor for the
coordinate system $a_i$. When $P$, $P'$ is deformed to the points
$Q(x_1, x_2, x_3)$ and $Q'(x_1 + dx^1, x_2 + dx^2, x_3 + dx^3)$, respectively, the
square of the length $ds$ of the new element $QQ'$ is

(4)     $ds^2 = g_{ij}\,dx^i\,dx^j ,$

where $g_{ij}$ is the Euclidean metric tensor for the coordinate system $x_i$.

* As remarked before in a footnote in Sec. 2.4, the point transformation
between $a_i$ and $x_i$ does not follow tensor transformation law and the position
of the indices has no significance. But the differentials $da^i$ and $dx^i$ do
transform according to the contravariant tensor law and are contravariant
vectors.

By Eqs. (1) and (2), we may also write

(5)     $ds_0^2 = a_{ij}\,\dfrac{\partial a_i}{\partial x_l}\dfrac{\partial a_j}{\partial x_m}\,dx^l\,dx^m ,$

(6)     $ds^2 = g_{ij}\,\dfrac{\partial x_i}{\partial a_l}\dfrac{\partial x_j}{\partial a_m}\,da^l\,da^m .$

The difference between the squares of the length elements may be written,
after several changes in the symbols for dummy indices, either as

(7)     $ds^2 - ds_0^2 = \left(g_{\alpha\beta}\,\dfrac{\partial x_\alpha}{\partial a_i}\dfrac{\partial x_\beta}{\partial a_j} - a_{ij}\right) da^i\,da^j ,$

or as

(8)     $ds^2 - ds_0^2 = \left(g_{ij} - a_{\alpha\beta}\,\dfrac{\partial a_\alpha}{\partial x_i}\dfrac{\partial a_\beta}{\partial x_j}\right) dx^i\,dx^j .$

We define the strain tensors 


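(9) ▲   $E_{ij} = \dfrac{1}{2}\left(g_{\alpha\beta}\,\dfrac{\partial x_\alpha}{\partial a_i}\dfrac{\partial x_\beta}{\partial a_j} - a_{ij}\right) ,$

(10) ▲  $e_{ij} = \dfrac{1}{2}\left(g_{ij} - a_{\alpha\beta}\,\dfrac{\partial a_\alpha}{\partial x_i}\dfrac{\partial a_\beta}{\partial x_j}\right) ,$

so that

(11) ▲  $ds^2 - ds_0^2 = 2E_{ij}\,da^i\,da^j ,$

(12) ▲  $ds^2 - ds_0^2 = 2e_{ij}\,dx^i\,dx^j .$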

The strain tensor $E_{ij}$ was introduced by Green and St. Venant, and is
called Green’s strain tensor. The strain tensor $e_{ij}$ was introduced by Cauchy
for infinitesimal strains and by Almansi and Hamel for finite strains, and
is known as Almansi’s strain tensor. In analogy with a terminology in
hydrodynamics, $E_{ij}$ is often referred to as a strain tensor in Lagrangian
coordinates and $e_{ij}$ as a strain tensor in Eulerian coordinates.

That $E_{ij}$ and $e_{ij}$ thus defined are tensors in the coordinate systems
$a_i$ and $x_i$, respectively, follows from the quotient rule when it is applied
to Eqs. (11) and (12). The tensorial character of $E_{ij}$ and $e_{ij}$ can also be
verified directly from their definitions (9), (10) by considering further
coordinate transformations in either the original configuration (from
$a_1$, $a_2$, $a_3$ to $\bar{a}_1$, $\bar{a}_2$, $\bar{a}_3$), or the deformed configuration (from
$x_1$, $x_2$, $x_3$ to $\bar{x}_1$, $\bar{x}_2$, $\bar{x}_3$).





The details are left to the reader. The tensors $E_{ij}$ and $e_{ij}$ are obviously
symmetric, i.e.,

(13) ▲  $E_{ij} = E_{ji} , \quad e_{ij} = e_{ji} .$

An immediate consequence of Eqs. (11) and (12) is the fundamental
result that a necessary and sufficient condition that a deformation of a
body be a rigid-body motion (consists merely of translation and rotation
without changing distances between particles) is that all the components of
the strain tensor $E_{ij}$ or $e_{ij}$ be zero throughout the body.
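
The definitions (9) and (10) and the rigid-body criterion can be illustrated with a homogeneous deformation. The sketch below is a minimal example under stated assumptions: both configurations are referred to the same rectangular Cartesian axes, so that $g_{ij} = a_{ij} = \delta_{ij}$, and the deformations (a simple shear of amount `k` and a rotation about the third axis) are chosen here for illustration. The function names are hypothetical.

```python
import math

R = range(3)

def green(F):
    """E_ij = (1/2)(dx_a/da_i dx_a/da_j - delta_ij), with F[a][i] = dx_a/da_i."""
    return [[0.5*(sum(F[a][i]*F[a][j] for a in R) - (i == j)) for j in R] for i in R]

def almansi(Finv):
    """e_ij = (1/2)(delta_ij - da_a/dx_i da_a/dx_j), with Finv[a][i] = da_a/dx_i."""
    return [[0.5*((i == j) - sum(Finv[a][i]*Finv[a][j] for a in R)) for j in R] for i in R]

def quad(m, v):
    """Quadratic form m_ij v_i v_j."""
    return sum(m[i][j]*v[i]*v[j] for i in R for j in R)

# Simple shear x1 = a1 + k a2, x2 = a2, x3 = a3 and its inverse.
k = 0.5
F = [[1.0, k, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
Finv = [[1.0, -k, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
E, e = green(F), almansi(Finv)

da = (0.3, -0.2, 0.5)                                   # a material line element
dx = tuple(sum(F[i][j]*da[j] for j in R) for i in R)    # its image after deformation
change = sum(x*x for x in dx) - sum(a*a for a in da)    # ds^2 - ds0^2
print(change, 2.0*quad(E, da), 2.0*quad(e, dx))         # Eqs. (11) and (12)

# Rigid rotation about the third axis: both strain tensors vanish.
c, s = math.cos(0.7), math.sin(0.7)
Q = [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]
Qinv = [[c, s, 0.0], [-s, c, 0.0], [0.0, 0.0, 1.0]]
print(max(abs(x) for row in green(Q) for x in row),
      max(abs(x) for row in almansi(Qinv) for x in row))
```

Note that the shear gives $E_{22} = k^2/2$ but $e_{22} = -k^2/2$: the two tensors measure the same change of length with respect to different reference configurations.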

In the discussion above we have used two sets of curvilinear coordinates
to describe the position of each particle. One, $a_1$, $a_2$, $a_3$, is used in
the original configuration, the other, $x_1$, $x_2$, $x_3$, is used in the deformed
configuration.

Now, there are two particularly favored choices of coordinates: 

I. We use the same rectangular Cartesian coordinates for both the
original and the deformed configurations of the body. In this case, the metric
tensors are extremely simple:

(14)    $g_{ij} = a_{ij} = \delta_{ij} .$

II. We distort the frame of reference in the deformed configuration
in such a way that the coordinates $x_1$, $x_2$, $x_3$ of a particle have the same
numerical values $a_1$, $a_2$, $a_3$ as in the original configuration. In this case,
$x_i = a_i$, $\partial x_\alpha/\partial a_i = \delta_{\alpha i}$, $\partial a_\alpha/\partial x_i = \delta_{\alpha i}$, and
Eqs. (9) and (10) are reduced to

(15)    $E_{ij} = e_{ij} = \tfrac{1}{2}(g_{ij} - a_{ij}) .$

Thus all the information about strain is contained in the change of the 
metric tensor as the frame of reference is distorted from the original con¬ 
figuration to the deformed configuration. In many ways this is the most 
convenient choice in the study of large deformations. The coordinates so 
chosen are called convected or intrinsic coordinates. 

In the following sections we wish to discuss the meaning of the individual 
components of the strain tensor. For this purpose the choice (I) above is 
the most appropriate. 


4.2. STRAIN TENSORS IN RECTANGULAR CARTESIAN 
COORDINATES 

If we use the same rectangular Cartesian (rectilinear and orthogonal)
coordinate system to describe both the original and the deformed
configurations of the body, then

(1)     $g_{ij} = a_{ij} = \delta_{ij} = \begin{cases} 1 & \text{for } i = j , \\ 0 & \text{for } i \neq j . \end{cases}$

Fig. 4.2:1. Displacement vector.

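Furthermore, if we introduce the displacement vector $\mathbf{u}$ with
components

(2)     $u_i = x_i - a_i ,$     $i = 1, 2, 3 ,$

(see Fig. 4.2:1) then

        $\dfrac{\partial x_\alpha}{\partial a_i} = \dfrac{\partial u_\alpha}{\partial a_i} + \delta_{\alpha i} , \quad \dfrac{\partial a_\alpha}{\partial x_i} = \delta_{\alpha i} - \dfrac{\partial u_\alpha}{\partial x_i} .$

The strain tensors reduce to the simple form

(3) ▲   $E_{ij} = \dfrac{1}{2}\left[\delta_{\alpha\beta}\,\dfrac{\partial x_\alpha}{\partial a_i}\dfrac{\partial x_\beta}{\partial a_j} - \delta_{ij}\right]$

            $= \dfrac{1}{2}\left[\delta_{\alpha\beta}\left(\dfrac{\partial u_\alpha}{\partial a_i} + \delta_{\alpha i}\right)\left(\dfrac{\partial u_\beta}{\partial a_j} + \delta_{\beta j}\right) - \delta_{ij}\right]$

            $= \dfrac{1}{2}\left[\dfrac{\partial u_j}{\partial a_i} + \dfrac{\partial u_i}{\partial a_j} + \dfrac{\partial u_\alpha}{\partial a_i}\dfrac{\partial u_\alpha}{\partial a_j}\right] ,$

and

(4) ▲   $e_{ij} = \dfrac{1}{2}\left[\delta_{ij} - \delta_{\alpha\beta}\,\dfrac{\partial a_\alpha}{\partial x_i}\dfrac{\partial a_\beta}{\partial x_j}\right]$

            $= \dfrac{1}{2}\left[\delta_{ij} - \delta_{\alpha\beta}\left(-\dfrac{\partial u_\alpha}{\partial x_i} + \delta_{\alpha i}\right)\left(-\dfrac{\partial u_\beta}{\partial x_j} + \delta_{\beta j}\right)\right]$

            $= \dfrac{1}{2}\left[\dfrac{\partial u_j}{\partial x_i} + \dfrac{\partial u_i}{\partial x_j} - \dfrac{\partial u_\alpha}{\partial x_i}\dfrac{\partial u_\alpha}{\partial x_j}\right] .$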


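In unabridged notations ($x$, $y$, $z$ for $x_1$, $x_2$, $x_3$; $a$, $b$, $c$ for $a_1$,
$a_2$, $a_3$; and $u$, $v$, $w$ for $u_1$, $u_2$, $u_3$), we have the typical terms

(5)     $E_{ab} = \dfrac{1}{2}\left[\dfrac{\partial u}{\partial b} + \dfrac{\partial v}{\partial a} + \left(\dfrac{\partial u}{\partial a}\dfrac{\partial u}{\partial b} + \dfrac{\partial v}{\partial a}\dfrac{\partial v}{\partial b} + \dfrac{\partial w}{\partial a}\dfrac{\partial w}{\partial b}\right)\right] ,$

        $e_{xy} = \dfrac{1}{2}\left[\dfrac{\partial u}{\partial y} + \dfrac{\partial v}{\partial x} - \left(\dfrac{\partial u}{\partial x}\dfrac{\partial u}{\partial y} + \dfrac{\partial v}{\partial x}\dfrac{\partial v}{\partial y} + \dfrac{\partial w}{\partial x}\dfrac{\partial w}{\partial y}\right)\right] .$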

Note that $u$, $v$, $w$ are considered as functions of $a$, $b$, $c$, the position of
points in the body in unstrained configuration, when the Lagrangian strain
tensor is evaluated; whereas they are considered as functions of $x$, $y$, $z$,
the position of points in the strained configuration, when the Eulerian strain
tensor is evaluated.

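If the components of displacement $u_i$ are such that their first derivatives
are so small that the squares and products of the partial derivatives of $u_i$
are negligible, then $e_{ij}$ is reduced to Cauchy’s infinitesimal strain tensor,

(6) ▲   $e_{ij} = \dfrac{1}{2}\left[\dfrac{\partial u_j}{\partial x_i} + \dfrac{\partial u_i}{\partial x_j}\right] .$

In unabridged notation,

        $e_{xx} = \dfrac{\partial u}{\partial x} , \quad e_{xy} = \dfrac{1}{2}\left(\dfrac{\partial u}{\partial y} + \dfrac{\partial v}{\partial x}\right) = e_{yx} ,$

(7)     $e_{yy} = \dfrac{\partial v}{\partial y} , \quad e_{xz} = \dfrac{1}{2}\left(\dfrac{\partial u}{\partial z} + \dfrac{\partial w}{\partial x}\right) = e_{zx} ,$

        $e_{zz} = \dfrac{\partial w}{\partial z} , \quad e_{yz} = \dfrac{1}{2}\left(\dfrac{\partial v}{\partial z} + \dfrac{\partial w}{\partial y}\right) = e_{zy} .$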
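
How quickly the finite strain tensors approach Cauchy's infinitesimal tensor can be seen on a simple shear. The sketch below is illustrative: it evaluates the displacement-gradient forms of Green's and Almansi's tensors and the linearized tensor for the displacement field $u_1 = k a_2 = k x_2$ (for which the displacement gradient is the same in both descriptions) at two made-up values of `k`. The function names are hypothetical.

```python
R = range(3)

def green(H):
    """Green's tensor from H[i][j] = du_i/da_j (quadratic term added)."""
    return [[0.5*(H[j][i] + H[i][j] + sum(H[a][i]*H[a][j] for a in R))
             for j in R] for i in R]

def almansi(h):
    """Almansi's tensor from h[i][j] = du_i/dx_j (quadratic term subtracted)."""
    return [[0.5*(h[j][i] + h[i][j] - sum(h[a][i]*h[a][j] for a in R))
             for j in R] for i in R]

def cauchy(h):
    """Cauchy's infinitesimal strain tensor (quadratic term dropped)."""
    return [[0.5*(h[j][i] + h[i][j]) for j in R] for i in R]

def max_diff(p, q):
    return max(abs(p[i][j] - q[i][j]) for i in R for j in R)

for k in (1e-1, 1e-3):
    grad = [[0.0, k, 0.0], [0.0, 0.0, 0.0], [0.0, 0.0, 0.0]]
    print(k,
          max_diff(green(grad), cauchy(grad)),     # equals k^2 / 2
          max_diff(almansi(grad), cauchy(grad)))   # equals k^2 / 2
```

The discrepancy is of second order in the displacement gradient, which is why the three measures may be used interchangeably when the gradients are small.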


In the infinitesimal displacement case, the distinction between the
Lagrangian and Eulerian strain tensors disappears, since then it is immaterial
whether the derivatives of the displacements are calculated at the position
of a point before or after deformation.

4.3. GEOMETRIC INTERPRETATION OF INFINITESIMAL
STRAIN COMPONENTS

Let $x$, $y$, $z$ be a set of rectangular Cartesian coordinates. Consider a line
element of length $dx$ parallel to the $x$-axis ($dy = dz = 0$). The change of
the square of the length of this element due to deformation is

        $ds^2 - ds_0^2 = 2e_{xx}(dx)^2 .$

Hence,

        $ds - ds_0 = \dfrac{2e_{xx}(dx)^2}{ds + ds_0} .$

But $ds = dx$ in this case, and $ds_0$ differs from $ds$ only by a small quantity
of the second order, if we assume the displacements $u$, $v$, $w$ and the strain
components to be infinitesimal. Hence,

        $\dfrac{ds - ds_0}{ds} = e_{xx} ,$

and it is seen that $e_{xx}$ represents the extension, or change of length per
unit length of a vector parallel to the $x$-axis. An application of the above
discussion to a volume element is illustrated in Fig. 4.3:1, Case 1.

Fig. 4.3:1. Deformation gradients and interpretation of infinitesimal strain
components.

To see the meaning of the component $e_{xy}$, let us consider a small
rectangle in the body with edges $dx$, $dy$. It is evident from Fig. 4.3:1,
Cases 2, 3, and 4 that the sum $\partial u/\partial y + \partial v/\partial x$ represents the change of
angle $xOy$ which was originally a right angle. Thus,

        $e_{xy} = \dfrac{1}{2}\left(\dfrac{\partial u}{\partial y} + \dfrac{\partial v}{\partial x}\right) = \dfrac{1}{2}\,(\text{change of angle } xOy) .$

In engineering usage, the strain components $e_{ij}$ ($i \neq j$) doubled, i.e.,
$2e_{ij}$, are called the shearing strains, or detrusions. The name is perhaps
particularly suggestive in Case 3 of Fig. 4.3:1, which is called the case of
simple shear.

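The quantity

        $\omega_z = \dfrac{1}{2}\left(\dfrac{\partial v}{\partial x} - \dfrac{\partial u}{\partial y}\right)$

is called the infinitesimal rotation of the element $dx\,dy$. This terminology
is suggested by Case 4 in Fig. 4.3:1. If

        $\dfrac{\partial v}{\partial x} = -\dfrac{\partial u}{\partial y} ,$

then $e_{xy} = 0$ and $\omega_z$ is indeed the angle through which the rectangular
element is rotated as a rigid body.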



4.4. ROTATION 


Consider an infinitesimal displacement field $u_i(x_1, x_2, x_3)$. From $u_i$,
form the Cartesian tensor

(1)     $\omega_{ij} = \dfrac{1}{2}\left(\dfrac{\partial u_j}{\partial x_i} - \dfrac{\partial u_i}{\partial x_j}\right) ,$

which is antisymmetric, i.e.,

(2)     $\omega_{ij} = -\omega_{ji} .$

In three dimensions, from an antisymmetric tensor we can always build a
dual vector,

(3)     $\omega_k = \tfrac{1}{2}\,e_{kij}\,\omega_{ij} ,$   i.e.,   $\boldsymbol{\omega} = \tfrac{1}{2}\,\mathrm{curl}\,\mathbf{u} ,$

where $e_{kij}$ is the permutation tensor (Secs. 2.1 and 2.6). On the other hand,
from Eq. (3) and the $e$-$\delta$ identity [Eq. (2.1:11)] it follows that
$e_{ijk}\,\omega_k = \tfrac{1}{2}(\omega_{ij} - \omega_{ji})$. Hence, if $\omega_{ij}$ is antisymmetric, the
relation Eq. (3) has a unique inverse,

(4)     $\omega_{ij} = e_{ijk}\,\omega_k .$

Thus, $\omega_{ij}$ may be called the dual (antisymmetric) tensor of a vector $\omega_k$.
We shall call $\omega_k$ and $\omega_{ij}$, respectively, the rotation vector and rotation
tensor of the displacement field $u_i$.

We shall consider the physical meaning of these quantities below. 

At the end of Sec. 4.3 we saw that $\omega_z$ represents the (infinitesimal)
rotation of the element as a rigid body if $e_{xy}$ vanishes. Now we have the
general theorem that the vanishing of the symmetric strain tensor $E_{ij}$ or
$e_{ij}$ is the necessary and sufficient condition for a neighborhood of a
particle to be moved like a rigid body. This follows at once from the
definitions of strain tensors, Eqs. (4.1:11) and (4.1:12). For, if a
neighborhood of a particle $P$ moves like a rigid body, the length of any
element in the neighborhood will not change, so that $ds = ds_0$. It follows
that $E_{ij} = e_{ij} = 0$, because, for a symmetric strain tensor $E_{ij}$, there
exists a coordinate system such that

        $ds^2 - ds_0^2 = \Lambda_1\,da_1^2 + \Lambda_2\,da_2^2 + \Lambda_3\,da_3^2 ,$

where $\Lambda_i$ are the eigenvalues of $E_{ij}$. If $ds = ds_0$ for any line element,
i.e., $ds_0$ can just equal $da_1$, $da_2$ or $da_3$, it implies $\Lambda_i$ ($i = 1, 2, 3$)
are zero which in turn implies that $E_{ij} = 0$. The proof for $e_{ij} = 0$ is the
same.

Conversely, if $E_{ij}$ or $e_{ij}$ vanishes at $P$, the length of line elements
joining any two points in a neighborhood of $P$ will not change and the
neighborhood moves like a rigid body.

We can show that an infinitesimal displacement field in the neighborhood of a point P can be decomposed into a stretching deformation and a rigid-body rotation. To show this, consider a point P' in the neighborhood of P. Let the coordinates of P and P' be $x_i$ and $x_i + dx_i$, respectively. The relative displacement of P' with respect to P is

$$ du_i = \frac{\partial u_i}{\partial x_j}\,dx_j. \tag{5} $$

This can be written as

$$ du_i = \frac{1}{2}\left(\frac{\partial u_i}{\partial x_j} + \frac{\partial u_j}{\partial x_i}\right)dx_j + \frac{1}{2}\left(\frac{\partial u_i}{\partial x_j} - \frac{\partial u_j}{\partial x_i}\right)dx_j = e_{ij}\,dx_j - \omega_{ij}\,dx_j. $$

The first term in the right-hand side represents a stretching deformation which involves both stretching and rotation. The second term can be written as

$$ -\omega_{ij}\,dx_j = \omega_{ji}\,dx_j = -e_{ijk}\,\omega_k\,dx_j = (\boldsymbol{\omega} \times d\mathbf{x})_i. \tag{6} $$

Thus the second term is the vector product of $\boldsymbol{\omega}$ and $d\mathbf{x}$. This is exactly what would have been produced by an infinitesimal rotation $|\boldsymbol{\omega}|$ about an axis through P in the direction of $\boldsymbol{\omega}$.

It should be noted that we have restricted ourselves to infinitesimal 
displacements. Angular and strain measures for finite displacements are 
related to the deformation gradient in a more complicated way. 
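
The decomposition of Sec. 4.4 can be checked numerically. The following is a minimal sketch, not part of the original text: the displacement gradient `grad` and the element `dx` are hypothetical values, and the helper names are chosen here for illustration. It splits the gradient into the strain and rotation tensors, forms the dual rotation vector, and confirms that the relative displacement equals the strain part plus the vector product of the rotation vector with the element.

```python
# Sketch: split a small displacement gradient into strain and rotation parts.
# The numerical values of `grad` and `dx` are hypothetical.

def decompose(grad):
    """grad[i][j] = du_i/dx_j. Returns (e, w): e_ij is the infinitesimal strain,
    w_ij = (du_j/dx_i - du_i/dx_j)/2 is the rotation tensor of Eq. (1)."""
    e = [[0.5 * (grad[i][j] + grad[j][i]) for j in range(3)] for i in range(3)]
    w = [[0.5 * (grad[j][i] - grad[i][j]) for j in range(3)] for i in range(3)]
    return e, w

def perm(i, j, k):
    """Permutation symbol e_ijk for indices in {0, 1, 2}."""
    return (i - j) * (j - k) * (k - i) / 2

def rotation_vector(w):
    """Dual vector omega_k = (1/2) e_kij w_ij of Eq. (3)."""
    return [0.5 * sum(perm(k, i, j) * w[i][j] for i in range(3) for j in range(3))
            for k in range(3)]

def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]

def matvec(m, v):
    return [sum(m[i][j] * v[j] for j in range(3)) for i in range(3)]

grad = [[1e-3, 4e-4, -2e-4],
        [-6e-4, 2e-3, 5e-4],
        [3e-4, -1e-4, -1e-3]]
dx = [1e-2, -2e-2, 5e-3]

e, w = decompose(grad)
omega = rotation_vector(w)
du = matvec(grad, dx)          # Eq. (5)
stretch = matvec(e, dx)        # e_ij dx_j
rotation = cross(omega, dx)    # (omega x dx)_i, equal to -w_ij dx_j by Eq. (6)
print(du, [s + r for s, r in zip(stretch, rotation)])
```

The two printed vectors agree to rounding error, which is the statement of Eqs. (5) and (6) for this particular gradient.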

4.5. FINITE STRAIN COMPONENTS 

When the strain components are not small, it is no longer possible to give 
simple geometric interpretations of the components of the strain tensors. 

Consider a set of rectangular Cartesian coordinates with respect to which the strain components are defined as in Sec. 4.1. Let a line element before deformation be $da_1 = ds_0$, $da_2 = 0$, $da_3 = 0$. Let the extension $E_1$ of this element be defined by

$$ E_1 = \frac{ds - ds_0}{ds_0}, $$

or

$$ ds = (1 + E_1)\,ds_0. \tag{1} $$

From Eq. (4.1:11), we have

$$ ds^2 - ds_0^2 = 2E_{ij}\,da_i\,da_j = 2E_{11}(da_1)^2. \tag{2} $$

Combining (1) and (2), we obtain

$$ (1 + E_1)^2 - 1 = 2E_{11}, $$

or

$$ E_1 = \sqrt{1 + 2E_{11}} - 1. \tag{3} $$

This reduces to

$$ E_1 \doteq E_{11} \tag{4} $$

if $E_{11}$ is small.

To get the physical significance of the component $E_{12}$, let us consider two line elements $ds_0$ and $d\bar{s}_0$ which are at a right angle in the original state:

$$ ds_0:\; da_1 = ds_0,\; da_2 = 0,\; da_3 = 0; \qquad d\bar{s}_0:\; d\bar{a}_1 = 0,\; d\bar{a}_2 = d\bar{s}_0,\; d\bar{a}_3 = 0. \tag{5} $$

After deformation these line elements become $ds$, $(dx_i)$ and $d\bar{s}$, $(d\bar{x}_i)$. Forming the scalar product of the deformed elements, we obtain

$$ ds\,d\bar{s}\cos\theta = dx_k\,d\bar{x}_k = \frac{\partial x_k}{\partial a_i}\,da_i\,\frac{\partial x_k}{\partial a_j}\,d\bar{a}_j = \frac{\partial x_k}{\partial a_1}\frac{\partial x_k}{\partial a_2}\,ds_0\,d\bar{s}_0. $$

The right-hand side is related to the strain component $E_{12}$ according to Eq. (4.1:9), on specializing to rectangular Cartesian coordinates. Hence

$$ ds\,d\bar{s}\cos\theta = 2E_{12}\,ds_0\,d\bar{s}_0. \tag{6} $$

But, from Eqs. (1) and (3), we have

$$ ds = \sqrt{1 + 2E_{11}}\;ds_0, \qquad d\bar{s} = \sqrt{1 + 2E_{22}}\;d\bar{s}_0. $$

Hence, Eq. (6) yields

$$ \cos\theta = \frac{2E_{12}}{\sqrt{1 + 2E_{11}}\,\sqrt{1 + 2E_{22}}}. \tag{7} $$

The angle $\theta$ is the angle between the line elements $ds$ and $d\bar{s}$ after deformation. The change of angle between the two line elements, which in the original state $ds_0$ and $d\bar{s}_0$ are orthogonal, is $\alpha_{12} = \pi/2 - \theta$. From Eq. (7) we therefore obtain

$$ \sin\alpha_{12} = \frac{2E_{12}}{\sqrt{1 + 2E_{11}}\,\sqrt{1 + 2E_{22}}}. \tag{8} $$

This reduces, in the case of infinitesimal strain, to

$$ \alpha_{12} \doteq 2E_{12}. \tag{9} $$
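
As an illustration of Eqs. (3) and (8), the short sketch below evaluates the extension and the change of angle for a simple shear. The example deformation $x_1 = a_1 + k a_2$, $x_2 = a_2$ and the value of `k` are assumptions made here, not taken from the text; for that deformation the Green strain components are $E_{11} = 0$, $E_{22} = k^2/2$, $E_{12} = k/2$.

```python
import math

def extension(E11):
    """Extension E_1 of a line element initially along a_1, Eq. (3)."""
    return math.sqrt(1.0 + 2.0 * E11) - 1.0

def shear_angle(E11, E22, E12):
    """Change alpha_12 of the right angle between elements along a_1 and a_2, Eq. (8)."""
    return math.asin(2.0 * E12 / math.sqrt((1.0 + 2.0 * E11) * (1.0 + 2.0 * E22)))

# Hypothetical example: simple shear x1 = a1 + k*a2, x2 = a2.
k = 0.5
E11, E22, E12 = 0.0, 0.5 * k * k, 0.5 * k

# Finite-strain values next to the small-strain estimate alpha_12 = 2*E12 of Eq. (9).
print(extension(E11), extension(E22), shear_angle(E11, E22, E12), 2.0 * E12)
```

For this deformation the element along $a_2$ is stretched to length $\sqrt{1 + k^2}$ and the right angle closes by $\arctan k$, so the finite-strain formulas can be compared directly with elementary geometry; the small-strain estimate $2E_{12} = k$ is accurate only when $k$ is small.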

A completely analogous interpretation can be made for the Eulerian strain components. Defining the extension $e_1$ per unit deformed length as

$$ e_1 = \frac{ds - ds_0}{ds}, \tag{10} $$

we find

$$ e_1 = 1 - \sqrt{1 - 2e_{11}}. \tag{11} $$

Furthermore, if the deviation from a right angle between two elements in the original state which after deformation become orthogonal be denoted by $\beta_{12}$, we have

$$ \sin\beta_{12} = \frac{2e_{12}}{\sqrt{1 - 2e_{11}}\,\sqrt{1 - 2e_{22}}}. \tag{12} $$

These again reduce to the familiar results

$$ e_1 \doteq e_{11}, \qquad \beta_{12} \doteq 2e_{12}, $$

in the case of infinitesimal strain.


4.6. COMPATIBILITY OF STRAIN COMPONENTS 

The question of how to determine the displacements $u_i$ when the components of strain tensor are given naturally arises. In other words, how do we integrate the differential equations (in rectangular Cartesian coordinates)

$$ e_{ij} = \frac{1}{2}\left[\frac{\partial u_j}{\partial x_i} + \frac{\partial u_i}{\partial x_j} - \frac{\partial u_\alpha}{\partial x_i}\frac{\partial u_\alpha}{\partial x_j}\right] \qquad (i, j = 1, 2, 3) \tag{1} $$

to determine the three unknowns $u_1, u_2, u_3$?

Inasmuch as there are six equations for three unknown functions $u_i$, the system of Eqs. (1) will not have a single-valued solution in general, if the functions $e_{ij}$ were arbitrarily assigned. One must expect that a solution may exist only if the functions $e_{ij}$ satisfy certain conditions.

Since strain components only determine the relative positions of points in the body, and since any rigid-body motion corresponds to zero strain, we expect the solution $u_i$ can be determined only up to an arbitrary rigid-body motion. But, if $e_{ij}$ were specified arbitrarily, we may expect that something like the cases shown in Fig. 4.6:1 may happen. Here a continuous triangle (portion of material in a body) is given. If we deform it by following an arbitrarily specified strain field starting from the point A, we might end at the points C and D either with a gap between them or with overlapping of material. For a single-valued continuous solution to exist (up to a rigid-body motion), the ends C and D must meet perfectly in the strained configuration. This cannot be guaranteed unless the specified strain field along the edges of the triangle obeys certain conditions.

Fig. 4.6:1. Illustrations for the requirement of compatibility. (Panels: unstrained, strained, strained.)


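The conditions of integrability of Eqs. (1) are called the compatibility conditions. They are conditions to be satisfied by the strain components $e_{ij}$ and can be obtained by eliminating $u_i$ from Eqs. (1).

The nonlinear differential Eqs. (1) are difficult to handle, so first let us consider the linear infinitesimal strain case

$$ e_{ij} = \frac{1}{2}\left(\frac{\partial u_i}{\partial x_j} + \frac{\partial u_j}{\partial x_i}\right), \quad \text{i.e.,} \quad e_{ij} = \frac{1}{2}(u_{i,j} + u_{j,i}), \tag{2} $$

where an index $i$ following a comma indicates partial differentiation with respect to $x_i$. By differentiation of Eq. (2), we have

$$ e_{ij,kl} = \frac{1}{2}(u_{i,jkl} + u_{j,ikl}). $$

Interchanging subscripts, we have

$$ e_{kl,ij} = \frac{1}{2}(u_{k,lij} + u_{l,kij}), $$

$$ e_{jl,ik} = \frac{1}{2}(u_{j,lik} + u_{l,jik}), $$

$$ e_{ik,jl} = \frac{1}{2}(u_{i,kjl} + u_{k,ijl}). $$

From these we verify at once that

$$ e_{ij,kl} + e_{kl,ij} - e_{ik,jl} - e_{jl,ik} = 0. \tag{3} $$

This is the equation of compatibility of St. Venant, first obtained by him in 1860.

Of the 81 equations represented by Eq. (3), only six are essential. The rest are either identities or repetitions on account of the symmetry of $e_{ij}$ and of $e_{kl}$. The six equations written in unabridged notations are

$$
\begin{aligned}
\frac{\partial^2 e_{xx}}{\partial y\,\partial z} &= \frac{\partial}{\partial x}\left(-\frac{\partial e_{yz}}{\partial x} + \frac{\partial e_{zx}}{\partial y} + \frac{\partial e_{xy}}{\partial z}\right), \\
\frac{\partial^2 e_{yy}}{\partial z\,\partial x} &= \frac{\partial}{\partial y}\left(-\frac{\partial e_{zx}}{\partial y} + \frac{\partial e_{xy}}{\partial z} + \frac{\partial e_{yz}}{\partial x}\right), \\
\frac{\partial^2 e_{zz}}{\partial x\,\partial y} &= \frac{\partial}{\partial z}\left(-\frac{\partial e_{xy}}{\partial z} + \frac{\partial e_{yz}}{\partial x} + \frac{\partial e_{zx}}{\partial y}\right), \\
2\,\frac{\partial^2 e_{xy}}{\partial x\,\partial y} &= \frac{\partial^2 e_{xx}}{\partial y^2} + \frac{\partial^2 e_{yy}}{\partial x^2}, \\
2\,\frac{\partial^2 e_{yz}}{\partial y\,\partial z} &= \frac{\partial^2 e_{yy}}{\partial z^2} + \frac{\partial^2 e_{zz}}{\partial y^2}, \\
2\,\frac{\partial^2 e_{zx}}{\partial z\,\partial x} &= \frac{\partial^2 e_{zz}}{\partial x^2} + \frac{\partial^2 e_{xx}}{\partial z^2}.
\end{aligned}
\tag{4}
$$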
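
A quick numerical check of the fourth of Eqs. (4), the only one that survives in plane problems, is sketched below. The two strain fields are hypothetical examples chosen here: the first is derived from the displacements $u = xy^2$, $v = x^3$ and must therefore be compatible; the second prescribes $e_{xx} = y^2$ alone and is not. Derivatives are approximated by central differences, so the residual is only expected to vanish to within rounding error.

```python
def d2_xx(f, x, y, h=1e-2):
    """Central-difference estimate of d2f/dx2."""
    return (f(x + h, y) - 2.0 * f(x, y) + f(x - h, y)) / h**2

def d2_yy(f, x, y, h=1e-2):
    """Central-difference estimate of d2f/dy2."""
    return (f(x, y + h) - 2.0 * f(x, y) + f(x, y - h)) / h**2

def d2_xy(f, x, y, h=1e-2):
    """Central-difference estimate of d2f/dxdy."""
    return (f(x + h, y + h) - f(x + h, y - h)
            - f(x - h, y + h) + f(x - h, y - h)) / (4.0 * h**2)

def residual(exx, eyy, exy, x, y):
    """d2exx/dy2 + d2eyy/dx2 - 2 d2exy/dxdy; zero for a compatible plane field."""
    return d2_yy(exx, x, y) + d2_xx(eyy, x, y) - 2.0 * d2_xy(exy, x, y)

# Hypothetical compatible field, obtained from u = x*y**2, v = x**3.
compatible = (lambda x, y: y**2, lambda x, y: 0.0, lambda x, y: x * y + 1.5 * x**2)
# Hypothetical incompatible field: exx = y**2 with no accompanying shear.
incompatible = (lambda x, y: y**2, lambda x, y: 0.0, lambda x, y: 0.0)

print(residual(*compatible, 0.3, -0.7), residual(*incompatible, 0.3, -0.7))
```

A nonzero residual at any point is enough to show that no single-valued displacement field produces the prescribed strains.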

These conditions are derived for infinitesimal strains referred to rectangular Cartesian coordinates. If the general curvilinear coordinates are used, we define the infinitesimal strain by

$$ e_{ij} = \frac{1}{2}\left(u_i|_j + u_j|_i\right), \tag{5} $$

where $u_i|_j$ is the covariant derivative of $u_i$, etc. The corresponding compatibility condition is then

$$ e_{ij}|_{kl} + e_{kl}|_{ij} - e_{ik}|_{jl} - e_{jl}|_{ik} = 0. \tag{6} $$

If, however, the strain is finite so that $e_{ij}$ is given by Eq. (1) in rectangular Cartesian coordinates, or by

$$ e_{ij} = \frac{1}{2}\left[u_i|_j + u_j|_i - u_\alpha|_i\,u^\alpha|_j\right], \tag{7} $$

in general coordinates, then it is necessary to use a new method of derivation. A successful method uses the basic concept that the compatibility conditions say that our body, initially situated in a three-dimensional Euclidean space, must remain in the Euclidean space after deformation. A mathematical statement to this effect, expressed in terms of the strain components, gives the compatibility conditions. (See Probs. 2.31 and 4.9. Full expressions can be found in Green and Zerna,$^{1.2}$ Theoretical Elasticity, p. 62.)

Let us now return to the question posed at the beginning of this section and inquire whether conditions (3) and (4) are sufficient to assure the existence of a single-valued continuous solution of the differential Eqs. (2) up to a rigid-body motion. The answer is affirmative. Various proofs are available, the simplest having been given by E. Cesaro$^{4.1}$ in 1906. (See notes in Bibliography 4.1.) The proof may proceed as follows.

Let $P_0(x_1^0, x_2^0, x_3^0)$ be a point at which the displacements $u_i^0$ and the components of rotation $\omega_{ij}^0$ are specified. The displacement $u_i$ at an arbitrary point $\bar{P}(\bar{x}_1, \bar{x}_2, \bar{x}_3)$ is obtained by a line integral along a continuous rectifiable curve C joining $P_0$ and $\bar{P}$,


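$$ u_i(\bar{x}_1, \bar{x}_2, \bar{x}_3) = u_i^0 + \int_{P_0}^{\bar{P}} du_i = u_i^0 + \int_{P_0}^{\bar{P}} \frac{\partial u_i}{\partial x_k}\,dx_k. \tag{8} $$

But

$$ \frac{\partial u_i}{\partial x_k} = \frac{1}{2}\left(\frac{\partial u_i}{\partial x_k} + \frac{\partial u_k}{\partial x_i}\right) + \frac{1}{2}\left(\frac{\partial u_i}{\partial x_k} - \frac{\partial u_k}{\partial x_i}\right) = e_{ik} - \omega_{ik}, \tag{9} $$

according to the definitions of the infinitesimal strain and rotation tensors, $e_{ij}$ and $\omega_{ij}$, respectively, as given by Eqs. (4.2:6) and (4.4:1). Hence, Eq. (8) becomes

$$ u_i(\bar{x}_1, \bar{x}_2, \bar{x}_3) = u_i^0 + \int_{P_0}^{\bar{P}} e_{ik}\,dx_k - \int_{P_0}^{\bar{P}} \omega_{ik}\,dx_k. \tag{10} $$

We must eliminate $\omega_{ik}$ in terms of $e_{ik}$. To achieve this, the last integral is integrated by parts to yield

$$ \int_{P_0}^{\bar{P}} \omega_{ik}\,dx_k = \int_{P_0}^{\bar{P}} \omega_{ik}\,d(x_k - \bar{x}_k) = (\bar{x}_k - x_k^0)\,\omega_{ik}^0 + \int_{P_0}^{\bar{P}} (\bar{x}_k - x_k)\,\frac{\partial \omega_{ik}}{\partial x_l}\,dx_l, \tag{11} $$

where $\omega_{ik}^0$ is the value of $\omega_{ik}$ at $P_0$. In Eq. (11) we have replaced $dx_k$ by $dx_k - d\bar{x}_k$, to have $\omega_{ik}^0$ appear outside the integral instead of $\omega_{ik}$ at $\bar{P}$. This is permissible, since $\bar{P}$ is fixed with respect to the integration so that $d\bar{x}_k = 0$. Now, from the definition (4.4:1),

$$
\begin{aligned}
-\frac{\partial \omega_{ik}}{\partial x_l}
&= \frac{1}{2}\left(\frac{\partial^2 u_i}{\partial x_l\,\partial x_k} - \frac{\partial^2 u_k}{\partial x_l\,\partial x_i}\right) \\
&= \frac{1}{2}\left(\frac{\partial^2 u_i}{\partial x_k\,\partial x_l} + \frac{\partial^2 u_l}{\partial x_k\,\partial x_i}\right) - \frac{1}{2}\left(\frac{\partial^2 u_k}{\partial x_i\,\partial x_l} + \frac{\partial^2 u_l}{\partial x_i\,\partial x_k}\right) \\
&= \frac{\partial}{\partial x_k}\,\frac{1}{2}\left(\frac{\partial u_i}{\partial x_l} + \frac{\partial u_l}{\partial x_i}\right) - \frac{\partial}{\partial x_i}\,\frac{1}{2}\left(\frac{\partial u_k}{\partial x_l} + \frac{\partial u_l}{\partial x_k}\right) \\
&= \frac{\partial e_{il}}{\partial x_k} - \frac{\partial e_{kl}}{\partial x_i}.
\end{aligned}
\tag{12}
$$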
Substitution of Eqs. (11) and (12) into Eq. (10) yields

$$ u_i(\bar{x}_1, \bar{x}_2, \bar{x}_3) = u_i^0 - (\bar{x}_k - x_k^0)\,\omega_{ik}^0 + \int_{P_0}^{\bar{P}} U_{il}\,dx_l, \tag{13} $$

where

$$ U_{il} = e_{il} + (\bar{x}_k - x_k)\left(\frac{\partial e_{il}}{\partial x_k} - \frac{\partial e_{kl}}{\partial x_i}\right). \tag{14} $$

Now, if $u_i(\bar{x}_1, \bar{x}_2, \bar{x}_3)$ is to be single-valued and continuous, the last integral in Eq. (13) must depend only on the end points $P_0$, $\bar{P}$ and must not be dependent on the path of integration C. Therefore, the integrand must be an exact differential. (See Sec. 4.7 below.) The necessary condition that $U_{il}\,dx_l$ be an exact differential is (see Sec. 4.7)

$$ \frac{\partial U_{il}}{\partial x_m} - \frac{\partial U_{im}}{\partial x_l} = 0. \tag{15} $$

This condition also suffices in assuring the single-valuedness of the integral if the region is simply connected. However, for a multiply connected region, some additional conditions must be imposed; they will be discussed in the next section.

Now we obtain, from Eqs. (14) and (15),

$$
\begin{aligned}
&\frac{\partial e_{il}}{\partial x_m} - \delta_{km}\left(\frac{\partial e_{il}}{\partial x_k} - \frac{\partial e_{kl}}{\partial x_i}\right) + (\bar{x}_k - x_k)\left(\frac{\partial^2 e_{il}}{\partial x_k\,\partial x_m} - \frac{\partial^2 e_{kl}}{\partial x_i\,\partial x_m}\right) \\
&\quad - \frac{\partial e_{im}}{\partial x_l} + \delta_{kl}\left(\frac{\partial e_{im}}{\partial x_k} - \frac{\partial e_{km}}{\partial x_i}\right) - (\bar{x}_k - x_k)\left(\frac{\partial^2 e_{im}}{\partial x_k\,\partial x_l} - \frac{\partial^2 e_{km}}{\partial x_i\,\partial x_l}\right) = 0.
\end{aligned}
$$

Fig. 4.6:2. Paths of integration.

The terms not multiplied by $\bar{x}_k - x_k$ cancel each other, and the terms multiplied by $\bar{x}_k - x_k$ form exactly the compatibility condition given by Eq. (3). Hence, Eq. (15) is satisfied if the compatibility conditions (3) are satisfied. Thus, we have proved that the satisfaction of the compatibility conditions (3) is necessary and sufficient for the displacement to be single-valued in a simply connected region. For a multiply connected region, Eq. (3) is necessary, but no longer sufficient. To guarantee single-valuedness of displacement for an assumed strain field, some additional conditions as described in Sec. 4.7 must be imposed.
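
The line-integral formula (13) with the integrand (14) can be tried out on a simple case. The sketch below is a two-dimensional illustration under assumptions made here, not the authors' procedure: the strain field is the one derived from the hypothetical displacement $u = (xy, x^2)$, the points `P0` and `Pbar` are arbitrary, and the integral is evaluated by the midpoint rule along two different polyline paths. Because the strain field is compatible, both paths should recover the same displacement.

```python
def strain(x):
    """e[i][l] of the hypothetical field u = (x*y, x**2)."""
    return [[x[1], 1.5 * x[0]], [1.5 * x[0], 0.0]]

# d e_il / d x_k for the same field, stored as DE[i][l][k] (constant here).
DE = [[[0.0, 1.0], [1.5, 0.0]],
      [[1.5, 0.0], [0.0, 0.0]]]

def U(i, l, x, xbar):
    """Integrand U_il of Eq. (14), evaluated at x for the end point xbar."""
    return strain(x)[i][l] + sum((xbar[k] - x[k]) * (DE[i][l][k] - DE[k][l][i])
                                 for k in range(2))

def cesaro(path, u0, w0, n=200):
    """Displacement at path[-1] from Eq. (13). `path` is a polyline starting at P0;
    u0 and w0 are the displacement and the rotation tensor at P0."""
    x0, xbar = path[0], path[-1]
    u = [u0[i] - sum((xbar[k] - x0[k]) * w0[i][k] for k in range(2)) for i in range(2)]
    for a, b in zip(path[:-1], path[1:]):
        for s in range(n):                      # midpoint rule on each segment
            t = (s + 0.5) / n
            x = [a[l] + t * (b[l] - a[l]) for l in range(2)]
            for i in range(2):
                u[i] += sum(U(i, l, x, xbar) * (b[l] - a[l]) / n for l in range(2))
    return u

P0, Pbar = (1.0, 0.5), (2.0, 3.0)
u0 = [P0[0] * P0[1], P0[0] ** 2]                 # displacement at P0
w0 = [[0.0, 0.5 * P0[0]], [-0.5 * P0[0], 0.0]]   # w_12 = (dv/dx - du/dy)/2 = x/2
straight = cesaro([P0, Pbar], u0, w0)
dogleg = cesaro([P0, (Pbar[0], P0[1]), Pbar], u0, w0)
print(straight, dogleg)
```

Both results should coincide with the displacement $(xy, x^2)$ evaluated at `Pbar`. Replacing `strain` and `DE` by an incompatible field would make the two paths disagree.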

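Problem 4.1. Consider the two-dimensional case in unabridged notations,

$$ \frac{\partial u}{\partial x} = e_{xx}, \qquad \frac{\partial v}{\partial y} = e_{yy}, \qquad \frac{1}{2}\left(\frac{\partial u}{\partial y} + \frac{\partial v}{\partial x}\right) = e_{xy}. $$

Prove that in order to guarantee the solutions $u(x, y)$, $v(x, y)$ to be single-valued in a simply connected domain $D(x, y)$, the functions $e_{xx}(x, y)$, $e_{yy}(x, y)$, $e_{xy}(x, y)$ must satisfy the compatibility condition

$$ \frac{\partial^2 e_{xx}}{\partial y^2} + \frac{\partial^2 e_{yy}}{\partial x^2} = 2\,\frac{\partial^2 e_{xy}}{\partial x\,\partial y}. $$

Prove the sufficiency of this condition in unabridged notations.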


4.7. MULTIPLY CONNECTED REGIONS 

As necessary conditions, the compatibility equations derived in Sec. 4.6 apply to any continuum. But the sufficiency proof at the end of the preceding section requires that the region be simply connected. For a multiply connected continuum, additional conditions must be imposed.

A region is simply connected if any simple closed contour drawn in the region can be shrunk continuously to a point without leaving the region; otherwise the region is said to be multiply connected.

Figure 4.7:1(a) illustrates a simply connected region $\mathcal{R}$ in which an arbitrary closed curve C can be shrunk continuously to a point without leaving $\mathcal{R}$. Figures 4.7:1(b) and 4.7:2 illustrate doubly connected regions in two and three dimensions, respectively. Figures 4.7:1(c) and (d) show how multiply connected regions can be made simply connected by introducing cuts (imaginary boundaries).

Fig. 4.7:1. Two-dimensional examples of simply and multiply connected regions. In (a), region $\mathcal{R}$ is simply connected; in (b), $\mathcal{R}$ is doubly connected.

Fig. 4.7:2. A torus doubly connected in three dimensions.

The condition (4.6:15) is based on Stokes' theorem, and we shall review the reasoning in preparation for discussing multiply connected regions. In two dimensions, Stokes' theorem for continuous and differentiable functions $P(x, y)$, $Q(x, y)$ may be written as

$$ \int_C (P\,dx + Q\,dy) = \iint_{\mathcal{R}} \left(\frac{\partial Q}{\partial x} - \frac{\partial P}{\partial y}\right) dx\,dy, \tag{1} $$

for a simply connected region $\mathcal{R}$ on a plane $x, y$ bounded by a closed contour C. In three dimensions, Stokes' theorem states, for three functions P, Q, R,

$$ \int_C (P\,dx + Q\,dy + R\,dz) = \iint_{\mathcal{R}} \left[\left(\frac{\partial R}{\partial y} - \frac{\partial Q}{\partial z}\right)\cos(\nu, x) + \left(\frac{\partial P}{\partial z} - \frac{\partial R}{\partial x}\right)\cos(\nu, y) + \left(\frac{\partial Q}{\partial x} - \frac{\partial P}{\partial y}\right)\cos(\nu, z)\right] dS, \tag{2} $$

for a simply connected region $\mathcal{R}$ on a surface S bounded by a closed contour C. The factor $\cos(\nu, x)$ is the cosine of the angle between the x-axis and the vector $\boldsymbol{\nu}$ normal to the surface S. When the conditions

$$ \frac{\partial R}{\partial y} - \frac{\partial Q}{\partial z} = 0, \qquad \frac{\partial P}{\partial z} - \frac{\partial R}{\partial x} = 0, \qquad \frac{\partial Q}{\partial x} - \frac{\partial P}{\partial y} = 0, \tag{3} $$

are imposed, the line integral on the left-hand side of Eq. (2) vanishes for every possible closed contour C,

$$ \int_C (P\,dx + Q\,dy + R\,dz) = 0. $$

When this result is applied to any two possible paths of integration between a fixed point $P_0$ and a variable point $\bar{P}$, as in Fig. 4.6:2, we see that the integral

$$ F(\bar{P}) = \int_{P_0}^{\bar{P}} (P\,dx + Q\,dy + R\,dz) \tag{4} $$

must be a single-valued function of the point $\bar{P}$, independent of the path of integration; because in Fig. 4.6:2 the curve $P_0 C \bar{P} L P_0$ forms a simple contour. Hence, with the integrand of Eq. (4) understood,

$$ \oint_{P_0 C \bar{P} L P_0} = \int_{P_0 C \bar{P}} + \int_{\bar{P} L P_0} = \int_{P_0 C \bar{P}} - \int_{P_0 L \bar{P}} = 0. $$

Thus, the last two integrals are equal. This is the result used in Sec. 4.6 
to conclude that the compatibility conditions as posed are sufficient for 
uniqueness of displacements in a simply connected region. 

Let us now consider a doubly connected region on a plane as shown in Fig. 4.7:1(b). After a cut is made as in Fig. 4.7:1(c), the region $\mathcal{R}$ is simply connected. Let the two sides of the cut be denoted by (+) and (−) and let the points $A^+$ and $A^-$ be directly opposite to each other on the two sides of the cut. The line integrals

$$ F(A^-) = \int_{P_0}^{A^-} (P\,dx + Q\,dy), \qquad F(A^+) = \int_{P_0}^{A^+} (P\,dx + Q\,dy), $$

integrated from a fixed point $P_0$ along paths in the cut region, are both single-valued. But, since $A^+$ and $A^-$ are on the opposite sides of the cut, the values of the integrals need not be the same. Let the chain-dot path $P_0 A^+$ in Fig. 4.7:1(c) be deformed into $P_0 A^-$ plus a line $\mathcal{L}$ connecting $A^-$ to $A^+$, encircling the inner boundary. Then

$$ \int_{P_0}^{A^+} = \int_{P_0}^{A^-} + \int_{\mathcal{L}}. $$

Hence,

$$ F(A^+) - F(A^-) = \int_{\mathcal{L}} (P\,dx + Q\,dy). \tag{5} $$

Similar consideration for another pair of arbitrary points $B^+$ and $B^-$ across the boundary leads to the same result,

$$ F(B^+) - F(B^-) = \int_{\mathcal{L}'} (P\,dx + Q\,dy). \tag{6} $$

The right-hand sides of Eqs. (5) and (6) are the same if P and Q are single-valued in the region $\mathcal{R}$, because a line $\mathcal{L}'$ connecting $B^-$ to $B^+$, encircling the inner boundary, may be deformed into one that goes from $B^-$ to $A^-$ along the (−) side of the cut, then follows $\mathcal{L}$ from $A^-$ to $A^+$, and finally moves from $A^+$ to $B^+$ along the (+) side of the cut. If P and Q are single-valued, the integral from $B^-$ to $A^-$ cancels exactly the one from $A^+$ to $B^+$.

From these considerations, it is clear that the function $F(\bar{P})$ defined by the line integral (4) will be single-valued in a doubly connected region if the supplementary condition

$$ \int_{\mathcal{L}} (P\,dx + Q\,dy) = 0 \tag{7} $$

is imposed, where $\mathcal{L}$ is a closed contour that goes from one side of a cut to the other, without leaving the region.
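
Why the supplementary condition (7) is not automatic can be seen from a standard example, which is added here as an illustration and is not taken from the text: the functions $P = -y/(x^2 + y^2)$, $Q = x/(x^2 + y^2)$ satisfy $\partial Q/\partial x - \partial P/\partial y = 0$ everywhere in a plane region with a hole around the origin, yet their line integral around a contour encircling the hole does not vanish. The sketch below evaluates the integral numerically on two circles.

```python
import math

def circulation(P, Q, cx, cy, rho, n=4000):
    """Midpoint-rule estimate of the line integral of P dx + Q dy taken
    counterclockwise around the circle of radius rho centered at (cx, cy)."""
    total = 0.0
    dt = 2.0 * math.pi / n
    for s in range(n):
        t = (s + 0.5) * dt
        x = cx + rho * math.cos(t)
        y = cy + rho * math.sin(t)
        dx = -rho * math.sin(t) * dt
        dy = rho * math.cos(t) * dt
        total += P(x, y) * dx + Q(x, y) * dy
    return total

# Example functions (an assumption of this sketch); curl-free away from the origin.
P = lambda x, y: -y / (x * x + y * y)
Q = lambda x, y: x / (x * x + y * y)

around_hole = circulation(P, Q, 0.0, 0.0, 1.0)   # contour encircles the hole
away_from_hole = circulation(P, Q, 3.0, 0.0, 1.0)  # contour can be shrunk to a point
print(around_hole, away_from_hole)
```

The first value is close to $2\pi$ and the second is close to zero, so the integral (4) formed with these functions is multivalued in the doubly connected region unless a condition like (7) holds.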

The same consideration can be applied generally to multiply connected regions in two or three dimensions. An $(m + 1)$-ply connected region can be made simply connected by $m$ cuts. In the cut, simply connected region, $m$ independent simple contours $\mathcal{L}_1, \mathcal{L}_2, \ldots, \mathcal{L}_m$ can be drawn. Each $\mathcal{L}_i$ starts from one side of a cut, and ends on the other side of the same cut. All cuts are thus embraced by the $\mathcal{L}$'s. Then the single-valuedness of the function $F(\bar{P})$ defined by the line integral (4) can be assured by imposing, in addition to the conditions (3), $m$ supplementary conditions

$$ \int_{\mathcal{L}_1} (P\,dx + Q\,dy + R\,dz) = \int_{\mathcal{L}_2} (\;\cdot\;) = \cdots = \int_{\mathcal{L}_m} (\;\cdot\;) = 0. \tag{8} $$

When these results are applied to the compatibility problem of Sec. 4.6 we see that if the body is $(m + 1)$-ply connected, the single-valuedness of the displacement function $u_i(\bar{x}_1, \bar{x}_2, \bar{x}_3)$ given by Eq. (4.6:13) requires $m \times 3$ supplementary conditions:

$$ \int_{\mathcal{L}_1} U_{il}\,dx_l = \cdots = \int_{\mathcal{L}_m} U_{il}\,dx_l = 0, \qquad i = 1, 2, 3, \tag{9} $$

where

$$ U_{il} = e_{il} + (\bar{x}_k - x_k)\left(\frac{\partial e_{il}}{\partial x_k} - \frac{\partial e_{kl}}{\partial x_i}\right), \tag{10} $$

and $\mathcal{L}_1, \ldots, \mathcal{L}_m$ are $m$ contours in $\mathcal{R}$ as described above.

4.8. MULTIVALUED DISPLACEMENTS 

Some problems of thermal stress, initial strain, rigid inclusions, etc., can be formulated in terms of multivalued displacements. For example, if the horseshoe in Fig. 4.8:1(a) is strained so that the faces $S_1$ and $S_2$ come into contact and are then welded together, the result is the doubly connected body shown in Fig. 4.8:1(b). If the strain of the body (b) is known, we can use the procedure described in Sec. 4.6 to compute displacements and generally we will obtain the configuration of Fig. 4.8:1(a). In this case a single point P on the welded surface S will open up into two points $P_1$ and $P_2$ on the two open ends. Thus the displacement at P is double-valued.

Fig. 4.8:1. Possibility and use of multivalued displacements.

In the example posed here, the strain components and their derivatives may be discontinuous on the two sides of the interface S of Fig. 4.8:1(b). Hence, the value of a line integral around a contour in the body that crosses the interface S will depend on where the crossing point is. In other words,

$$ u_i(P_2) - u_i(P_1) = \int_{\mathcal{L}(P)} U_{il}\,dx_l \tag{1} $$

depends on the location of P, where $\mathcal{L}(P)$ is a contour passing through the point P as indicated in the figure and $U_{il}$ is given in Eq. (4.6:14) or (4.7:10).

On the other hand, if we have a ring as in Fig. 4.8:1(b) and apply some load on it and ask what is the corresponding deformation, we must first decide whether some crack is allowed to be opened or not. If the ring is to remain integral, the deformation must be such that the displacement field is single-valued. In this case the strain due to the load must be such that

$$ \int_{\mathcal{L}(P)} U_{il}\,dx_l = 0, \tag{2} $$

for a contour $\mathcal{L}(P)$ as shown in Fig. 4.8:1(b). Of course, Eq. (2) in conjunction with the compatibility conditions (4.6:4) assures that the line integral $\int U_{il}\,dx_l$ will vanish on any contour in the ring. Thus the point P in Eq. (2) is arbitrary.

These examples show that both single-valued displacement fields and multivalued displacement fields have proper use in answering appropriate questions.

4.9. PROPERTIES OF THE STRAIN TENSOR 

The symmetric strain tensor $e_{ij}$ has many properties in common with the stress tensor. Thus, the existence of real-valued principal strains and principal planes, the Mohr circle for strain, the octahedral strain, strain deviations, and the strain invariants need no further discussion.

The first strain invariant,

$$ e = e_{11} + e_{22} + e_{33} = \text{sum of principal strains}, \tag{1} $$

has a simple geometrical meaning in the case of infinitesimal strain. Let a volume element consist of a rectangular parallelepiped with edges parallel to the principal directions of strain. Let the length of the edges be $l_1$, $l_2$, $l_3$ in the unstrained state. Let $e_{11}$, $e_{22}$, $e_{33}$ be the principal strains. In the strained configuration, the edges become of lengths $l_1(1 + e_{11})$, $l_2(1 + e_{22})$, $l_3(1 + e_{33})$ and remain orthogonal to each other. Hence, for small strain the change of volume is

$$ \Delta V = l_1 l_2 l_3 (1 + e_{11})(1 + e_{22})(1 + e_{33}) - l_1 l_2 l_3 \doteq l_1 l_2 l_3 (e_{11} + e_{22} + e_{33}). $$

Therefore,

$$ e = e_{ii} = \Delta V / V. \tag{2} $$

Thus, in the infinitesimal strain theory the first invariant represents the expansion in volume per unit volume. For this reason, $e$ is called the dilatation.

If a two-dimensional strain state (plane strain) is considered, with $u_3$ (or $w$) $= 0$, the first invariant $e_{11} + e_{22}$ represents the change of area per unit area of the surface under strain.

For finite strain, the sum of principal strains does not have such a simple 
interpretation. 
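
How quickly the first invariant departs from the actual volume change can be seen in a small numerical comparison. The sketch below is illustrative only; the strain magnitudes are hypothetical. It evaluates the exact relative volume change of the parallelepiped described above next to the dilatation of Eq. (2).

```python
def exact_volume_change(e11, e22, e33):
    """Relative volume change of a box whose edges are stretched by the
    principal strains e11, e22, e33 and remain orthogonal."""
    return (1.0 + e11) * (1.0 + e22) * (1.0 + e33) - 1.0

def dilatation(e11, e22, e33):
    """First strain invariant, Eq. (1)."""
    return e11 + e22 + e33

for scale in (1e-4, 1e-2, 0.2):   # hypothetical strain magnitudes
    e = (scale, -0.3 * scale, 0.5 * scale)
    print(scale, dilatation(*e), exact_volume_change(*e))
```

The difference between the two columns consists of the products of strains, so it is of second order in the strain magnitude and negligible only in the infinitesimal theory.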


PROBLEMS 

4.2. A state of deformation in which all the strain components are constant throughout the body is called a homogeneous deformation. What is the equation, of the type $f(x, y, z) = 0$, of a surface which becomes a sphere $x^2 + y^2 + z^2 = r^2$ after a homogeneous deformation? What kind of surface is it? ($x$, $y$, $z$ are rectangular Cartesian coordinates.)

4.3. Show that a strain state described in rectangular Cartesian coordinates

$$ e_{xx} = k(x^2 + y^2), \qquad e_{yy} = k(y^2 + z^2), \qquad e_{xy} = k'xyz, $$

$$ e_{xz} = e_{yz} = e_{zz} = 0, $$

where $k$, $k'$ are small constants, is not a possible state of strain for a continuum.

4.4. A solid is heated nonuniformly to a temperature $T(x, y, z)$. If each element has unrestrained thermal expansion, the strain components will be

$$ e_{xx} = e_{yy} = e_{zz} = \alpha T, $$

$$ e_{xy} = e_{yz} = e_{zx} = 0, $$

where $x$, $y$, $z$ are rectangular Cartesian coordinates and $\alpha$ is the thermal expansion coefficient (a constant). Prove that this can only occur when $T$ is a linear function of $x$, $y$, $z$; i.e., $T = c_1 x + c_2 y + c_3 z + c_4$, where $c_1, \ldots, c_4$ are constants.

4.5. A soap-film-like membrane stretched over a ring is deformed by uniform pressure into a hemisphere of the same radius (Fig. P4.5). In so doing, a point P on the flat surface over the ring is deformed into a point Q on the sphere. Determine a mathematical transformation from P to Q. Note: A “soap-film-like” membrane shall be defined as a membrane in which the tension is a constant, isotropic, and independent of the stretching of the membrane.

Let P be referred to plane polar coordinates $(r, \theta)$ and Q be referred to spherical polar coordinates $(R, \theta, \phi)$ with the same origin. We assume that $R = a$, $\theta = \theta$. The problem is to determine the angle $\phi$ as a function of $r$.

Figs. P4.5 and P4.6.

If the thickness of the film is uniform originally, it will not remain so after being blown into spherical shape. Determine the thickness distribution in the final configuration under the assumption of material incompressibility.

4.6. Consider a two-dimensional inflatable structure which consists of an infinitely long tube. When folded flat, the cross section of the tube appears like a line segment AB of length 2 in Fig. P4.6. When blown up with an internal gas pressure $p$, the tube assumes the form of an ellipse with major axis $\alpha$ and minor axis $\beta$.

Questions: 

(a) Devise (arbitrarily, but specifically) a law of transformation which 
transforms the folded tube into the elliptic cylinder. 

(b) Compute the components of membrane strain in the midsurface of the 
tube referred to the original folded tube (components of Green’s strain 
tensor). 

(c) Compute the corresponding components of strain referred to the blown 
up tube (components of Almansi’s strain tensor). 

(d) For equilibrium, what must be the distribution of membrane tension 
force T in the blown-up tube? (The internal pressure is uniform.) 

(e) Assume that $T = k h e_1$, where $T$ is the membrane tension, $e_1$ is the strain component of Almansi on the midsurface of the tube wall, $h$ is the wall thickness of the blown-up tube, and $k$ is a constant. Determine $h$.

(f) If the tube material is incompressible (like natural rubber), what would have to be the initial thickness distribution?

4.10. PHYSICAL COMPONENTS

In Sec. 2.16 we defined the physical components of a vector $\mathbf{v}$ as the components of $\mathbf{v}$ resolved in the directions of a set of unit vectors which are parallel either to the set of base vectors or to the set of reciprocal base vectors. Thus,

$$
\begin{array}{lcc}
\text{Tensor components:} & v^r & v_r \\
\text{Physical components:} & v^r\sqrt{g_{rr}} & v_r\sqrt{g^{rr}} \\
\text{Reference base vectors:} & \mathbf{g}_r & \mathbf{g}^r
\end{array}
\tag{1}
$$

All the physical components have uniform physical dimensions; but they do not transform conveniently under coordinate transformations.

In Sec. 3.13 we defined the physical components of the stress tensors $\tau^{ij}$ as the components of the stress vector $\overset{i}{\mathbf{T}}$ resolved in the directions of unit vectors parallel to the base vectors. If the physical components of $\tau^{ij}$ are denoted by $\sigma^{ij}$, we have

$$ \sigma^{ij} = \sqrt{\frac{g_{jj}}{g^{ii}}}\;\tau^{ij}, \qquad i, j \text{ not summed}. \tag{2} $$

Correspondingly, when $\overset{i}{\mathbf{T}}$ is resolved with respect to the reciprocal base vectors, the physical components are

$$ \sigma^i_j = \sqrt{\frac{g^{jj}}{g^{ii}}}\;\tau^i_j, \qquad i, j \text{ not summed}. \tag{3} $$

All components $\sigma^{ij}$, $\sigma^i_j$ have uniform physical dimensions. The physical components of the contravariant, covariant, and mixed tensors are the same if the coordinates are orthogonal.

Now consider the strain tensor $e_{ij}$, which is covariant in the most natural form of definition (Sec. 4.1). If $ds$ is the length of an element $(d\theta^1, d\theta^2, d\theta^3)$ at a point $(\theta^1, \theta^2, \theta^3)$, and $ds_0$ is the length of the same element before the deformation takes place, then

$$ ds^2 - ds_0^2 = 2e_{ij}\,d\theta^i\,d\theta^j. \tag{4} $$

The differential element $(d\theta^i)$ is a vector, whose physical components are $\sqrt{g_{ii}}\,d\theta^i$ ($i$ not summed). We may rewrite (4) as

$$ ds^2 - ds_0^2 = 2\sum_{i=1}^{3}\sum_{j=1}^{3} \frac{e_{ij}}{\sqrt{g_{ii}}\,\sqrt{g_{jj}}}\,\left(\sqrt{g_{ii}}\,d\theta^i\right)\left(\sqrt{g_{jj}}\,d\theta^j\right). \tag{5} $$

Since all the components $\sqrt{g_{ii}}\,d\theta^i$, $\sqrt{g_{jj}}\,d\theta^j$ have the dimension of length, we may define the physical strain components

$$ \epsilon_{ij} = \frac{e_{ij}}{\sqrt{g_{ii}}\,\sqrt{g_{jj}}}, \qquad i, j \text{ not summed}, \tag{6} $$

which are all dimensionless.

The rules of forming physical components now appear rather complicated. The appearance can be made more systematic if we utilize the relation given in Eq. (2.14:9), i.e.,

$$ \mathbf{g}^i \cdot \mathbf{g}_j = \delta^i_j, \tag{7} $$

which implies

$$ g^{ii} = (g_{ii})^{-1}, \qquad i \text{ not summed}. \tag{8} $$

Then Eqs. (2), (3), and (6) may be written as

$$ \sigma^{ij} = \sqrt{g_{ii}}\,\sqrt{g_{jj}}\;\tau^{ij}, \tag{9} $$

$$ \sigma^i_j = \sqrt{g_{ii}}\,\sqrt{g^{jj}}\;\tau^i_j, \tag{10} $$

$$ \epsilon_{ij} = \sqrt{g^{ii}}\,\sqrt{g^{jj}}\;e_{ij}, \tag{11} $$

where $i$, $j$ are not summed. These formulas, similar to Eq. (1), may serve as a pattern for defining the physical components of any tensor in any curvilinear coordinates. In orthogonal curvilinear coordinates, $\sigma^{ij} = \sigma^i_j$.

Let us repeat. The physical components and the tensor components of 
a physical quantity have the same geometric and physical interpretations, 
except for physical dimensions. The unification in dimensions is achieved by 
multiplying the tensor components with appropriate scale factors, which are 
related to the components of the metric tensor. The physical components 
do not transform conveniently under general transformation of coordinates. 
In practical applications, therefore, we generally write basic equations in 
the tensor form and then substitute the individual tensor components by 
their physical counterpart, if it is so desired. 
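
The scaling rules can be written out in a few lines for an orthogonal coordinate system, where the metric is diagonal. The sketch below is an illustration under that assumption; the function names and the numerical tensor components are hypothetical. It applies Eq. (6) to covariant strain components and Eq. (9) to contravariant stress components.

```python
import math

def physical_strain(e, g):
    """eps_ij = e_ij / (sqrt(g_ii) sqrt(g_jj)) for covariant e_ij, Eq. (6).
    g holds the diagonal metric components g_11, g_22, g_33."""
    return [[e[i][j] / math.sqrt(g[i] * g[j]) for j in range(3)] for i in range(3)]

def physical_stress(tau, g):
    """sigma^ij = sqrt(g_ii) sqrt(g_jj) tau^ij for contravariant tau^ij, Eq. (9)."""
    return [[tau[i][j] * math.sqrt(g[i] * g[j]) for j in range(3)] for i in range(3)]

# Hypothetical values at radius r = 2 in cylindrical coordinates (r, theta, z),
# for which g_11 = 1, g_22 = r**2, g_33 = 1.
r = 2.0
g = [1.0, r * r, 1.0]
e = [[0.01, 0.02, 0.0], [0.02, 0.04, 0.0], [0.0, 0.0, 0.005]]
tau = [[10.0, 1.5, 0.0], [1.5, 2.0, 0.0], [0.0, 0.0, 4.0]]

eps = physical_strain(e, g)
sigma = physical_stress(tau, g)
print(eps)
print(sigma)
```

Each index attached to the $\theta$ direction contributes one factor of $r$: covariant components are divided by it and contravariant components are multiplied by it, which is what makes the resulting components dimensionally uniform.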

We shall illustrate these applications in the following sections.

4.11. EXAMPLE - SPHERICAL COORDINATES

Let $x^1 = r$, $x^2 = \phi$, $x^3 = \theta$ (Fig. 4.11:1). Then

$$ ds^2 = dr^2 + r^2\,d\phi^2 + r^2\sin^2\phi\;d\theta^2, $$

$$ g_{11} = 1, \qquad g_{22} = r^2, \qquad g_{33} = r^2\sin^2\phi, \qquad g_{ij} = 0 \ \text{ if } i \ne j. $$

Fig. 4.11:1. Spherical polar coordinates.

Thus the coordinate system is orthogonal. From the definition of $g^{\alpha\beta}$, we have

$$ g^{11} = 1, \qquad g^{22} = \frac{1}{r^2}, \qquad g^{33} = \frac{1}{r^2\sin^2\phi}, \qquad g^{ij} = 0 \ \text{ if } i \ne j. $$

The Euclidean Christoffel symbols are

$$ \Gamma^1_{22} = -r, \qquad \Gamma^1_{33} = -r\sin^2\phi, $$

$$ \Gamma^2_{12} = \Gamma^2_{21} = \frac{1}{r}, \qquad \Gamma^2_{33} = -\sin\phi\cos\phi, $$

$$ \Gamma^3_{13} = \Gamma^3_{31} = \frac{1}{r}, \qquad \Gamma^3_{23} = \Gamma^3_{32} = \cot\phi, $$

all other $\Gamma^i_{jk} = 0$.
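
The Christoffel symbols listed above can be cross-checked from the metric alone. The sketch below is an illustrative numerical check, not the derivation used in the text: it assumes the standard formula $\Gamma^i_{jk} = \frac{1}{2} g^{il}(\partial_j g_{lk} + \partial_k g_{lj} - \partial_l g_{jk})$, specialized to a diagonal metric, and approximates the metric derivatives by central differences. The evaluation point is arbitrary.

```python
import math

def metric(q):
    """Diagonal metric components (g_11, g_22, g_33) at q = (r, phi, theta)."""
    r, phi, _ = q
    return [1.0, r * r, (r * math.sin(phi)) ** 2]

def christoffel(q, h=1e-6):
    """G[i][j][k] approximates Gamma^i_jk for the diagonal metric above."""
    g = metric(q)
    dg = []                                   # dg[m][i] = d g_ii / d q_m
    for m in range(3):
        qp, qm = list(q), list(q)
        qp[m] += h
        qm[m] -= h
        gp, gm = metric(qp), metric(qm)
        dg.append([(gp[i] - gm[i]) / (2.0 * h) for i in range(3)])

    def gd(i, j, m):
        """d g_ij / d q_m; off-diagonal components vanish."""
        return dg[m][i] if i == j else 0.0

    return [[[0.5 / g[i] * (gd(i, k, j) + gd(i, j, k) - gd(j, k, i))
              for k in range(3)] for j in range(3)] for i in range(3)]

q = (2.0, 0.7, 0.3)                           # arbitrary point (r, phi, theta)
G = christoffel(q)
print(G[0][1][1], G[0][2][2], G[1][0][1], G[1][2][2], G[2][0][2], G[2][1][2])
```

The printed values can be compared with $-r$, $-r\sin^2\phi$, $1/r$, $-\sin\phi\cos\phi$, $1/r$, and $\cot\phi$ at the chosen point. Swapping in another diagonal metric, such as the cylindrical one of Sec. 4.12, only requires changing `metric`.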

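Let $u_i$ be the covariant components of the displacement vector. We have the infinitesimal strain tensor components

$$ e_{ij} = \frac{1}{2}\left(u_i|_j + u_j|_i\right) = \frac{1}{2}\left(\frac{\partial u_i}{\partial x^j} + \frac{\partial u_j}{\partial x^i}\right) - \Gamma^\sigma_{ij}\,u_\sigma, $$

$$
\begin{aligned}
e_{11} &= \frac{\partial u_1}{\partial r}, \\
e_{22} &= \frac{\partial u_2}{\partial \phi} + r\,u_1, \\
e_{33} &= \frac{\partial u_3}{\partial \theta} + r\sin^2\phi\;u_1 + \sin\phi\cos\phi\;u_2, \\
e_{12} &= \frac{1}{2}\left(\frac{\partial u_1}{\partial \phi} + \frac{\partial u_2}{\partial r}\right) - \frac{1}{r}\,u_2, \\
e_{23} &= \frac{1}{2}\left(\frac{\partial u_2}{\partial \theta} + \frac{\partial u_3}{\partial \phi}\right) - \cot\phi\;u_3, \\
e_{31} &= \frac{1}{2}\left(\frac{\partial u_3}{\partial r} + \frac{\partial u_1}{\partial \theta}\right) - \frac{1}{r}\,u_3.
\end{aligned}
$$

The $u_i$ and $e_{ij}$ are tensor components. Let the corresponding physical components of the displacement vector be written as $\xi_r$, $\xi_\phi$, $\xi_\theta$ and that of the strain tensor be written as $\epsilon_{ij}$. Then, since the spherical coordinates are orthogonal,

$$ \xi_r = \sqrt{g^{11}}\;u_1 = u_1, \qquad \xi_\phi = \sqrt{g^{22}}\;u_2 = \frac{u_2}{r}, \qquad \xi_\theta = \sqrt{g^{33}}\;u_3 = \frac{u_3}{r\sin\phi}, $$

$$ \epsilon_{ij} = \sqrt{g^{ii}\,g^{jj}}\;e_{ij}, \qquad i, j \text{ not summed}. $$

Therefore,

$$
\begin{aligned}
\epsilon_{11} &= \frac{\partial \xi_r}{\partial r}, \\
\epsilon_{22} &= \frac{1}{r}\frac{\partial \xi_\phi}{\partial \phi} + \frac{\xi_r}{r}, \\
\epsilon_{33} &= \frac{1}{r\sin\phi}\frac{\partial \xi_\theta}{\partial \theta} + \frac{1}{r}\,\xi_r + \frac{\cot\phi}{r}\,\xi_\phi, \\
\epsilon_{12} &= \frac{1}{2}\left[\frac{1}{r}\frac{\partial \xi_r}{\partial \phi} + \frac{1}{r}\frac{\partial (r\xi_\phi)}{\partial r}\right] - \frac{\xi_\phi}{r} = \frac{1}{2}\left[\frac{1}{r}\frac{\partial \xi_r}{\partial \phi} + \frac{\partial \xi_\phi}{\partial r} - \frac{\xi_\phi}{r}\right], \\
\epsilon_{23} &= \frac{1}{2}\left[\frac{1}{r\sin\phi}\frac{\partial \xi_\phi}{\partial \theta} + \frac{1}{r}\frac{\partial \xi_\theta}{\partial \phi} - \frac{\cot\phi}{r}\,\xi_\theta\right], \\
\epsilon_{31} &= \frac{1}{2}\left[\frac{1}{r\sin\phi}\frac{\partial \xi_r}{\partial \theta} + \frac{\partial \xi_\theta}{\partial r} - \frac{\xi_\theta}{r}\right].
\end{aligned}
$$

The equations of equilibrium now become, with $F_r$, $F_\phi$, $F_\theta$ denoting the physical components of the body force vector,

$$
\begin{aligned}
-F_r &= \frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2\sigma_{rr}\right) + \frac{1}{r\sin\phi}\frac{\partial}{\partial \phi}\left(\sin\phi\;\sigma_{r\phi}\right) + \frac{1}{r\sin\phi}\frac{\partial \sigma_{r\theta}}{\partial \theta} - \frac{1}{r}\left(\sigma_{\phi\phi} + \sigma_{\theta\theta}\right), \\
-F_\phi &= \frac{1}{r^3}\frac{\partial}{\partial r}\left(r^3\sigma_{\phi r}\right) + \frac{1}{r\sin\phi}\frac{\partial}{\partial \phi}\left(\sin\phi\;\sigma_{\phi\phi}\right) + \frac{1}{r\sin\phi}\frac{\partial \sigma_{\phi\theta}}{\partial \theta} - \frac{\cot\phi}{r}\,\sigma_{\theta\theta}, \\
-F_\theta &= \frac{1}{r^3}\frac{\partial}{\partial r}\left(r^3\sigma_{\theta r}\right) + \frac{1}{r\sin^2\phi}\frac{\partial}{\partial \phi}\left(\sin^2\phi\;\sigma_{\theta\phi}\right) + \frac{1}{r\sin\phi}\frac{\partial \sigma_{\theta\theta}}{\partial \theta}.
\end{aligned}
$$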
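
The equilibrium equations in spherical coordinates can be exercised on a stress field whose answer is known. The sketch below is illustrative: the function name, the dictionary keys, and the constants `A`, `B` are assumptions made here. It evaluates the right-hand sides by central differences and returns the body force that a given field of physical stress components would require. The test field is of the radially symmetric form $\sigma_{rr} = A - 2B/r^3$, $\sigma_{\phi\phi} = \sigma_{\theta\theta} = A + B/r^3$, which should be in equilibrium without body force.

```python
import math

def body_force(sig, r, phi, th, h=1e-5):
    """Body force (F_r, F_phi, F_theta) required for equilibrium.
    sig(r, phi, th) returns the physical stresses as a dict with keys
    'rr', 'pp', 'tt', 'rp', 'rt', 'pt' (p stands for phi, t for theta)."""
    def d(fun, var):
        """Central difference of fun(r, phi, th) with respect to argument `var`."""
        p, m = [r, phi, th], [r, phi, th]
        p[var] += h
        m[var] -= h
        return (fun(*p) - fun(*m)) / (2.0 * h)

    s = sig(r, phi, th)
    sn = math.sin(phi)
    Fr = -(d(lambda a, b, c: a * a * sig(a, b, c)['rr'], 0) / r**2
           + d(lambda a, b, c: math.sin(b) * sig(a, b, c)['rp'], 1) / (r * sn)
           + d(lambda a, b, c: sig(a, b, c)['rt'], 2) / (r * sn)
           - (s['pp'] + s['tt']) / r)
    Fp = -(d(lambda a, b, c: a**3 * sig(a, b, c)['rp'], 0) / r**3
           + d(lambda a, b, c: math.sin(b) * sig(a, b, c)['pp'], 1) / (r * sn)
           + d(lambda a, b, c: sig(a, b, c)['pt'], 2) / (r * sn)
           - s['tt'] / (r * math.tan(phi)))
    Ft = -(d(lambda a, b, c: a**3 * sig(a, b, c)['rt'], 0) / r**3
           + d(lambda a, b, c: math.sin(b)**2 * sig(a, b, c)['pt'], 1) / (r * sn**2)
           + d(lambda a, b, c: sig(a, b, c)['tt'], 2) / (r * sn))
    return Fr, Fp, Ft

A, B = 1.0, 0.5   # hypothetical constants of the radially symmetric field

def radial_field(r, phi, th):
    return {'rr': A - 2.0 * B / r**3, 'pp': A + B / r**3, 'tt': A + B / r**3,
            'rp': 0.0, 'rt': 0.0, 'pt': 0.0}

print(body_force(radial_field, 1.5, 0.8, 0.2))
```

All three components come out at the level of the finite-difference error. Perturbing one stress component, for example making $\sigma_{\theta\theta}$ differ from $\sigma_{\phi\phi}$, produces a nonzero required body force.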
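4.12. EXAMPLE - CYLINDRICAL POLAR COORDINATES

Letting $\xi_r$, $\xi_\theta$, $\xi_z$ be the physical components of the displacement and $\epsilon_{rr}$, $\epsilon_{r\theta}$, $\ldots$, $\sigma_{rr}$, $\sigma_{r\theta}$, $\ldots$, etc., be the physical components of the strain and stress, respectively, we obtain the following results:

$$ x^1 = r, \qquad x^2 = \theta, \qquad x^3 = z, $$

$$ ds^2 = dr^2 + r^2\,d\theta^2 + dz^2, $$

$$ g_{11} = 1, \qquad g_{22} = r^2, \qquad g_{33} = 1, \qquad \text{all other } g_{ij} = 0, $$

$$ g^{11} = 1, \qquad g^{22} = \frac{1}{r^2}, \qquad g^{33} = 1, \qquad \text{all other } g^{ij} = 0, $$

$$ \Gamma^1_{22} = -r, \qquad \Gamma^2_{12} = \Gamma^2_{21} = \frac{1}{r}, \qquad \text{all other } \Gamma^i_{jk} = 0. $$

Hence,

$$
\begin{aligned}
\epsilon_{rr} &= \frac{\partial \xi_r}{\partial r}, &
\epsilon_{\theta\theta} &= \frac{1}{r}\frac{\partial \xi_\theta}{\partial \theta} + \frac{\xi_r}{r}, &
\epsilon_{zz} &= \frac{\partial \xi_z}{\partial z}, \\
\epsilon_{r\theta} &= \frac{1}{2}\left(\frac{1}{r}\frac{\partial \xi_r}{\partial \theta} + \frac{\partial \xi_\theta}{\partial r} - \frac{\xi_\theta}{r}\right), &
\epsilon_{\theta z} &= \frac{1}{2}\left(\frac{\partial \xi_\theta}{\partial z} + \frac{1}{r}\frac{\partial \xi_z}{\partial \theta}\right), &
\epsilon_{zr} &= \frac{1}{2}\left(\frac{\partial \xi_z}{\partial r} + \frac{\partial \xi_r}{\partial z}\right).
\end{aligned}
$$

The equations of equilibrium are

$$
\begin{aligned}
-F_r &= \frac{1}{r}\frac{\partial}{\partial r}\left(r\sigma_{rr}\right) + \frac{1}{r}\frac{\partial \sigma_{r\theta}}{\partial \theta} + \frac{\partial \sigma_{rz}}{\partial z} - \frac{\sigma_{\theta\theta}}{r}, \\
-F_\theta &= \frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2\sigma_{\theta r}\right) + \frac{1}{r}\frac{\partial \sigma_{\theta\theta}}{\partial \theta} + \frac{\partial \sigma_{\theta z}}{\partial z}, \\
-F_z &= \frac{1}{r}\frac{\partial}{\partial r}\left(r\sigma_{zr}\right) + \frac{1}{r}\frac{\partial \sigma_{z\theta}}{\partial \theta} + \frac{\partial \sigma_{zz}}{\partial z}.
\end{aligned}
$$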
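
The strain-displacement relations in cylindrical coordinates can be tested on two fields with known strains. The sketch below is illustrative; the function name, the sample displacement fields, and the constants are assumptions made here. It evaluates the six physical strain components by central differences. A small rigid rotation about the z-axis, $\xi_\theta = \Omega r$, should give zero strain, and the radial field $\xi_r = Ar + B/r$ should give $\epsilon_{rr} = A - B/r^2$, $\epsilon_{\theta\theta} = A + B/r^2$.

```python
def cylindrical_strain(xi, r, th, z, h=1e-5):
    """Physical strain components for a displacement field xi(r, th, z) that
    returns the physical components (xi_r, xi_theta, xi_z)."""
    def d(comp, var):
        """Central difference of component `comp` with respect to argument `var`."""
        p, m = [r, th, z], [r, th, z]
        p[var] += h
        m[var] -= h
        return (xi(*p)[comp] - xi(*m)[comp]) / (2.0 * h)

    xr, xt, _ = xi(r, th, z)
    return {
        'rr': d(0, 0),
        'tt': d(1, 1) / r + xr / r,
        'zz': d(2, 2),
        'rt': 0.5 * (d(0, 1) / r + d(1, 0) - xt / r),
        'tz': 0.5 * (d(1, 2) + d(2, 1) / r),
        'zr': 0.5 * (d(2, 0) + d(0, 2)),
    }

A, B, OMEGA = 1e-3, 2e-3, 1e-3   # hypothetical constants

def rigid_rotation(r, th, z):
    return (0.0, OMEGA * r, 0.0)

def radial(r, th, z):
    return (A * r + B / r, 0.0, 0.0)

print(cylindrical_strain(rigid_rotation, 1.5, 0.4, 0.0))
print(cylindrical_strain(radial, 1.5, 0.4, 0.0))
```

The term $-\xi_\theta / r$ in $\epsilon_{r\theta}$ is what removes the apparent shear of a rigid rotation; dropping it would make the first printed result nonzero.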


PROBLEMS 

4.7. Give proper definitions of strain tensors e'\ ej in general frames of 
reference in terms of ey. Show that e^-, e*^ are symmetric tensors. Show that 
the principal strains ei, e 2 , e 3 are the roots of the characteristic equation 

je‘-e<5*| = 0 

and that the first invariant is e\. 

4.8. The generalization of the expressions of strain tensors in terms of the
displacement vector field u, which represents the displacement of any particle as
the body deforms as explained in Sec. 4.2, can be done as follows. The displacement
vector u can be resolved into covariant and contravariant components $u_i$
and $u^i$, respectively, along base vectors and reciprocal base vectors defined in the
original coordinates $a_1, a_2, a_3$. The Green's strain tensor is

$$E_{ij} = \tfrac{1}{2}\,[\,u_j|_i + u_i|_j + u_k|_i\,u^k|_j\,],$$

where $u_i|_j$ is the covariant derivative of $u_i$ with respect to $a_j$.

The displacement vector u can be resolved also into covariant and contravariant
components $u_i$ and $u^i$ along base vectors and reciprocal base vectors defined
in the Eulerian coordinates $x_1, x_2, x_3$. Then the Almansi strain tensor is

$$e_{ij} = \tfrac{1}{2}\,[\,u_j\|_i + u_i\|_j - u_k\|_i\,u^k\|_j\,],$$

where a double bar is used to indicate that $u_i$, $u^j$ and the covariant derivatives
are referred to the coordinates $x_i$.

Prove these statements.

4.9. The condition of compatibility for finite strain can be derived easily if
convected coordinates are used [see Eq. (4.1:15)]. By the fact that the space of
the deformed body is Euclidean, derive the compatibility conditions. Hint: See
comments on p. 101 and Prob. 2.31. Note, however, that the properties $g^{ij}|_k = 0$,
$g_{ij}|_k = 0$ are valid in Riemannian space as well as Euclidean space. The feature
that distinguishes a Euclidean space is the vanishing of the Riemann-Christoffel
curvature tensor $R^{i}{}_{pst}$ of Prob. 2.31. Expressing $R^{i}{}_{pst} = 0$ in terms of the metric
tensors before and after deformation leads to the compatibility conditions.
CONSERVATION LAWS


We shall discuss in this chapter the basic laws of conservation of mass 
and momentum. A rectangular Cartesian frame of reference will be used 
throughout. All tensors are Cartesian tensors. 


5.1. GAUSS’ THEOREM 

Consider a convex region V bounded by a surface S that consists of a
finite number of parts whose outer normals form a continuous vector field
(Fig. 5.1:1). Let a function $A(x_1, x_2, x_3)$ be defined and differentiable in
the region V + S. Then, by the usual process of integration, we obtain

(1)    $$\iiint_V \frac{\partial A}{\partial x_1}\,dx_1\,dx_2\,dx_3 = \iint_S (A^{*} - A^{**})\,dx_2\,dx_3,$$

where $A^{*}$ and $A^{**}$ are the values of A on the surface S at the right and left
ends of a line parallel to the $x_1$-axis, respectively. The factors $\pm dx_2\,dx_3$ in the
surface integral in Eq. (1) are the projections on the $x_2, x_3$-plane of the areas
$dS^{*}$ and $dS^{**}$ at the ends of a line parallel to the $x_1$-axis. Let $\nu = (\nu_1, \nu_2, \nu_3)$
be the unit vector along the outer normal of S. Then $dx_2\,dx_3 = \nu_1^{*}\,dS^{*}$ at
the right end and $dx_2\,dx_3 = -\nu_1^{**}\,dS^{**}$ at the left end. Therefore, the surface
integral in Eq. (1) can be written as

$$\int_S (A^{*}\nu_1^{*}\,dS^{*} + A^{**}\nu_1^{**}\,dS^{**}) = \int_S A\,\nu_1\,dS.$$

Thus, Eq. (1) may be written as

$$\int_V \frac{\partial A}{\partial x_1}\,dV = \int_S A\,\nu_1\,dS,$$

where dV and dS denote the elements of V and S, respectively. A similar
argument applies to the volume integral of $\partial A/\partial x_2$ or $\partial A/\partial x_3$. In
summary, we obtain Gauss' theorem

(2)    $$\int_V \frac{\partial A}{\partial x_i}\,dV = \int_S A\,\nu_i\,dS.$$

This formula holds for any convex regular region or for any region that
can be decomposed into a finite number of convex regular regions, as can
be seen by summing Eq. (2) over these component regions.

Now let us consider a tensor field $A_{jkl\ldots}$. Let the region V with boundary
surface S be within the region of definition of $A_{jkl\ldots}$. Let every component
of $A_{jkl\ldots}$ be continuously differentiable. Then Eq. (2) is applicable to every
component of the tensor, and we may write

(3) ▲    $$\int_V \frac{\partial}{\partial x_i} A_{jkl\ldots}\,dV = \int_S \nu_i\,A_{jkl\ldots}\,dS.$$

This is Gauss' theorem in a general form.


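Problem 5.1. Show that

$$\int_V \phi_{,i}\,dV = \int_S \phi\,\nu_i\,dS \quad \text{or} \quad \int_V \operatorname{grad}\phi\,dV = \int_S \boldsymbol{\nu}\,\phi\,dS,$$

$$\int_V u_{i,i}\,dV = \int_S u_i\,\nu_i\,dS \quad \text{or} \quad \int_V \operatorname{div}\mathbf{u}\,dV = \int_S \boldsymbol{\nu}\cdot\mathbf{u}\,dS,$$

$$\int_V e_{ijk}\,u_{k,j}\,dV = e_{ijk}\int_V u_{k,j}\,dV = e_{ijk}\int_S u_k\,\nu_j\,dS = \int_S e_{ijk}\,u_k\,\nu_j\,dS \quad \text{or} \quad \int_V \operatorname{curl}\mathbf{u}\,dV = \int_S \boldsymbol{\nu}\times\mathbf{u}\,dS,$$

where $e_{ijk}$ is the permutation tensor. Verify that these formulas are also valid
in two dimensions, in which case the range of indices is 1, 2 and the volume and
surface integrals are replaced by surface and line integrals, respectively.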
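
The divergence form of Gauss' theorem in Problem 5.1 can be checked numerically. The following is a minimal sketch, assuming a unit cube as the region V and an illustrative field u = (x1^2, x1 x2, x3); neither choice comes from the text, and the helper names are hypothetical.

```python
# Numerical check of Gauss' theorem on the unit cube 0 <= x1, x2, x3 <= 1.
# The field u and the cube are illustrative assumptions, not taken from the text.

def u(x1, x2, x3):
    """Sample vector field u = (x1^2, x1*x2, x3)."""
    return (x1 * x1, x1 * x2, x3)

def div_u(x1, x2, x3):
    """Analytic divergence u_{i,i} = 2*x1 + x1 + 1."""
    return 3.0 * x1 + 1.0

def volume_integral(n=20):
    """Midpoint-rule approximation of the integral of div u over the cube."""
    h = 1.0 / n
    pts = [(k + 0.5) * h for k in range(n)]
    return sum(div_u(a, b, c) for a in pts for b in pts for c in pts) * h**3

def surface_integral(n=20):
    """Midpoint-rule approximation of the integral of u_i nu_i over the six faces."""
    h = 1.0 / n
    pts = [(k + 0.5) * h for k in range(n)]
    total = 0.0
    for axis in range(3):                              # faces normal to x1, x2, x3
        for side, sign in ((0.0, -1.0), (1.0, 1.0)):   # outer normal is -e_axis or +e_axis
            for s in pts:
                for t in pts:
                    point = [s, t]
                    point.insert(axis, side)           # put the fixed coordinate in place
                    total += sign * u(*point)[axis] * h * h
    return total

lhs = volume_integral()
rhs = surface_integral()
print(lhs, rhs)   # both should be close to 2.5
```

Because the integrands here are at most linear on each cell, the midpoint rule reproduces both sides essentially exactly; for a general field the two numbers would agree only up to the quadrature error.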


5.2. MATERIAL AND SPATIAL DESCRIPTIONS OF 
CHANGING CONFIGURATIONS 

We shall speak of a particle in the sense of a material particle as we
know it in Newtonian particle mechanics. The instantaneous geometric
location of a particle will be spoken of as a point. A body is composed of
particles. To label the particles of a body we choose a Cartesian frame of
reference and identify the coordinates of the particles at a time t = 0 as
$(a_1, a_2, a_3)$ (Fig. 5.2:1). At a later time the particle moves to another point
whose coordinates are $(x_1, x_2, x_3)$, referred to the same coordinate system.
The relation

(1)    $$x_i = x_i(a_1, a_2, a_3, t), \qquad i = 1, 2, 3,$$

links the configurations of the body at different instants of time. The
functions $x_i$ are single-valued continuous functions whose Jacobian does
not vanish.

Fig. 5.2:1. Labeling of particles.

A basic property of bodies is that they have mass. In classical mechanics,
mass is assumed to be conserved; i.e., the mass of a material body
is the same at all times. In continuum mechanics it is further assumed that
the mass is a continuous function of volume. In other words, it is assumed
that a positive quantity $\rho$, called density, can be defined at every point
in the body as the limit

(2)    $$\rho(x) = \lim_{k \to \infty} \frac{\text{mass of } B_k}{\text{volume of } B_k},$$

where $B_k$ is a suitably chosen infinite sequence of particle sets shrinking
down to the point x; the symbol x stands for $(x_1, x_2, x_3)$. At time t = 0,
the density at the point $(a_1, a_2, a_3)$ is denoted by $\rho_0(a)$.
The conservation of mass is expressed in the formula

(3)    $$\int \rho(x)\,dx_1\,dx_2\,dx_3 = \int \rho_0(a)\,da_1\,da_2\,da_3,$$

where the integrals extend over the same particles. Since

$$\int \rho(x)\,dx_1\,dx_2\,dx_3 = \int \rho(x)\left|\frac{\partial x_i}{\partial a_j}\right| da_1\,da_2\,da_3,$$

and since this relation must hold for all bodies, we have

(4) ▲    $$\rho_0(a) = \rho(x)\left|\frac{\partial x_i}{\partial a_j}\right|, \qquad \rho(x) = \rho_0(a)\left|\frac{\partial a_i}{\partial x_j}\right|,$$

where $|\partial a_i/\partial x_j|$ denotes the determinant of the matrix $\{\partial a_i/\partial x_j\}$. These
equations relate the density in different configurations of the body to the
transformation that leads from one configuration to the other.

For the particle $(a_1, a_2, a_3)$ whose trajectory is given by Eq. (1), the
velocity is

(5) ▲    $$v_i(a, t) = \frac{\partial}{\partial t} x_i(a, t),$$

and the acceleration is

(6) ▲    $$\dot v_i(a, t) = \frac{\partial^2}{\partial t^2} x_i(a, t) = \frac{\partial}{\partial t} v_i(a, t),$$

where a stands for $(a_1, a_2, a_3)$ and is held constant.

A description of mechanical evolution which uses $(a_1, a_2, a_3)$ and t as
independent variables is called a material description. In hydrodynamics,
traditionally, a different description, called the spatial description, is used.
In the spatial description, the location $(x_1, x_2, x_3)$ and time t are taken as
independent variables. This is convenient because measurements in many
kinds of materials are more directly interpreted in terms of what happens
at a certain place, rather than following the particles. These two methods
of description are commonly designated as the Lagrangian and the Eulerian
descriptions, respectively, although both are due to Euler. The variables
$a_1, a_2, a_3, t$ are usually called the Lagrangian variables, whereas $x_1, x_2, x_3, t$
are called the Eulerian variables; they are related by Eq. (1). For a given
particle, it is convenient to speak of $(a_1, a_2, a_3)$ as the Lagrangian coordinates
of the particle at $(x_1, x_2, x_3)$.

In a spatial description, the instantaneous motion of the body is described
by the velocity vector field $v_i(x_1, x_2, x_3, t)$ associated with the
instantaneous location of each particle. The acceleration of the particle is
given by the formula

(7) ▲    $$\dot v_i(x, t) = \frac{\partial v_i}{\partial t}(x, t) + v_j\,\frac{\partial v_i}{\partial x_j}(x, t),$$

where x again stands for the variables $x_1, x_2, x_3$, and every quantity in
this formula is evaluated at (x, t). The proof follows from the fact that a particle
located at $(x_1, x_2, x_3)$ at time t is moved to a point with coordinates $x_i + v_i\,dt$
at the time t + dt; and that, according to Taylor's theorem,

$$\dot v_i(x, t)\,dt = v_i(x_j + v_j\,dt,\; t + dt) - v_i(x, t) = v_i + \frac{\partial v_i}{\partial t}\,dt + \frac{\partial v_i}{\partial x_j}\,v_j\,dt - v_i,$$

which reduces to Eq. (7). The first term in Eq. (7) may be interpreted as
arising from the time dependence of the velocity field; the second term as
the contribution of the motion of the particle in the instantaneous velocity
field. Accordingly, these terms are called the local and the convective parts
of the acceleration, respectively.

The same reasoning that led to Eq. (7) is applicable to any function
$F(x_1, x_2, x_3, t)$ that is attributable to the moving particles, such as the
temperature. The resulting rate of change is called the material derivative, and it is
denoted by a dot or the symbol D/Dt. Thus

(8) ▲    $$\dot F \equiv \frac{DF}{Dt} \equiv \left(\frac{\partial F}{\partial t}\right)_{a=\text{const.}} = \left(\frac{\partial F}{\partial t}\right)_{x=\text{const.}} + v_1\frac{\partial F}{\partial x_1} + v_2\frac{\partial F}{\partial x_2} + v_3\frac{\partial F}{\partial x_3},$$

where $a = (a_1, a_2, a_3)$ is the Lagrangian coordinate of the particle which is
located at x at the time t, connected by Eq. (1).
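
The equivalence of the two forms of the material derivative in Eq. (8) can be illustrated numerically. The sketch below assumes a one-dimensional motion x = a(1 + t) and a field F = x^2 t; both are hypothetical choices made only for illustration, and the derivatives are approximated by central differences.

```python
# One-dimensional motion x = a*(1 + t), an illustrative assumption.

def x_of(a, t):
    """Eq. (1) for the assumed motion."""
    return a * (1.0 + t)

def a_of(x, t):
    """Inverse of the motion: label of the particle now at x."""
    return x / (1.0 + t)

def v_spatial(x, t):
    """Velocity field in the spatial description: v = a = x/(1 + t)."""
    return x / (1.0 + t)

def F(x, t):
    """A sample field attributed to the particles (stand-in for temperature)."""
    return x * x * t

def material_derivative(f, x, t, h=1e-5):
    """DF/Dt = (dF/dt at fixed x) + v dF/dx, the spatial form of Eq. (8)."""
    df_dt = (f(x, t + h) - f(x, t - h)) / (2.0 * h)
    df_dx = (f(x + h, t) - f(x - h, t)) / (2.0 * h)
    return df_dt + v_spatial(x, t) * df_dx

x, t, h = 2.0, 0.5, 1e-5
a = a_of(x, t)

# Spatial (Eulerian) evaluation.
eulerian = material_derivative(F, x, t)
# Material (Lagrangian) evaluation: differentiate in time while holding a fixed.
lagrangian = (F(x_of(a, t + h), t + h) - F(x_of(a, t - h), t - h)) / (2.0 * h)
# Acceleration from Eq. (7); every particle here moves at constant speed a.
accel = material_derivative(v_spatial, x, t)

print(eulerian, lagrangian, accel)
```

In this motion the local and convective parts of the acceleration cancel, so `accel` should be close to zero even though the velocity field itself depends on time.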


5.3. MATERIAL DERIVATIVE OF VOLUME INTEGRAL 

Consider a volume integral taken over the body

(1)    $$I = \int_V A(x, t)\,dV,$$

where A(x, t) denotes a property of the continuum and the integral is
evaluated at an instant of time t. We may wish to know how fast the
body itself sees its own value of I changing, so it is of interest to
know the derivative of I with respect to time for a given set of particles.
Now the particle at $x_i$ at the instant t will have the coordinates $x_i' = x_i + v_i\,dt$
at the time t + dt. The boundary S of the body at the instant t will
have moved at time t + dt to a neighboring surface S', which bounds the
volume V' (Fig. 5.3:1). The material derivative of I is defined as

(2)    $$\frac{DI}{Dt} = \lim_{dt \to 0} \frac{1}{dt}\left[\int_{V'} A(x', t + dt)\,dV' - \int_V A(x, t)\,dV\right].$$

Fig. 5.3:1. Continuous change of the boundary of a region.

Now there are two contributions to the difference on the right-hand side
of Eq. (2): one over the region $V_0$ that V and V' share in common, and
another over the region $V_1$ where V and V' differ. The former contribution
to DI/Dt is evidently

$$\int_V \frac{\partial A}{\partial t}\,dV.$$

The latter contribution comes from the value of A on the boundary multiplied
by the volumes swept by the particles on the boundary in the time
interval dt. If $\nu_i$ is the unit vector along the exterior normal of S, then,
since the displacement of a particle on the boundary is $v_i\,dt$, the volume
swept by particles occupying an element of area dS on the boundary S is
$dV = \nu_i v_i\,dS\,dt$. The contribution of this element to DI/Dt is $A v_i \nu_i\,dS$.
The total contribution is obtained by an integration over S. Therefore,

(3) ▲    $$\frac{D}{Dt}\int_V A\,dV = \int_V \frac{\partial A}{\partial t}\,dV + \int_S A\,v_i\,\nu_i\,dS.$$

Transforming the last integral by Gauss' theorem and using Eq. (5.2:8), we
have

(4) ▲    $$\frac{D}{Dt}\int_V A\,dV = \int_V \left[\frac{\partial A}{\partial t} + \frac{\partial (A v_j)}{\partial x_j}\right] dV = \int_V \left(\frac{DA}{Dt} + A\,\frac{\partial v_j}{\partial x_j}\right) dV.$$

This important formula will be used over and over again below. It should
be noted that the operations of forming the material derivative and of
spatial integration do not commute in general.
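
Equation (3) can be checked in one dimension, where the "surface" of a material region reduces to its two end points. This is a minimal sketch under assumed data: the motion x = a(1 + t), the material region 0 <= a <= 1, and the property A = x^2 t are all illustrative, and the function names are hypothetical.

```python
# Check of Eq. (3) in one dimension for a material region 0 <= x <= 1 + t.
# Assumed motion: x = a*(1 + t), so v = x/(1 + t); A(x, t) = x^2 * t is arbitrary.

def A(x, t):
    return x * x * t

def v(x, t):
    return x / (1.0 + t)

def integrate(f, lo, hi, n=2000):
    """Midpoint rule on [lo, hi]."""
    h = (hi - lo) / n
    return sum(f(lo + (k + 0.5) * h) for k in range(n)) * h

def I(t):
    """Integral of A over the region occupied by the same particles at time t."""
    return integrate(lambda x: A(x, t), 0.0, 1.0 + t)

t, h = 0.5, 1e-4

# Left-hand side: material derivative of I, by a central difference in time.
lhs = (I(t + h) - I(t - h)) / (2.0 * h)

# Right-hand side: local rate of change integrated over V ...
local_part = integrate(lambda x: (A(x, t + h) - A(x, t - h)) / (2.0 * h), 0.0, 1.0 + t)
# ... plus the flux A*v*nu through the boundary; nu = -1 at x = 0, +1 at x = 1 + t.
flux_part = A(1.0 + t, t) * v(1.0 + t, t) * (+1.0) + A(0.0, t) * v(0.0, t) * (-1.0)
rhs = local_part + flux_part

print(lhs, rhs)   # both close to 2.25
```

Dropping `flux_part` gives only half of the correct value in this example, which illustrates why the material derivative cannot simply be moved inside the integral.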

5.4. THE EQUATION OF CONTINUITY 

The law of conservation of mass has been discussed in Sec. 5.2. With 
the results of Sec. 5.3, we can now give some alternative forms. 

The mass contained in a region V at a time t is

(1)    $$m = \int_V \rho\,dV,$$

where $\rho = \rho(x, t)$ is the density field of the continuum. Conservation of mass
requires that Dm/Dt = 0. The derivative Dm/Dt is given by Eqs. (5.3:3)
and (5.3:4), when A is replaced by $\rho$. Since the result must hold for arbitrary
V, the integrand must vanish. Hence, we obtain the following alternative
forms of the law of conservation of mass.

(2) ▲    $$\int_V \frac{\partial \rho}{\partial t}\,dV + \int_S \rho\,v_j\,\nu_j\,dS = 0,$$

(3) ▲    $$\frac{\partial \rho}{\partial t} + \frac{\partial (\rho v_j)}{\partial x_j} = 0,$$

(4) ▲    $$\frac{D\rho}{Dt} + \rho\,\frac{\partial v_j}{\partial x_j} = 0.$$

These are called the equations of continuity. The integral form (2) is useful
when the differentiability of $\rho v_j$ cannot be assumed.

In problems of statics, these equations are identically satisfied. Then
the conservation of mass must be expressed by Eq. (5.2:3) or (5.2:4).
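
As a small consistency check, a density field built from Eq. (5.2:4) should satisfy both differential forms (3) and (4) of the equation of continuity. The sketch below assumes a one-dimensional motion x = a(1 + t) and an initial density 1 + a; these data and the function names are hypothetical.

```python
# Assumed motion x = a*(1 + t) with initial density rho0(a) = 1 + a (illustrative).

def rho0(a):
    return 1.0 + a

def v(x, t):
    """Spatial velocity field of the assumed motion."""
    return x / (1.0 + t)

def rho(x, t):
    """Eq. (5.2:4): rho = rho0(a) * |da/dx|, with a = x/(1 + t)."""
    return rho0(x / (1.0 + t)) / (1.0 + t)

def residual_eq3(x, t, h=1e-5):
    """d(rho)/dt + d(rho*v)/dx, which Eq. (3) requires to vanish."""
    drho_dt = (rho(x, t + h) - rho(x, t - h)) / (2.0 * h)
    dflux_dx = (rho(x + h, t) * v(x + h, t) - rho(x - h, t) * v(x - h, t)) / (2.0 * h)
    return drho_dt + dflux_dx

def residual_eq4(x, t, h=1e-5):
    """D(rho)/Dt + rho * dv/dx, which Eq. (4) requires to vanish."""
    drho_dt = (rho(x, t + h) - rho(x, t - h)) / (2.0 * h)
    drho_dx = (rho(x + h, t) - rho(x - h, t)) / (2.0 * h)
    dv_dx = (v(x + h, t) - v(x - h, t)) / (2.0 * h)
    return drho_dt + v(x, t) * drho_dx + rho(x, t) * dv_dx

points = [(0.3, 0.1), (1.0, 0.5), (2.5, 2.0)]
worst = max(max(abs(residual_eq3(x, t)), abs(residual_eq4(x, t))) for x, t in points)
print(worst)   # limited only by finite-difference error
```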





5.5. THE EQUATIONS OF MOTION 

Newton’s laws of motion state that, in an inertial frame of reference, 
the material rate of change of the linear momentum of a body is equal to 
the resultant of applied forces and that the material rate of change of the 
moment of momentum with respect to the coordinate origin is equal to the 
resultant moment of applied forces about the same origin. 

At an instant of time t, a regular region V of space contains the linear
momentum

(1)    $$\mathcal{P}_i = \int_V \rho\,v_i\,dV.$$

If the body is subjected to surface traction $\overset{\nu}{T}_i$ and body force per unit
volume $X_i$, the resultant force is

(2)    $$\mathcal{F}_i = \int_S \overset{\nu}{T}_i\,dS + \int_V X_i\,dV.$$

According to the stress principle of Euler and Cauchy (Secs. 3.2 and 3.3),
$\overset{\nu}{T}_i = \sigma_{ji}\nu_j$, where $\sigma_{ij}$ is the stress field and $\nu_j$ is the unit vector along the
exterior normal to the boundary surface S of the region V. Substituting
into Eq. (2) and transforming into a volume integral by Gauss' theorem,
we have

(3)    $$\mathcal{F}_i = \int_V \left(\frac{\partial \sigma_{ji}}{\partial x_j} + X_i\right) dV.$$

Newton's law states that

(4)    $$\frac{D}{Dt}\mathcal{P}_i = \mathcal{F}_i.$$

Hence, according to Eq. (5.3:4), with A identified with $\rho v_i$, we have

(5)    $$\int_V \left[\frac{\partial (\rho v_i)}{\partial t} + \frac{\partial}{\partial x_j}(\rho v_i v_j)\right] dV = \int_V \left(\frac{\partial \sigma_{ji}}{\partial x_j} + X_i\right) dV.$$

Since this equation must hold for an arbitrary region V, the integrands on
the two sides must be equal. Thus

(6)    $$\frac{\partial (\rho v_i)}{\partial t} + \frac{\partial}{\partial x_j}(\rho v_i v_j) = \frac{\partial \sigma_{ji}}{\partial x_j} + X_i.$$

The left-hand side of Eq. (6) is equal to

$$v_i\left(\frac{\partial \rho}{\partial t} + \frac{\partial (\rho v_j)}{\partial x_j}\right) + \rho\left(\frac{\partial v_i}{\partial t} + v_j\frac{\partial v_i}{\partial x_j}\right).$$

The first parenthesis vanishes by the equation of continuity (5.4:3), while
the second is the acceleration $Dv_i/Dt$. Hence

(7) ▲    $$\rho\,\frac{Dv_i}{Dt} = \frac{\partial \sigma_{ji}}{\partial x_j} + X_i.$$

This is the Eulerian equation of motion of a continuum. The equation of
equilibrium discussed in Sec. 3.4 is obtained by setting all velocity components
$v_i$ equal to zero.

If differentiability of the stress field or the momentum field cannot be
assumed, we may use Eq. (5.3:3) to compute $D\mathcal{P}_i/Dt$. Then Eqs. (2) and
(4) give Euler's equation in the integral form,

(8)    $$\int_V \frac{\partial (\rho v_i)}{\partial t}\,dV = \int_S (\sigma_{ji} - \rho\,v_i v_j)\,\nu_j\,dS + \int_V X_i\,dV.$$

The corresponding equations of static equilibrium are obtained, of
course, by setting all velocity components to zero.
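
A simple static case shows how Eq. (7) is used once the velocity vanishes. The sketch below assumes a uniform column standing on z = 0 with a free top, loaded only by its own weight, with a one-dimensional stress state; the material data and names are illustrative assumptions, not values from the text.

```python
# Static check of Eq. (7) with v_i = 0: d(sigma_zz)/dz + X_z = 0.
RHO = 7850.0      # kg/m^3, illustrative density
G_ACCEL = 9.81    # m/s^2, gravitational acceleration
LENGTH = 2.0      # m, column height (assumed)

def sigma_zz(z):
    """Axial stress in a column standing on z = 0 with a free top at z = LENGTH."""
    return -RHO * G_ACCEL * (LENGTH - z)

def body_force_z(z):
    """Weight per unit volume, acting in the -z direction."""
    return -RHO * G_ACCEL

def equilibrium_residual(z, h=1e-3):
    """Left-over of the equilibrium equation, using a central difference for the gradient."""
    dsigma_dz = (sigma_zz(z + h) - sigma_zz(z - h)) / (2.0 * h)
    return dsigma_dz + body_force_z(z)

residuals = [equilibrium_residual(z) for z in (0.25, 1.0, 1.75)]
top_traction = sigma_zz(LENGTH)      # free surface: should vanish
base_stress = sigma_zz(0.0)          # carries the whole weight per unit area
print(residuals, top_traction, base_stress)
```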


5.6. MOMENT OF MOMENTUM 

An application of the law of balance of angular momentum to the particular
case of static equilibrium leads to the conclusion that stress tensors
are symmetric tensors (see Sec. 3.4). We shall now show that no additional
restriction to the motion of a continuum is introduced in dynamics by the
angular momentum postulate.

At an instant of time t, a body occupying a regular region V of space
with boundary S has the moment of momentum [Eq. (3.2:2)]

(1)    $$\mathcal{H}_i = \int_V e_{ijk}\,x_j\,\rho\,v_k\,dV,$$

with respect to the origin. If the body is subjected to surface traction
$\overset{\nu}{T}_i$ and body force per unit volume $X_i$, the resultant moment about the
origin is

(2)    $$\mathcal{L}_i = \int_V e_{ijk}\,x_j\,X_k\,dV + \int_S e_{ijk}\,x_j\,\overset{\nu}{T}_k\,dS.$$

Introducing Cauchy's formula $\overset{\nu}{T}_k = \sigma_{lk}\nu_l$ into the last integral and
transforming the result into a volume integral by Gauss' theorem, we obtain

(3)    $$\mathcal{L}_i = \int_V e_{ijk}\,x_j\,X_k\,dV + \int_V (e_{ijk}\,x_j\,\sigma_{lk})_{,l}\,dV.$$

Euler's law states that for any region V

(4)    $$\frac{D}{Dt}\mathcal{H}_i = \mathcal{L}_i.$$

Evaluating the material derivative of $\mathcal{H}_i$ according to (5.3:4), and using
Eq. (3), we obtain

(5)    $$e_{ijk}\,x_j\,\frac{\partial}{\partial t}(\rho v_k) + \frac{\partial}{\partial x_l}(e_{ijk}\,x_j\,\rho\,v_k v_l) = e_{ijk}\,x_j\,X_k + (e_{ijk}\,x_j\,\sigma_{lk})_{,l}.$$

The second term in Eq. (5) can be written as

$$e_{ijk}\,\rho\,v_j v_k + e_{ijk}\,x_j\,\frac{\partial}{\partial x_l}(\rho v_k v_l) = e_{ijk}\,x_j\,\frac{\partial}{\partial x_l}(\rho v_k v_l).$$

The last term in Eq. (5) can be written as $e_{ijk}\sigma_{jk} + e_{ijk}\,x_j\,\sigma_{lk,l}$. Hence,
Eq. (5) becomes

(6)    $$e_{ijk}\,x_j\left[\frac{\partial}{\partial t}(\rho v_k) + \frac{\partial}{\partial x_l}(\rho v_k v_l) - X_k - \sigma_{lk,l}\right] - e_{ijk}\,\sigma_{jk} = 0.$$

The sum in the square brackets vanishes by the equation of motion (5.5:6).
Hence, Eq. (6) is reduced to

(7)    $$e_{ijk}\,\sigma_{jk} = 0;$$

i.e., $\sigma_{jk} = \sigma_{kj}$. Thus, if the stress tensor is symmetric, the law of balance
of moment of momentum is identically satisfied.
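
Condition (7) can be evaluated directly for a given stress matrix. The following sketch computes the contraction of the permutation symbol with a 3 x 3 array; the sample stress values are made up for illustration and the function names are hypothetical.

```python
def permutation(i, j, k):
    """Permutation symbol e_ijk for indices 0, 1, 2 (+1, -1, or 0)."""
    return (i - j) * (j - k) * (k - i) // 2

def axial_vector(sigma):
    """Components e_ijk * sigma_jk, summed over j and k; zero if sigma is symmetric."""
    return [
        sum(permutation(i, j, k) * sigma[j][k] for j in range(3) for k in range(3))
        for i in range(3)
    ]

# Illustrative stress values (any consistent unit).
symmetric = [[10, 2, -1],
             [2, 5, 3],
             [-1, 3, -4]]
unsymmetric = [[10, 2, -1],
               [4, 5, 3],
               [-1, 3, -4]]

print(axial_vector(symmetric))     # [0, 0, 0]
print(axial_vector(unsymmetric))   # [0, 0, -2]
```

The nonzero third component for the second matrix is the difference between the (1,2) and (2,1) entries, which is exactly the unbalanced couple that Eq. (7) rules out.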


5.7. OTHER FIELD EQUATIONS 

The motion of a continuum must be governed further by the law of 
conservation of energy. If mechanical energy is the only form of energy of 
interest in a problem, then the energy equation can be obtained by suitable 
integration of the equation of motion, Eq. (5.5:7). If the interaction of 
thermal process and mechanical process is significant, then the equation of 
energy contains a thermal energy term and is an independent equation to 
be satisfied. We shall discuss the energy equation in Sec. 12.2. 

The equations of continuity and motion, Eqs. (5.4:3) and (5.5:7), constitute
four equations for ten unknown functions of time and position; namely,
the density $\rho$, the three velocity components $v_i$ (or displacements $u_i$) and
the six independent stress components $\sigma_{ij}$. Further restrictions would have
to be introduced before the motion of a continuum can be determined. One
group of such additional restrictions comes from a statement about the
mechanical property of the medium in the form of a specification of the
stress-strain relationship. These specifications are called constitutive equations.

The constitutive equations of elastic and plastic materials are discussed
in Chapter 6. The constitutive equations of thermomechanical materials are
discussed in Chapters 12 and 14. The constitutive equations of viscoelastic
materials are discussed in Chapters 13 and 15. The constitutive equations
of solids with large deformation are discussed in Chapter 16. The constitutive
equations of plates and shells are discussed in Chapter 20. A field that
is very rich in a variety of constitutive equations is biomechanics. See Fung,
Biomechanics: Mechanical Properties of Living Tissues, 1993; Biomechanics:
Circulation, 1996; Biomechanics: Motion, Flow, Stress, and Growth,
1990, Springer Verlag; and other references listed in the Bibliography.

PROBLEMS 

5.2. Express the following statements in tensor equations. Define your
symbols.

(a) The force of gravitational attraction between two particles of inertial masses
$m_1$ and $m_2$, respectively, separated by a distance r, is equal to
$G m_1 m_2 / r^2$ and is directed from each particle toward the other.

(b) Each component of stress is a linear function of the components of strain.

(c) A normal vector of a surface is perpendicular to any two line elements tangent
to the surface (in particular, to tangents of the parametric curves).

5.3. If the stress-strain law in rectangular Cartesian coordinates is

$$\tau_{ij} = \lambda\,\theta\,\delta_{ij} + 2G\,e_{ij}, \qquad \theta = e_{\alpha\alpha},$$

what is the proper form of the tensor relation between $\tau^{ij}$ and $e_{ij}$ in general
coordinates of reference?

5.4. Discuss the appropriateness, from the tensorial point of view, of the 
following proposals for the constitutive equations in Cartesian tensors for certain 
materials. 

(a) = P(e mn )e,j, where P(e mn ) = aen + be 11 (a, b, constants). 

(b) $\tau_{ij} = P(e_{mn})\,e_{ij}$, where $P(e_{mn}) = a + b\,|e_1|^2$ (a, b constants), and $e_1, e_2, e_3$
are the three principal strains satisfying the relation $e_1 > e_2 > e_3$.

(c) $\tau_{ij} = Q(I_1, I_2, I_3)\,e_{ij}$, where $I_1, I_2, I_3$ are the first, second, and third
invariants of $e_{ij}$, respectively.

(d) $\tau_{ij} = \alpha\,\delta_{ij} + \beta\,e_{ij} + \gamma\,e_{ik} e_{kj} + \lambda\,e_{ik} e_{km} e_{mj}$, where $\alpha, \beta, \gamma, \lambda$ are constants.

(e) If in (d), $\alpha, \beta, \gamma, \lambda$ are permitted to depend on the stress components, what
kind of combinations of the stress components would $\alpha, \beta, \gamma, \lambda$ be allowed
to depend upon?
ELASTIC AND PLASTIC BEHAVIOR
OF MATERIALS


In this chapter, some commonly used constitutive laws are considered.
These laws describe the material behavior under isothermal conditions in the
range of relatively small strains, and with slow rates of flow. The plasticity
theories discussed in this chapter are mathematical formulations of
experimental observations. The approach is phenomenological. We shall
discuss first the strain-rate-independent plasticity theory, considering the
yield surface, the flow rule, the hardening behavior, and the loading and
unloading criterion. We then consider cyclic loading reversals and strain
softening. It is, however, beyond the scope of this book to consider the
atomic, crystalline or amorphous structural changes in plastic deformation,
the atomic reasons for how and why plasticity occurs, and plasticity at large
strain. The plasticity considered here is appropriate to most problems in
structural engineering, in which excessive plastic flow is undesirable; but it
is not adequate for problems in metal forming, wire drawing,
rolling, etc.

We shall use Cartesian tensors in this chapter. In rectangular Cartesian
coordinates the physical components of a tensor are the same as the tensor
components, so stresses can be denoted by $\sigma_{ij}$ or $\sigma^{ij}$. The reader should
try to put all the equations in this chapter into general tensor equations in
curvilinear coordinates.

6.1. GENERALIZED HOOKE’S LAW 

With the introduction of the concepts of stress and strain, Cauchy generalized
Hooke's law into the statement that the components of stress are
linearly related to the components of strain. As a tensor equation, the
generalized Hooke's law may be written in the form

(1) ▲    $$\sigma^{ij} = D^{ijkl}\,e_{kl},$$

where $\sigma^{ij}$ is the stress tensor, $e_{kl}$ is the strain tensor,$^1$ and $D^{ijkl}$ is the
tensor of the elastic constants, or moduli, of the material and is called the
elastic modulus tensor. Inasmuch as $\sigma^{ij} = \sigma^{ji}$, we must have

(2)    $$D^{ijkl} = D^{jikl}.$$

Furthermore, since $e_{kl} = e_{lk}$, and in Eq. (1) the indices k and l are dummy
variables, we can always symmetrize $D^{ijkl}$ with respect to k and l without
altering the sum. Hence, without loss of generality, we may assume that

(3)    $$D^{ijkl} = D^{ijlk}.$$

According to these symmetry properties, the maximum number of independent
elastic constants is 36.

If there exists a strain energy function W,

(4)    $$W = \tfrac{1}{2}\,D^{ijkl}\,e_{ij}\,e_{kl},$$

with the property

(5)    $$\frac{\partial W}{\partial e_{ij}} = \sigma^{ij},$$

then we can always assume that the quadratic form Eq. (4) is symmetric,
and it follows that

(6)    $$D^{ijkl} = D^{klij}.$$

Under the symmetry condition, Eq. (6), the number of independent elastic
constants is reduced to 21. The question of the existence of the strain
energy function is discussed in Chapter 12.

If a material possesses further symmetry in its elastic property, the 
number of independent elastic constants will be reduced. For example, 
if the material exhibits symmetry with respect to a plane, the number of 
independent elastic constants becomes 13. If there is symmetry with respect 
to three mutually perpendicular planes, the number becomes 9. However, if 
the material is elastically isotropic, i.e., if its elastic properties are identical 
in all directions, then the number of independent elastic constants reduces 
to 2. 
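
The counts 36 and 21 quoted above follow from the index symmetries alone and can be reproduced by enumeration. The sketch below groups the 81 index combinations of a fourth-order tensor into classes that the symmetries (2), (3), and (6) force to be equal; the function name and flags are hypothetical. Material symmetries (13, 9, and 2 constants) are not covered by this count.

```python
from itertools import product

def count_independent(minor=True, major=False):
    """Count the distinct components of D^{ijkl} left by the index symmetries.

    minor: D^{ijkl} = D^{jikl} and D^{ijkl} = D^{ijlk}   (Eqs. (2) and (3))
    major: D^{ijkl} = D^{klij}                           (Eq. (6))
    """
    classes = set()
    for i, j, k, l in product(range(3), repeat=4):
        first, second = (i, j), (k, l)
        if minor:
            # Order inside each index pair does not matter.
            first, second = tuple(sorted(first)), tuple(sorted(second))
        if major:
            # The two index pairs may be exchanged.
            first, second = sorted((first, second))
        classes.add((first, second))
    return len(classes)

print(count_independent(minor=False, major=False))  # 81
print(count_independent(minor=True, major=False))   # 36
print(count_independent(minor=True, major=True))    # 21
```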

The study of crystal symmetry is a very interesting subject. Many
excellent references exist. See Love,$^{1.1}$ Green and Adkins,$^{1.2}$ Sokolnikoff.$^{1.2}$

In the following discussion, we shall limit our attention first to isotropic
materials. Since we are going to use Cartesian tensors in the remaining
chapter, we shall just use subscripted indices to indicate tensorial
components.

$^1$ If the displacement $u_i$ is infinitesimal, $e_{ij} = (u_{i,j} + u_{j,i})/2$. If $u_i$ is finite, $e_{ij}$ is the
Almansi strain tensor of Sec. 4.1. See also Secs. 16.6, 16.7.


6.2. STRESS-STRAIN RELATIONSHIP FOR ISOTROPIC 
ELASTIC MATERIALS 

For an isotropic elastic material in which there is no change of temperature,
Hooke's law referred to a set of rectangular Cartesian coordinates
may be stated in the form

(1)    $$\sigma_{\alpha\alpha} = 3K\,e_{\alpha\alpha},$$

(2)    $$\sigma'_{ij} = 2G\,e'_{ij},$$

where K and G are constants and $\sigma'_{ij}$ and $e'_{ij}$ are the stress deviation and
strain deviation, respectively; i.e.,

(3)    $$\sigma'_{ij} = \sigma_{ij} - \tfrac{1}{3}\,\sigma_{\alpha\alpha}\,\delta_{ij},$$

(4)    $$e'_{ij} = e_{ij} - \tfrac{1}{3}\,e_{\alpha\alpha}\,\delta_{ij}.$$

We have seen before that $\sigma_{\alpha\alpha}/3$ is the mean stress at a point and that, if
the strain is infinitesimal, $e_{\alpha\alpha}$ is the change of volume per unit volume.
Both $\sigma_{\alpha\alpha}$ and $e_{\alpha\alpha}$ are invariants. Thus, Eq. (1) states that the change of
volume of the material is proportional to the mean stress. In the special
case of hydrostatic compression,

$$\sigma_{xx} = \sigma_{yy} = \sigma_{zz} = -p, \qquad \sigma_{xy} = \sigma_{yz} = \sigma_{zx} = 0,$$

we have $\sigma_{\alpha\alpha} = -3p$, and Eq. (1) may be written, in the case of infinitesimal
strain, with v and $\Delta v$ denoting volume and change of volume, respectively,

(5)    $$\frac{\Delta v}{v} = -\frac{p}{K}.$$

Thus, the coefficient K is appropriately called the bulk modulus of the
material.

The strain deviation $e'_{ij}$ describes a deformation without volume change.
Equation (2) states that the stress deviation is simply proportional to the
strain deviation. The constant G is called the modulus of elasticity in shear,
the shear modulus, or the modulus of rigidity. In the special case in which
$e_{xy} \neq 0$, but all other strain components vanish, we have

(6)    $$\sigma_{xy} = 2G\,e_{xy},$$

whereas all other stress components vanish. The coefficient 2 is included
because, before the tensor concept was introduced, it was customary to
define the engineering shear strain as $\gamma_{xy} = 2e_{xy}$.

If we substitute Eqs. (3) and (4) into Eq. (2) and make use of Eq. (1),
the result may be written in the form

(7)    $$\sigma_{ij} = \lambda\,e_{\alpha\alpha}\,\delta_{ij} + 2G\,e_{ij},$$

(8)    $$e_{ij} = \frac{1+\nu}{E}\,\sigma_{ij} - \frac{\nu}{E}\,\sigma_{\alpha\alpha}\,\delta_{ij}.$$

The constants $\lambda$ and G are called Lamé's constants (G. Lamé, 1852). In
many books the symbol $\mu$ is used in place of G. The constant E is called
the modulus of elasticity, or Young's modulus (Thomas Young, 1807). The
constant $\nu$ is called Poisson's ratio. The relationships between these constants
are

(9)    $$\lambda = \frac{2G\nu}{1-2\nu} = \frac{G(E-2G)}{3G-E} = K - \frac{2}{3}G = \frac{E\nu}{(1+\nu)(1-2\nu)} = \frac{3K\nu}{1+\nu} = \frac{3K(3K-E)}{9K-E},$$

$$G = \frac{\lambda(1-2\nu)}{2\nu} = \frac{3}{2}(K-\lambda) = \frac{E}{2(1+\nu)} = \frac{3K(1-2\nu)}{2(1+\nu)} = \frac{3KE}{9K-E},$$

$$\nu = \frac{\lambda}{2(\lambda+G)} = \frac{\lambda}{3K-\lambda} = \frac{E}{2G} - 1 = \frac{3K-2G}{2(3K+G)} = \frac{3K-E}{6K},$$

$$E = \frac{G(3\lambda+2G)}{\lambda+G} = \frac{\lambda(1+\nu)(1-2\nu)}{\nu} = \frac{9K(K-\lambda)}{3K-\lambda} = 2G(1+\nu) = \frac{9KG}{3K+G} = 3K(1-2\nu),$$

$$K = \lambda + \frac{2}{3}G = \frac{\lambda(1+\nu)}{3\nu} = \frac{2G(1+\nu)}{3(1-2\nu)} = \frac{GE}{3(3G-E)} = \frac{E}{3(1-2\nu)}.$$

To these we may add the following combinations that appear frequently.

(10)    $$\frac{G}{\lambda+G} = 1 - 2\nu, \qquad \frac{\lambda}{\lambda+2G} = \frac{\nu}{1-\nu}.$$
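
The relations in Eqs. (9) and (10) are easy to get wrong by hand, so a numerical spot check is useful. The sketch below starts from E and the Poisson ratio, computes the other three constants, and verifies several of the listed identities; the function name is hypothetical, and the input values are the rounded steel entries from Table 6.2:1.

```python
from math import isclose

def from_E_nu(E, nu):
    """Return (lam, G, K) from Young's modulus E and Poisson's ratio nu, per Eq. (9)."""
    lam = E * nu / ((1.0 + nu) * (1.0 - 2.0 * nu))
    G = E / (2.0 * (1.0 + nu))
    K = E / (3.0 * (1.0 - 2.0 * nu))
    return lam, G, K

E, nu = 30.0, 0.29          # steel, E in 10^6 lb/sq in. (Table 6.2:1)
lam, G, K = from_E_nu(E, nu)

# A few of the alternative expressions listed in Eq. (9).
checks = {
    "lam = K - 2G/3": isclose(lam, K - 2.0 * G / 3.0),
    "lam = 3K(3K - E)/(9K - E)": isclose(lam, 3.0 * K * (3.0 * K - E) / (9.0 * K - E)),
    "G = 3KE/(9K - E)": isclose(G, 3.0 * K * E / (9.0 * K - E)),
    "nu = (3K - 2G)/(2(3K + G))": isclose(nu, (3.0 * K - 2.0 * G) / (2.0 * (3.0 * K + G))),
    "E = 9KG/(3K + G)": isclose(E, 9.0 * K * G / (3.0 * K + G)),
    "K = GE/(3(3G - E))": isclose(K, G * E / (3.0 * (3.0 * G - E))),
    # Eq. (10)
    "G/(lam + G) = 1 - 2nu": isclose(G / (lam + G), 1.0 - 2.0 * nu),
    "lam/(lam + 2G) = nu/(1 - nu)": isclose(lam / (lam + 2.0 * G), nu / (1.0 - nu)),
}
print(round(G, 2), all(checks.values()))
```

The computed shear modulus is about 11.6, close to the tabulated 11.5; the small gap presumably reflects rounding of the tabulated averages.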
In unabridged notation, Eq. (8) reads

(11)    $$e_{xx} = \frac{1}{E}\,[\sigma_{xx} - \nu(\sigma_{yy} + \sigma_{zz})],$$

$$e_{yy} = \frac{1}{E}\,[\sigma_{yy} - \nu(\sigma_{xx} + \sigma_{zz})],$$

$$e_{zz} = \frac{1}{E}\,[\sigma_{zz} - \nu(\sigma_{xx} + \sigma_{yy})],$$

$$e_{xy} = \frac{1+\nu}{E}\,\sigma_{xy} = \frac{1}{2G}\,\sigma_{xy},$$

$$e_{yz} = \frac{1+\nu}{E}\,\sigma_{yz} = \frac{1}{2G}\,\sigma_{yz},$$

$$e_{zx} = \frac{1+\nu}{E}\,\sigma_{zx} = \frac{1}{2G}\,\sigma_{zx}.$$

One can express the $\sigma$'s in terms of the e's:

(12)    $$\sigma_{xx} = \frac{E}{(1+\nu)(1-2\nu)}\,[(1-\nu)\,e_{xx} + \nu(e_{yy} + e_{zz})],$$

$$\sigma_{xy} = 2G\,e_{xy},$$

with corresponding expressions for $\sigma_{yy}$, $\sigma_{zz}$, $\sigma_{yz}$, $\sigma_{zx}$. Table 6.2:1 gives the
average values of E, G, and $\nu$ at room temperature for several engineering
materials which are approximately isotropic.
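
Equations (7) and (8) are inverses of each other, which can be confirmed by a round trip on a simple stress state. The sketch below applies Eq. (8) to a uniaxial stress and then recovers the stress with Eq. (7); the function names are hypothetical, and the constants are the rounded steel values from Table 6.2:1 in lb/sq in.

```python
def trace(t):
    """Sum of the diagonal components, t_aa."""
    return t[0][0] + t[1][1] + t[2][2]

def stress_from_strain(e, lam, G):
    """Eq. (7): sigma_ij = lam * e_aa * delta_ij + 2 G e_ij."""
    tr = trace(e)
    return [[lam * tr * (i == j) + 2.0 * G * e[i][j] for j in range(3)] for i in range(3)]

def strain_from_stress(s, E, nu):
    """Eq. (8): e_ij = (1 + nu)/E * sigma_ij - nu/E * sigma_aa * delta_ij."""
    tr = trace(s)
    return [[(1.0 + nu) / E * s[i][j] - nu / E * tr * (i == j) for j in range(3)]
            for i in range(3)]

E, nu = 30.0e6, 0.29                              # lb/sq in., steel (Table 6.2:1)
lam = E * nu / ((1.0 + nu) * (1.0 - 2.0 * nu))    # from Eq. (9)
G = E / (2.0 * (1.0 + nu))                        # from Eq. (9)

uniaxial = [[1000.0, 0.0, 0.0],                   # sigma_xx = 1000 lb/sq in., rest zero
            [0.0, 0.0, 0.0],
            [0.0, 0.0, 0.0]]

strain = strain_from_stress(uniaxial, E, nu)
recovered = stress_from_strain(strain, lam, G)

print(strain[0][0], strain[1][1])   # axial extension and lateral contraction
print(recovered[0][0])              # close to 1000
```

The lateral strain equals the axial strain multiplied by minus the Poisson ratio, in agreement with the first two lines of Eq. (11).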


Table 6.2:1

                          E,               G,                        Speed of sound
                          10^6 lb/sq in.   10^6 lb/sq in.   ν        (dilatational wave),
                                                                     10^3 ft/sec
Metals:
  Steels                  30               11.5             0.29     16.3
  Aluminum alloys         10               2.4              0.31     16.5
  Magnesium alloys        6.5              2.4              0.35     16.6
  Copper (hot rolled)     15.0             5.6              0.33
Plastics:
  Cellulose acetate       0.22                                       0.36
  Vinylchloride acetate   0.46                                       5.1
  Phenolic laminates      1.23                              0.25     8.2
Glass                     8                3.2              0.25
Concrete                  4                                 0.2

In 1829, Poisson advanced arguments, which were later found untenable,
that the value of $\nu$ should be 1/4. The special value of Poisson's ratio
$\nu = 1/4$ makes

(13)    $$\lambda = G$$

and simplifies the equations of elasticity considerably. Consequently, this
assumption is often used, particularly in geophysics, in the study of complicated
wave-propagation problems. The special value $\nu = 1/2$ implies that

(14)    $$G = \frac{1}{3}E, \qquad \frac{1}{K} = 0, \qquad \frac{\Delta v}{v} = e_{\alpha\alpha} = 0.$$

So far in this chapter we have discussed the simplest of all stress-strain
relationships for elastic materials, Hooke's law, which describes the
mechanical properties of a class of materials like steel. There are of course
also many things in the world whose mechanical properties cannot be
described by the simple elasticity discussed above. For example, almost all
biological materials lie beyond the reach of these simple laws. Even rubber,
plastics, and metals like aluminum, magnesium, and lead cannot be
so described. Many materials obey Hooke's law when the stress and strain
are sufficiently small, but yield and flow plastically when a critical condition
of yield is reached and maintained. The behavior of materials beyond
their elastic limit is complicated. In the rest of this book, we first demonstrate
the classical methods that can solve boundary-value problems of
bodies whose materials obey Hooke's law; then consider the world beyond.
We will study plasticity, thermodynamics, thermoelasticity, irreversible
thermodynamics, viscoelasticity, finite strains, large deformation, elasticity
with nonlinear stress-strain relationship, plates and shells, and numerical
methods to handle boundary-value problems involving materials with these
mechanical properties. The final mathematical structure of the theories renders
boundary-value problems solvable by the numerical methods that will
be proposed.

6.3. IDEAL PLASTIC SOLIDS 

Metals obey Hooke's law only in a certain range of small strain. When a
metal is strained beyond an elastic limit, Hooke's law no longer applies. The
behavior of metals beyond their elastic limit is rather complicated. We shall
give a very brief outline of some experimental facts in the next section, and
we will discuss the formulation of the constitutive relations in the plastic
regime later. However, before engaging in a long and involved discussion,
let us present a set of laws which defines the simplest plastic material: an
ideal plastic solid obeying von Mises' yield criterion and flow rule. That
such a set of laws is a reasonable abstraction of the behavior of certain real
materials will be discussed later. At the moment, we shall just exhibit the
minimum ingredients that constitute a theory of plasticity.

In Secs. 3.12 and 3.13, we defined the stress deviation $\sigma'_{ij}$, which has a
second invariant $J_2$ defined as

(1)    $$J_2 = \tfrac{1}{2}\,\sigma'_{ij}\,\sigma'_{ij}.$$

In a similar way, the strain deviation $e'_{ij}$ is defined by subtracting the mean
strain from the strain tensor $e_{ij}$,

(2)    $$e'_{ij} = e_{ij} - \tfrac{1}{3}\,e_{\alpha\alpha}\,\delta_{ij}.$$

If the material is isotropic and obeys Hooke's law, then

(3)    $$e'_{ij} = \frac{1}{2G}\,\sigma'_{ij}.$$

When an increasing deformation reaches a certain limit, called the elastic
limit, the material starts to deform plastically, and $e'_{ij}$ is no longer given by
Hooke's law. In this case we define the plastic strain increment $de^{(p)}_{ij}$ as the
increment of the actual strain deviation $de'_{ij}$ minus the increment of the
elastic strain $de_{ij}^{\prime(e)}$ computed from Hooke's law as if it still applied. $de_{ij}^{\prime(e)}$
is called the elastic strain increment. For metallic materials experimental
evidence indicates that the hydrostatic pressure has little effect on plastic
yielding and that the volumetric change of the material is negligible during
plastic yielding. Hence one can assume $de^{(p)}_{\alpha\alpha} = 0$. Thus $de^{(p)}_{ij}$ can be
considered as the increment of the plastic strain deviation and be written
in the form

(4)    $$de^{(p)}_{ij} = de'_{ij} - de_{ij}^{\prime(e)} = de'_{ij} - \frac{d\sigma'_{ij}}{2G}, \qquad de^{(p)}_{\alpha\alpha} = 0,$$

where $(\cdot)'$ denotes the deviation of the associated quantity in the parentheses.
The total strain deviation increment may now be written as

(5)    $$de'_{ij} = de^{(p)}_{ij} + \frac{d\sigma'_{ij}}{2G}.$$

Equation (5) becomes a rate equation of deformation when all incremental
quantities $d(\cdot)$ are replaced by rate quantities $(\dot{\ })$,

$$\dot e'_{ij} = \dot e^{(p)}_{ij} + \frac{\dot\sigma'_{ij}}{2G}.$$

The rate of plastic deformation is $\dot e^{(p)}_{ij}$ and the increment of the plastic strain
in the time interval dt is $\dot e^{(p)}_{ij}\,dt$. Then the total plastic strain deviation after
successive stages of yielding is the algebraic sum of the deformations that
occur at all stages:

(6)    $$e^{(p)}_{ij}(t) = e^{(p)}_{ij}(0) + \int_0^t \dot e^{(p)}_{ij}(t)\,dt,$$

where $e^{(p)}_{ij}(0)$ is the initial value of $e^{(p)}_{ij}$ at time t = 0. A theory of plasticity
is formulated by specifying how $\dot e^{(p)}_{ij}$ can be computed.

For an ideal plastic solid obeying von Mises ’ yield criterion and flow rule 
the specifications are as follows: 


(a) Hooke’s law holds for the mean stress and the mean strain at all 
times, i.e., $\sigma_{kk} = 3K e_{kk}$, with $K$ the bulk modulus. 
Hence, the plastic strain is incompressible ($e^{(p)}_{kk} = 0$) and the plastic 
strain deviation tensor is the same as the plastic strain tensor. This 
justifies the notation $e^{(p)}_{ij}$ in places where normally we would write 
$e^{(p)\prime}_{ij}$ to indicate a plastic strain deviation. 

(b) The material is elastic and obeys Hooke’s law as long as the second 
invariant $J_2$ is less than a constant $k^2$. In other words, no change 
in plastic strain can occur as long as $J_2 < k^2$: 

$\dot e^{(p)}_{ij} = 0$ when $J_2 < k^2$. 

(c) Yielding occurs (the elastic limit is reached) when and only when 
$J_2 = k^2$. When the yield condition $J_2 - k^2 = 0$ prevails, the rate of 
change of the plastic strain is proportional to the stress deviation: 

$\dot e^{(p)}_{ij} = \dfrac{1}{\mu}\,\sigma'_{ij}$, $\quad \mu > 0$, 

where $\mu$ is a positive factor of proportionality. 

(d) Stress states corresponding to $J_2 > k^2$ cannot be realized in the 
material. 
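
The four specifications can be checked numerically for a given stress tensor. The following is a minimal Python sketch, assuming NumPy is available; the function names (`stress_deviation`, `second_invariant`, `classify_state`) and the relative tolerance `tol` are hypothetical choices made for this illustration only. It computes $J_2$ from the stress deviation and reports which of the cases (b), (c), (d) applies.

```python
import numpy as np

def stress_deviation(sigma):
    """Stress deviation: the stress tensor minus the mean stress times the identity."""
    sigma = np.asarray(sigma, dtype=float)
    return sigma - np.trace(sigma) / 3.0 * np.eye(3)

def second_invariant(sigma):
    """J2 = (1/2) s_ij s_ij, where s is the stress deviation of sigma."""
    s = stress_deviation(sigma)
    return 0.5 * float(np.sum(s * s))

def classify_state(sigma, k, tol=1e-9):
    """Compare J2 with k^2 (tol is an assumed relative tolerance, not part of the theory)."""
    j2 = second_invariant(sigma)
    if j2 < k**2 * (1.0 - tol):
        return "elastic"          # case (b): no change in plastic strain
    if j2 <= k**2 * (1.0 + tol):
        return "yielding"         # case (c): plastic flow is possible
    return "not realizable"       # case (d)

if __name__ == "__main__":
    k = 100.0
    simple_shear = np.array([[0.0, 60.0, 0.0],
                             [60.0, 0.0, 0.0],
                             [0.0, 0.0, 0.0]])
    print(second_invariant(simple_shear), classify_state(simple_shear, k))
```

Because only the stress deviation enters, adding any hydrostatic stress to `sigma` leaves the classification unchanged, which mirrors the assumption that hydrostatic pressure has little effect on yielding.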


The set of laws above contains two essential parts: the criterion for 
yield, and the stress-strain relation in the elastic and plastic regimes. In the 
specifications above, the yield condition is based on the second invariant 
of the stress deviation tensor. This yield criterion was first proposed by 
von Mises. The constant $k$ is the yield stress in simple shear; it can 
also be determined from the yield stress in simple tension. 

We notice that only the rate of plastic strain is specified by these laws. 
Under a varying loading program, successive plastic strain increments must 
be added together algebraically according to Eq. (6). Such a theory is called 
an incremental theory. 

In many applications of the theory of plasticity the rate at which plastic 
flow occurs is relatively small. The concern is mostly with the total amount 
of plastic flow. In such applications we often assume that the plastic 
deformation is rate insensitive and that we can replace the equation in (c) by 
the incremental law 

$de^{(p)}_{ij} = \lambda\,\sigma'_{ij}$, $\quad \lambda > 0$, 

where $\lambda$ is a factor of proportionality and not a characteristic constant of 
the material. The factor $\lambda$ can be a function of $e^{(p)}_{ij}$ and $\sigma_{ij}$, but not $\dot\sigma_{ij}$. 
The sign of $\lambda$ is determined by the fact that plastic flow involves dissipation 
of energy. Under a given set of loading conditions the value of $\lambda$, and hence 
the total plastic flow, is determined by the total work done by the external 
load. 

Actual materials may exhibit much more complicated plastic behavior 
than specified above. Many yield conditions and plastic flow rules have been 
proposed. To prepare for the study of these formulations, we shall review 
briefly below some basic experimental facts. 

6.4. SOME EXPERIMENTAL INFORMATION 

Simple Tension Tests. If a rod of a ductile metal is pulled in a testing 
machine at room temperature, the load applied on the test specimen may 
be plotted against the elongation, 

$(l - l_0)/l_0$, 

where $l_0$ is the original length of the rod and $l$ is the length under load. 
Numerous experiments show typical load-elongation relationships, as indicated 
in the diagrams of Fig. 6.4:1. The initial region appears as a straight 
line. This is the region in which the law of linear elasticity is expected to 
hold. Mild steel shows an upper yield point A*, a lower yield point A, and 
a flat yield region AB [Fig. 6.4:1(a)], which is caused by many discontinuous 
steps of microscopic slip along slip planes of the crystals. Most other 
metals do not have such a flat yield region [Fig. 6.4:1(b)]. Metals such 
as aluminum, copper and stainless steel exhibit a gradual transition from 
a linear elastic to a nonlinear plastic behavior instead of having pronounced 
yield points as shown in Fig. 6.4:1(a). 

Fig. 6.4:1. Typical load-elongation curves in simple tension tests. 

Upon unloading at any stage in the deformation, the strain is reduced 
along an elastic unloading line. Reloading retraces the unloading curve with 
relatively minor deviation and then produces further plastic deformation 
when approximately the previous maximum stress is exceeded. During 
the loading process, the test specimen may “neck” at a certain strain, so 
that the cross section is reduced in a small region. When necking occurs 
under continued elongation, the load reaches a maximum and then drops 
down, although the actual average stress in the neck region (load divided 
by the true area of the neck) continues to increase. The maximum M 
in Fig. 6.4:1(a) is the ultimate load. Beyond the ultimate load the metal 
continues to flow. At point C in the curves of Fig. 6.4:1 the specimen 
breaks. 

Materials like cast iron, titanium carbide, beryllium, or any rock 
material, which allow very little plastic deformation before reaching the 
breaking point, are called brittle materials. The load-strain curve for a 
brittle material is given in Fig. 6.4:1(c). Point C is the breaking point. 

A fact of great importance for geology is that brittle materials such as 
rocks tend to become ductile when subjected to large hydrostatic pressure 
(large negative mean stress). Theodore von Karman (1911) demonstrated 
this in his classical experiments on marbles. Simple compression and simple 
shear tests of cylindrical specimens give load-strain diagrams similar to 
those of Fig. 6.4:1. Bending of a beam with a shear load is often used to 
test the behavior of a material in tension, compression, and shear combined 
in a specific way. 

Bauschinger Effect. When a metal specimen is subjected to repeated 
tension-compression loads with deformation exceeding the elastic limit, 
the load-deflection curve sometimes appears as in Fig. 6.4:2. The tension 
stroke and the compression stroke are dissimilar. This is referred to as the 
Bauschinger effect, after J. Bauschinger’s basic paper on strain hardening 
published in 1886. 




Fig. 6.4:2. Bauschinger effect in a simple tension-elongation test. 

Anisotropy. Plastic deformation in metal is a result of slip along certain 
crystallographic planes. The process is clearly directional. As a 
consequence, any initial isotropy that may have been present is usually 
destroyed by plastic deformation. From the point of view of the dislocation 
theory, slip is an irreversible process; every slip produces a new material. 
These changes are revealed in the Bauschinger effect and in the anisotropy 
of materials after plastic deformation. 

Time and Temperature Effects. The results described above are typically 
obtained by slow application of the load in a testing machine. Higher 
strain rate or loading rate can have a pronounced effect on the material 
behavior in the plastic regime. Increasing loading rate generally decreases 
the ductility of the material, and increases the initial and subsequent yield 
stresses. Under rapid loading, the hereditary nature of the material usually 
reveals itself. If the hereditary stress-strain law is linear, it is often described 
as anelasticity or viscoelasticity (see Chapters 1 and 15). If the strain is 
large, the hereditary stress-strain laws for metals are generally nonlinear. 

The time-dependent plastic flow is often described as creep. A simple 
tension specimen of lead wire under a constant tension load shows a creep 
curve as illustrated in Fig. 6.4:3. Following an initial extension, the rate 
of strain first decreases gradually, then remains nearly constant for a 
while, and finally accelerates until the specimen breaks. The three stages 
of creep are called, respectively, the primary, the secondary, and the 
tertiary creep. 

Andrade$^{6.4}$ has pointed out, however, that if a creep test is performed at 
constant stress, the final stage of accelerated elongation may or may not 
appear, depending on whether the material deteriorates or not. As shown in 
Fig. 6.4:3, the creep curve at constant load differs considerably from that 
at constant stress. 

The creep rate and length of the primary and secondary stages are 
dependent on temperature and stress level. At high temperature and high 
stresses the secondary creep stage may not be appreciable. 

The phenomenon of creep is important in geophysics and in those en¬ 
gineering problems in which accurate dimensions must be maintained over 
a long period of time. It is a controlling factor for the design of machinery 
that operates at high temperature. On the other hand, plastic deformation 
generates heat, and a very rapid application of load is usually associated 
with a large temperature gradient, localized plastic flow and anisotropic ef¬ 
fects. Continuous metal cutting is a good example of large localized plastic 
deformation. Temperature, like strain rate, generally has a very significant 
influence on material properties. As temperature increases, the ductility of 
metals usually increases while their stiffness decreases. 

Size Effect and Stress Inhomogeneity. Since practically all materials 
have a nonhomogeneous microscopic structure, plastic properties of a small 
region may be different from those of a larger region. This consideration 
becomes important in the question of stress concentration in notched 
specimens. Size effect is also important for micron and submicron structures 
in microelectronic applications, as the strain gradient can strongly 
influence the structural behavior. 

Fig. 6.4:3. Creep test of a lead wire in tension. The initial lengths and 
initial loads were the same in both tests. (After Andrade, 1910; courtesy of 
the Royal Society, London.) 

Combined Stress. One of the most important tasks of mathematical 
theory is to extend and generalize the experiences of simple experiments 
and to suggest crucial tests to verify the basic assumptions of the theory. In 
plasticity, this function of mathematics is particularly evident, because here 
one must consider the stress and strain tensors; yet no direct observation 
of all components of these tensors is possible. 

A relatively simple combined-stress experiment employs a thin-walled 
circular cylinder. With the combination of an axial tension, a twisting 
moment about the cylinder axis, and an internal or external pressure, a 
good approximation to a biaxial state of stress (involving $\sigma_{11}$, $\sigma_{12}$, $\sigma_{22}$ 
with $\sigma_{31} = \sigma_{32} = \sigma_{33} = 0$) can be produced. Much information has been 
gathered from such experiments. Among other things, it is found that the 
shear stress is by far the major cause of yielding. 

P. W. Bridgman’s renowned experiments on the influence of hydrostatic 
pressure on yielding and other mechanical and metallographical characteristics 
of metals have shown that hydrostatic pressures one order of magnitude higher 
than the yield stress have practically no influence on yielding of metals. A 
tensile test or shear test was run at atmospheric pressure in the standard 
manner, with load and deformation recorded; and then was repeated by im¬ 
mersion of the whole setup in a chamber under high hydrostatic pressure. 
Bridgman found that the stress-strain curves in the small strain range did 
not change, but the ductility of the material increased greatly, thus permit¬ 
ting a much larger deformation prior to fracture. 

Conversely, very small changes in material density are found when a 
metal is subjected to repeated plastic deformation, indicating that the plas¬ 
tic volume change is small. 

In the subsequent sections, we shall present some of the best known 
theories of plasticity and derive the mathematical relationships between 
stress and strain for plastically deformed solids. 

6.5. A BASIC ASSUMPTION OF THE MATHEMATICAL 
THEORY OF PLASTICITY: THE EXISTENCE OF A 
YIELD FUNCTION 

The theory of plasticity describes the mechanical behavior of materials 
in the plastic range, including energy dissipation, irreversible and history- 
or path-dependent processes, the initial yield surface, its subsequent growth, 
the constitutive equations for plastic deformation, and the criteria for 
loading and unloading. Throughout this chapter, we assume that the plastic 
deformation is not rate sensitive, so that the constitutive equations are 
invariant with respect to time scale and that the rate form and the incremental 
form are equivalent to each other. 

By comparing the laws of linear elasticity and ideal plasticity as de¬ 
scribed in Secs. 6.1 and 6.3 with the typical experimental features described 
in Sec. 6.4, we can determine the applicability and limitations of these math¬ 
ematical laws. We see that near the origin of the stress-strain diagrams of 
Fig. 6.4:1, the linear relationship is good in nearly all cases. This is the 
range over which the material behavior follows the laws of linear elasticity 
as described in Secs. 6.1 and 6.2. For the case of mild steel represented 
by Fig. 6.4:1(a), there is a flat region A-B. This is the feature of ideal 
plasticity that the load remains constant as the elongation increases under 
uniaxial loading. The slightly unstable region on the yield curve at the 
peak A* before the flat range is normally ignored by plasticity laws. The 
rise beyond point B in Fig. 6.4:1(a), as well as the curved portion of the 
curves in Figs. 6.4:1(b) and 1(c), obviously cannot be represented by 
linear elasticity or ideal plasticity. The features of these curved portions of 
stress-strain curves are said to be features of strain hardening.² It is evident 
that the stress-strain laws of strain hardening will be basically different from 
those of ideal plasticity. If the material is unloaded in this region beyond 
point A, it will follow a path parallel to the initial loading path OA*. As 
a result only part of the strain, called the elastic strain $e^{(e)}$, is recovered, 
while the other part of the strain will remain as the permanent or plastic 
strain $e^{(p)}$. 

For the purpose of describing the state of stress at any point in the 
material, it is convenient to represent each state of stress by a point in a 
nine-dimensional stress space, with axes $\sigma_{ij}$ ($i, j = 1, 2, 3$). Similarly, a state 
of strain is a point in a nine-dimensional strain space, with components $e_{ij}$. 
In the theory of plasticity, a state of plastic strain $e^{(p)}_{ij}$ may be so represented 
also. A program of loading may be regarded as a path in the stress space. 
The corresponding deformation history is a path in the strain space. 

² The material that is most qualified to be called elastic is probably natural rubber. But 
if the stress-strain relationship of natural rubber is plotted in the manner of Fig. 6.4:1, 
the resulting curve will not be a straight line. In some rubbers this can be corrected by 
plotting the stress components versus the (finite) Eulerian (Almansi) strains or, 
equivalently, by plotting the Kirchhoff stress components versus Green’s strain components. 
In such cases, the material is said to be linearly elastic with respect to finite strain. In 
most cases the nonlinearity in the stress-strain relationship remains, although the 
material returns to its original state along the same stress-strain curve when all the loads 
are removed. In this case, the material is said to be nonlinearly elastic. In this chapter, 
we discuss the strain hardening only under the assumption of small strain. Finite 
deformation is discussed in Chapter 16. 

To develop a mathematical theory of plasticity, a basic assumption is 
made that there exists a continuous scalar yield function $f(\sigma_{ij}, T, \xi_i)$, which 
has the following properties: 

• The equation $f(\sigma_{ij}, T, \xi_i) = 0$ represents a closed surface, called the 
yield or loading surface, in the stress space $\sigma_{ij}$ for a given temperature 
$T$ and an array of internal variables $\xi_1, \ldots, \xi_n$. 

• The plastic strain rate $\dot e^{(p)}_{ij}$ and all internal-variable rates $\dot\xi_i$ vanish in 
the region in which $f(\sigma_{ij}, T, \xi_i) < 0$. This region is called the elastic 
region, which occupies the interior of the yield surface. 

• The plastic strain rate can be nonzero in the region where 
$f(\sigma_{ij}, T, \xi_i) = 0$. 

• No meaning is associated with $f(\sigma_{ij}, T, \xi_i) > 0$. 

There are two types of internal variables $\xi_1, \ldots, \xi_n$. One type consists 
of “physical” variables, which describe chemical reaction, phase changes, 
or structural defects. Another type consists of phenomenological variables 
including $e^{(p)}_{ij}$ itself. Together they characterize the hardening properties 
of the material and, therefore, are also called work-hardening parameters. 
These parameters may depend on the plastic deformation history and their 
rates may be functions of $\sigma_{ij}$, $T$ and $\xi_i$. 

Work hardening will be considered later. Here let us consider several 
simple examples to see the plausibility of the yield assumption. 

Von Mises Criterion. The von Mises yield condition is defined by the 
yield function 

(1) ▲ $f(\sigma_{ij}) = J_2 - k^2$, 

where $k$ is a parameter and $J_2$ is the second invariant of the stress deviation, 
which is in the form 

(2) $J_2 = \tfrac{1}{2}\,\sigma'_{ij}\sigma'_{ij}$. 

Alternative expressions of $J_2$ are given in Eq. (3.10:8). For an ideal plastic 
material obeying von Mises’ condition, $k$ is a constant independent of 
strain history. The deformation is elastic when $J_2 < k^2$, plastic flow is 
possible only when $J_2 = k^2$, and the condition $J_2 > k^2$ can never be 
realized. These are the conditions specified in Sec. 6.3. For a work-hardening 
material, $k$ will be allowed to change with strain history. 

If the material is subjected to a simple shear $\sigma_{12}$ while all other stress 
components vanish (a state of stress realizable in torsion of a thin-walled 
tube), then $J_2 = \sigma_{12}^2$, and yielding should occur when 

$\sigma_{12} = k$. 

Hence the constant $k$ means the yield stress in simple shear. 


Tresca Criterion. Tresca$^{6.3}$ (1868) first advanced the idea of a yield 
criterion. Through his work on metal forming in an armory, he concluded that 
the decisive factor for yielding is the maximum shear stress in the material. 
Tresca proposed the criterion stipulating that the maximum shear stress 
has a constant value during plastic flow. 

To express Tresca’s idea analytically, it is simplest to use the 
principal stresses $\sigma_1$, $\sigma_2$, $\sigma_3$. If the principal axes of stress are so labeled that 

(3) $\sigma_1 \ge \sigma_2 \ge \sigma_3$, 

then Tresca’s yield condition is 

(4) ▲ $f = \sigma_1 - \sigma_3 - 2k = 0$. 

However, $f$ in this form is not analytic; it violates the rule that the manner 
in which the principal axes are labeled 1, 2, 3 should not affect the form 
of the yield function. To obey this rule, we observe that Tresca’s condition 
states that during plastic flow one of the differences $|\sigma_1 - \sigma_2|$, $|\sigma_2 - \sigma_3|$, 
$|\sigma_3 - \sigma_1|$ has the value $2k$. Hence we may write 

(5) $f = [(\sigma_1 - \sigma_2)^2 - 4k^2]\,[(\sigma_2 - \sigma_3)^2 - 4k^2]\,[(\sigma_3 - \sigma_1)^2 - 4k^2] = 0$. 

This equation is now symmetrical with respect to the principal stresses, 
and can be put into an invariant form, which is due to Reuss: 

(6) ▲ $f = 4J_2^3 - 27J_3^2 - 36k^2J_2^2 + 96k^4J_2 - 64k^6 = 0$, 

where $J_2$ and $J_3$ are the second and third invariants of the stress deviation 
tensor with $J_2$ as given in Eq. (2) and 

$J_3 = \det(\sigma'_{ij}) = \tfrac{1}{3}\,\sigma'_{ij}\sigma'_{jk}\sigma'_{ki}$ 
$\quad = \sigma'_{11}\sigma'_{22}\sigma'_{33} + 2\sigma'_{12}\sigma'_{23}\sigma'_{31} - \sigma'_{11}\sigma'^{\,2}_{23} - \sigma'_{22}\sigma'^{\,2}_{31} - \sigma'_{33}\sigma'^{\,2}_{12}$. 

Using the results of Problem 3.5 in Sec. 3.10, we can write the Tresca yield 
surface in the form 

$f = \sqrt{J_2}\,\sin(\alpha + 60^\circ) - k = 0$, 

where the angle $\alpha$ is determined by the ratio $3\sqrt{3}\,J_3/(2J_2^{3/2})$ through the 
results of Problem 3.5. It can be shown that the absolute value of 
$3\sqrt{3}\,J_3/(2J_2^{3/2})$ is always less than or equal to one. 

In certain problems, the direction of the principal axes and the relative 
magnitudes of the principal stresses are known by symmetry considerations 
or by intuition. Then it is possible to use the simple form Eq. (4) for 
Tresca’s criterion. In general, however, we cannot tell a priori the relative 
magnitude of the principal stresses, and would be obliged to use the general 
form Eq. (6). 
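
The agreement between the simple form and the invariant form on the yield surface can be checked numerically. The sketch below is an illustration only: it assumes NumPy, takes the invariant form to be $4J_2^3 - 27J_3^2 - 36k^2J_2^2 + 96k^4J_2 - 64k^6$, and uses hypothetical function names. Off the yield surface the two functions take different values; only their zeros are expected to coincide.

```python
import numpy as np

def deviatoric_invariants(sigma):
    """Return (J2, J3) of the stress deviation of a symmetric 3x3 stress tensor."""
    sigma = np.asarray(sigma, dtype=float)
    s = sigma - np.trace(sigma) / 3.0 * np.eye(3)
    j2 = 0.5 * float(np.sum(s * s))
    j3 = float(np.linalg.det(s))
    return j2, j3

def tresca_principal(sigma, k):
    """Simple form: sigma_1 - sigma_3 - 2k, with principal stresses sorted numerically."""
    p = np.linalg.eigvalsh(np.asarray(sigma, dtype=float))  # ascending order
    return float(p[-1] - p[0]) - 2.0 * k

def tresca_invariant(sigma, k):
    """Invariant form in J2 and J3 (assumed polynomial, see the lead-in)."""
    j2, j3 = deviatoric_invariants(sigma)
    return (4.0 * j2**3 - 27.0 * j3**2 - 36.0 * k**2 * j2**2
            + 96.0 * k**4 * j2 - 64.0 * k**6)

if __name__ == "__main__":
    k = 1.0
    sigma = np.diag([3.0, 1.5, 1.0])   # sigma_1 - sigma_3 = 2k, so this state is at yield
    print(tresca_principal(sigma, k), tresca_invariant(sigma, k))
```

The invariant form needs no sorting of the principal stresses, which is the practical reason for preferring it when their relative magnitudes are not known in advance.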


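Example 1. Comparison of the Tresca and von Mises Criteria. 

Consider a thin-walled tube subjected to the combined uniaxial tensile 
stress $\sigma$ ($> 0$) and shear stress $\tau$. From Eq. (3.6:7), the principal stresses are 

(7) $\sigma_1 = \tfrac{1}{2}\sigma + \tfrac{1}{2}\sqrt{\sigma^2 + 4\tau^2}$, $\quad \sigma_2 = 0$, $\quad \sigma_3 = \tfrac{1}{2}\sigma - \tfrac{1}{2}\sqrt{\sigma^2 + 4\tau^2}$. 

The Tresca and von Mises yield functions can be simplified as 

$f = \bar\sigma - \sigma_Y$, 

where $\sigma_Y$ is the yield stress in uniaxial tension, and 

(8) $\bar\sigma = \sigma_1 - \sigma_3 = \sqrt{\sigma^2 + 4\tau^2} = 2k$ (Tresca), 

(9) $\bar\sigma = \sqrt{3J_2} = \left\{\tfrac{1}{2}\left[(\sigma_1 - \sigma_3)^2 + \sigma_1^2 + \sigma_3^2\right]\right\}^{1/2}$ 
$\quad = \sqrt{\sigma^2 + 3\tau^2} = \sqrt{3}\,k$ (von Mises). 

If $k$ is the yield stress in simple shear ($\sigma = 0$, $\tau = k$), then $k = \sigma_Y/\sqrt{3}$ 
according to the von Mises criterion in Eq. (9) and $k = \sigma_Y/2$ according to 
the Tresca criterion in Eq. (8). 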
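
For the tension-torsion tube of this example the two criteria reduce to closed-form expressions in the axial stress and the shear stress, which are easy to compare numerically. The following Python sketch is illustrative only; the function names and the numerical value of the tensile yield stress are assumptions made here, not data from the text.

```python
import math

def tresca_equivalent(sigma, tau):
    """sigma_1 - sigma_3 for axial stress sigma combined with shear stress tau."""
    return math.sqrt(sigma**2 + 4.0 * tau**2)

def mises_equivalent(sigma, tau):
    """sqrt(3 J2) for axial stress sigma combined with shear stress tau."""
    return math.sqrt(sigma**2 + 3.0 * tau**2)

def shear_yield_stress(sigma_y, criterion):
    """Yield stress in simple shear implied by a tensile yield stress sigma_y."""
    if criterion == "tresca":
        return sigma_y / 2.0
    if criterion == "mises":
        return sigma_y / math.sqrt(3.0)
    raise ValueError("criterion must be 'tresca' or 'mises'")

if __name__ == "__main__":
    sigma_y = 240.0  # assumed tensile yield stress, arbitrary units
    for name in ("tresca", "mises"):
        print(name, shear_yield_stress(sigma_y, name))
```

Both equivalent stresses equal the axial stress when the shear stress vanishes, so the two criteria agree in simple tension by construction and differ most in simple shear.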

Example 2. Experimental Investigation of the Tresca and von Mises 
Criteria. 

Lode$^{6.3}$ (1925) investigated the influence of the intermediate principal 
stress on yielding. He used the parameter, called the Lode parameter, 

(10) $\eta = \dfrac{2\sigma_2 - \sigma_1 - \sigma_3}{\sigma_1 - \sigma_3}$ 

to characterize the stress state where the principal stresses $\sigma_i$ satisfy the 
condition 

(11) $\sigma_1 \ge \sigma_2 \ge \sigma_3$. 

According to the Tresca criterion 

(12) $\dfrac{\sigma_1 - \sigma_3}{\sigma_Y} = 1$; 

thus yielding does not depend on $\eta$. On the other hand, from Eq. (10), we 
have 

$\sigma_2 - \sigma_1 = \dfrac{\sigma_1 - \sigma_3}{2}\,(\eta - 1)$, 

$\sigma_2 - \sigma_3 = \dfrac{\sigma_1 - \sigma_3}{2}\,(\eta + 1)$. 

The von Mises criterion 

$\tfrac{1}{2}\left[(\sigma_1 - \sigma_2)^2 + (\sigma_2 - \sigma_3)^2 + (\sigma_3 - \sigma_1)^2\right] = \sigma_Y^2$ 

can be reduced to 

(13) $\dfrac{\sigma_1 - \sigma_3}{\sigma_Y} = \dfrac{2}{\sqrt{3 + \eta^2}}$, 

which is a function of the Lode parameter $\eta$. Lode (1925) tested thin-walled 
tubes of steel, copper, and nickel subjected to a combined loading of 
uniaxial tensile force $F$ and internal pressure $p$. The induced stresses are 

$\sigma_1 = \sigma_\theta = \dfrac{pR}{t}$, $\quad \sigma_2 = \sigma_z = \dfrac{F}{2\pi R t}$, $\quad \sigma_3 = \sigma_r \approx 0$, 

where $t$ is the wall thickness and $R$ the mean tube radius. From Eq. (10), 
the Lode parameter is 

$\eta = \dfrac{F - \pi R^2 p}{\pi R^2 p}$, 

with $2\pi R^2 p \ge F \ge 0$ to assure that Eq. (11) is satisfied. Experimental 
results in Fig. 6.5:1 seem to favor the von Mises yield criterion as the results 
show that the value of $(\sigma_1 - \sigma_3)/\sigma_Y$ depends on $\eta$. 

Fig. 6.5:1. Lode’s comparison of von Mises and Tresca yield criteria (Data from Khan 
and Huang 1995). Dots are Lode’s experimental data. 
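
The dependence of the von Mises prediction on the Lode parameter can be tabulated with a few lines of code. This is a minimal Python sketch with hypothetical function names; it assumes the Lode parameter $(2\sigma_2 - \sigma_1 - \sigma_3)/(\sigma_1 - \sigma_3)$ and the reduced von Mises ratio $2/\sqrt{3 + \eta^2}$ discussed in this example, and it does not reproduce Lode's measurements.

```python
import math

def lode_parameter(s1, s2, s3):
    """Lode parameter for ordered principal stresses s1 >= s2 >= s3, s1 > s3."""
    if not (s1 >= s2 >= s3) or s1 == s3:
        raise ValueError("principal stresses must satisfy s1 >= s2 >= s3 with s1 > s3")
    return (2.0 * s2 - s1 - s3) / (s1 - s3)

def mises_ratio(eta):
    """(s1 - s3) / sigma_Y at yield according to the von Mises criterion."""
    return 2.0 / math.sqrt(3.0 + eta**2)

def tresca_ratio(eta):
    """(s1 - s3) / sigma_Y at yield according to the Tresca criterion (independent of eta)."""
    return 1.0

def tube_lode_parameter(force, pressure, radius):
    """Lode parameter for a thin tube under axial force and internal pressure."""
    hoop_load = math.pi * radius**2 * pressure
    return (force - hoop_load) / hoop_load

if __name__ == "__main__":
    for eta in (-1.0, -0.5, 0.0, 0.5, 1.0):
        print(eta, tresca_ratio(eta), mises_ratio(eta))
```

The two predictions coincide at $\eta = \pm 1$ and differ most at $\eta = 0$, which is where tests of this kind discriminate best between the criteria.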

Problem 6.1. Apply the yield criteria of von Mises and Tresca to simple 
tension and to simple shear. Show that, if they give the same yield stress in 
simple shear, the tensile yield stress predicted by von Mises’ criterion is smaller 
than that predicted by Tresca’s criterion by a factor $\sqrt{3}/2$. 

Problem 6.2. Show that in no other type of stress is the discrepancy 
between the predictions of the yield stress by von Mises’ and by Tresca’s criteria 
as large as it is in simple tension (von Mises$^{6.3}$, 1913), if they give the same yield 
stress in simple shear. 


6.6. LOADING AND UNLOADING CRITERIA 


Let us first clarify what loading and unloading in a plastic state mean. 
Consider a plastic state at which the yield function introduced in Sec. 6.5 
vanishes, i.e., 

(1) $f(\sigma_{ij}, T, \xi_i) = 0$. 

The time rate of $f$ is 

(2) $\dot f = \dfrac{\partial f}{\partial\sigma_{ij}}\,\dot\sigma_{ij} + \dfrac{\partial f}{\partial T}\,\dot T + \dfrac{\partial f}{\partial\xi_i}\,\dot\xi_i$. 

Obviously, $f = 0$ and $\dot f < 0$ at a time $t$ would imply $f < 0$ at the next instant 
of time. Such a change leads to an elastic state and is naturally associated 
with the term unloading. However, we also require that in an unloading 
process there is no change in the internal variables and temperature, i.e., 
$\dot\xi_i = \dot T = 0$. Hence, by Eq. (2) we stipulate that the criterion for unloading 
from a plastic state at constant temperature is 

(3) $\dfrac{\partial f}{\partial\sigma_{ij}}\,\dot\sigma_{ij} < 0$, $\quad f = 0$, during unloading. 

Otherwise it is said to be loading or neutral loading. Thus, 

(4) $\dfrac{\partial f}{\partial\sigma_{ij}}\,\dot\sigma_{ij} = 0$, $\quad f = 0$, during neutral loading; 

(5) $\dfrac{\partial f}{\partial\sigma_{ij}}\,\dot\sigma_{ij} > 0$, $\quad f = 0$, during loading. 

The function $f$ is also called a loading function because of its prominence 
in these loading criteria. 

A simple geometric interpretation of these criteria exists. Since the yield 
surface, i.e., the loading surface, is assumed to be a closed surface, we can 
speak of its inside and outside. Then, for a state of stress on the loading 
surface, loading, unloading, or neutral loading takes place, according to 
whether the stress increment vector is directed outward, inward, or along 
the tangent to the loading surface, respectively. Because of this geometric 
interpretation (also for reasons to be discussed in Sec. 6.9), it is important 
to obey the sign convention in writing Eq. (1) so that $\partial f/\partial\sigma_{ij}$ is directed 
along the outward normal to the surface $f = 0$. 
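
The criteria can be made concrete for the von Mises function $f = J_2 - k^2$, for which the gradient $\partial f/\partial\sigma_{ij}$ equals the stress deviation. The sketch below is a hedged illustration, assuming NumPy, constant temperature, and no change in internal variables; the function names and the tolerance `tol` are hypothetical. It classifies a stress rate by the sign of the product of the gradient and the stress rate.

```python
import numpy as np

def mises_gradient(sigma):
    """Gradient of f = J2 - k^2 with respect to stress, i.e. the stress deviation."""
    sigma = np.asarray(sigma, dtype=float)
    return sigma - np.trace(sigma) / 3.0 * np.eye(3)

def loading_state(sigma, sigma_dot, k, tol=1e-9):
    """Classify a stress rate applied at the stress state sigma.

    Returns "elastic" if sigma lies inside the yield surface; otherwise the sign
    of (df/dsigma_ij) * sigma_dot_ij decides among loading, neutral loading,
    and unloading. tol is an assumed relative tolerance.
    """
    grad = mises_gradient(sigma)
    j2 = 0.5 * float(np.sum(grad * grad))
    if j2 < k**2 * (1.0 - tol):
        return "elastic"
    sigma_dot = np.asarray(sigma_dot, dtype=float)
    rate = float(np.sum(grad * sigma_dot))
    scale = float(np.linalg.norm(grad) * np.linalg.norm(sigma_dot))
    if abs(rate) <= tol * scale:
        return "neutral loading"   # stress rate tangent to the loading surface
    return "loading" if rate > 0.0 else "unloading"

if __name__ == "__main__":
    k = 100.0
    shear = np.array([[0.0, k, 0.0], [k, 0.0, 0.0], [0.0, 0.0, 0.0]])
    print(loading_state(shear, -shear, k))      # stress rate directed inward
    print(loading_state(shear, np.eye(3), k))   # purely hydrostatic stress rate
```

A purely hydrostatic stress rate is always classified as neutral here, because the von Mises gradient has zero trace.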

6.7. ISOTROPIC STRESS THEORIES OF YIELD FUNCTION 

A material is said to be isotropic if there is no orientation effect in 
the material. If the yield function $f$ depends only on the invariants (with 
respect to rotation of coordinates) of stress, strain, and strain history, then 
the plastic characteristics of the material are isotropic. If the yield function 
$f$ is an isotropic function of the stress alone, then the theory of plasticity 
is called an isotropic stress theory. In such theories 

(1) $f = f(I_1, I_2, I_3)$, 

where $I_1$, $I_2$, $I_3$ are the three invariants of the stress tensor $\sigma_{ij}$. 
Equivalently, we may write Eq. (1) in terms of principal stresses, 

(2) $f = f(\sigma_1, \sigma_2, \sigma_3)$. 

If the principal stresses $\sigma_1$, $\sigma_2$, $\sigma_3$ are taken as the coordinate axes, the 
surface $f(\sigma_1, \sigma_2, \sigma_3) = 0$ can be plotted in a three-dimensional stress space. 
For example, if we take $f = J_2 - k^2$, with $k$ being a constant, the surface 
$f = 0$ appears as a circular cylinder whose axis is equally inclined to the 
coordinate axes, as shown in Fig. 6.7:1. 

Fig. 6.7:1. A yield surface in the principal stress plane. 


According to P. W. Bridgman$^{6.3}$ (1923), the plastic deformation of metals 
essentially is independent of hydrostatic pressure; Crossland$^{6.3}$ (1954) 
and many others have verified this result. If this conclusion is accepted, the 
yield function will be independent of $I_1 = \sigma_1 + \sigma_2 + \sigma_3$. Then it is 
advantageous to introduce the stress deviations $\sigma'_{ij} = \sigma_{ij} - \delta_{ij}\sigma_{kk}/3$. Let 
$J_2$, $J_3$ be the second and third invariants of the stress deviation $\sigma'_{ij}$. 
Then we have $f = f(J_2, J_3)$. Von Mises took the simplest of such functions, 
assuming $f = J_2 - \text{const}$. The surface $f(J_2) = 0$, when plotted in the space 
of principal stresses, will be a cylinder perpendicular to the plane 

(3) $I_1 = \sigma_1 + \sigma_2 + \sigma_3 = 0$. 

Its axis is equally inclined to the coordinate axes $\sigma_1$, $\sigma_2$, $\sigma_3$, but its cross 
section is no longer circular if $J_3$ participates in $f$. 

When $f$ depends on $J_2$, $J_3$ alone, it can be written in the form 
$f(\sigma_1 - \sigma_3, \sigma_2 - \sigma_3)$. Then a two-dimensional plot of the surface $f = 0$ 
is possible, with $\sigma_1 - \sigma_3$, $\sigma_2 - \sigma_3$ as coordinate axes (see Fig. 6.7:2). 

Another way of representing a yield surface when it is unaffected by 
hydrostatic pressure is to project the yield surface on the so-called $\pi$-plane, 
$\sigma_1 + \sigma_2 + \sigma_3 = 0$. For example, von Mises’ criterion would appear as a circle 
on this plane, and Tresca’s criterion would appear as a regular hexagon, as 
shown in Fig. 6.7:3. The $\pi$-plane projection will be used extensively below 
in discussing the flow and the hardening rules in plasticity. 

Fig. 6.7:2. Yield surface plotted on the plane of ($\sigma_1 - \sigma_3$), ($\sigma_2 - \sigma_3$). 

Fig. 6.7:3. Projection of yield surfaces on the $\pi$-plane. 
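
The projection onto the plane $\sigma_1 + \sigma_2 + \sigma_3 = 0$ amounts to removing the mean stress from the vector of principal stresses. The following Python sketch, with hypothetical function names and assuming NumPy, illustrates this and checks that the distance from the hydrostatic axis equals $\sqrt{2J_2}$, so that a surface of constant $J_2$ projects to a circle.

```python
import numpy as np

def pi_plane_projection(principal):
    """Project the principal-stress vector onto the plane s1 + s2 + s3 = 0."""
    p = np.asarray(principal, dtype=float)
    return p - p.mean()

def pi_plane_radius(principal):
    """Distance of the stress point from the hydrostatic axis s1 = s2 = s3."""
    return float(np.linalg.norm(pi_plane_projection(principal)))

def j2_from_principal(principal):
    """J2 written in terms of principal stress differences."""
    s1, s2, s3 = np.asarray(principal, dtype=float)
    return ((s1 - s2)**2 + (s2 - s3)**2 + (s3 - s1)**2) / 6.0

if __name__ == "__main__":
    state = [3.0, 1.0, -1.0]
    shifted = [s + 50.0 for s in state]   # same state plus a hydrostatic stress
    print(pi_plane_projection(state), pi_plane_projection(shifted))
    print(pi_plane_radius(state), np.sqrt(2.0 * j2_from_principal(state)))
```

Two stress states that differ only by a hydrostatic stress map to the same point, which is why a pressure-insensitive yield surface is fully described by its trace on this plane.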

6.8. FURTHER EXAMPLES OF YIELD FUNCTIONS 

Let $\sigma'_{ij}$ stand for the stress deviation and $J_2$, $J_3$ for the second and 
third invariants of $\sigma'_{ij}$, respectively. Von Mises’ and Tresca’s yield 
functions discussed in Sec. 6.5 imply initial isotropy, and equality of tensile 
and compressive yield stresses at all stages of the deformation. The more 
general expression 

(1) $f = F(J_2, J_3) - k^2$ 

still contains no Bauschinger effect. A simple example is 

(2) $F = J_2^3 - cJ_3^2$. 

A good correlation with Osgood’s experimental data$^{6.3}$ (1947) is obtained 
by taking $c = 2.25$. 

To include Bauschinger effect, but preserve isotropy, the following yield 
functions have been suggested. 

(3) $f = F(J_2) - m\,\sigma'_{ij}e^{(p)}_{ij} - k^2$, $\quad m$ a constant, 

(4) $f = F(J_2, J_3) - m\,\sigma'_{ij}e^{(p)}_{ij} - k^2$, 

(5) $f = F(J_2, J_3) - H(J_2, J_3)\,\sigma'_{ij}e^{(p)}_{ij} - k^2$, 

(6) $f = F(J_2, J_3) - [P(J_2, J_3)\,\sigma'_{ij} + Q(J_2, J_3)\,t_{ij}]\,e^{(p)}_{ij} - k^2$, 

where 

(7) $t_{ij} = \sigma'_{ik}\sigma'_{kj} - \tfrac{2}{3}\,J_2\,\delta_{ij}$. 

Anisotropic Materials. As mentioned before, for general anisotropic 
materials, the yield criterion has to be expressed in terms of the nine 
components of the stress tensor $\sigma_{ij}$. Following Betten$^{6.6}$ (1982, 1988), we assume 
the initial yield function in a quadratic form 

(8) $f(H_{ijkl}\,\sigma_{ij}\sigma_{kl}) = 0$, 

where $H_{ijkl}$ are the components of a fourth-order material tensor. 
Bauschinger effect is still excluded. Due to the symmetry conditions 

$H_{ijkl} = H_{klij} = H_{jikl} = H_{ijlk}$, 

the $H_{ijkl}$ are composed of 21 independent constants. If we assume 
incompressibility for the plastic deformation, the number of independent 
constants reduces to 15 and the yield criterion can be written as 

(9) $f(H_{ijkl}\,\sigma'_{ij}\sigma'_{kl}) = 0$. 

Several examples follow. 

Hill’s Criterion for Anisotropic Materials. Equation (9) reduces to 
Hill’s yield criterion, if 

(10) $2f = H_{ijkl}\,\sigma'_{ij}\sigma'_{kl} - 1 = 0$, 

where 

(11) $H_{1111} = G + H$, $\quad H_{2222} = H + F$, $\quad H_{3333} = F + G$, 
$\quad H_{1122} = -H$, $\quad H_{2233} = -F$, $\quad H_{3311} = -G$, 
$\quad H_{2323} = L/2$, $\quad H_{3131} = M/2$, $\quad H_{1212} = N/2$, 

in which $F$, $G$, $H$, $L$, $M$, and $N$ are material constants. Written out 
in extenso, Hill’s equation is in the form 

(12) $2f(\sigma_{ij}) = F(\sigma_{yy} - \sigma_{zz})^2 + G(\sigma_{zz} - \sigma_{xx})^2 + H(\sigma_{xx} - \sigma_{yy})^2$ 
$\quad + 2L\sigma_{yz}^2 + 2M\sigma_{zx}^2 + 2N\sigma_{xy}^2 - 1$. 
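
The equivalence between the tensor form and the written-out form of Hill's criterion can be verified numerically. The sketch below is an illustration under stated assumptions: NumPy is available, the shear entries are taken as $H_{2323} = L/2$, $H_{3131} = M/2$, $H_{1212} = N/2$ together with all entries implied by the symmetry conditions, and the function names are hypothetical.

```python
import numpy as np

def hill_tensor(F, G, H, L, M, N):
    """Assemble the fourth-order tensor H_ijkl, including symmetric partners."""
    T = np.zeros((3, 3, 3, 3))
    T[0, 0, 0, 0] = G + H
    T[1, 1, 1, 1] = H + F
    T[2, 2, 2, 2] = F + G
    # Couplings between normal components: H_1122 = -H, H_2233 = -F, H_3311 = -G.
    for i, j, value in [(0, 1, -H), (1, 2, -F), (2, 0, -G)]:
        T[i, i, j, j] = T[j, j, i, i] = value
    # Shear entries (assumed): H_2323 = L/2, H_3131 = M/2, H_1212 = N/2.
    for i, j, value in [(1, 2, L / 2.0), (2, 0, M / 2.0), (0, 1, N / 2.0)]:
        T[i, j, i, j] = T[i, j, j, i] = T[j, i, i, j] = T[j, i, j, i] = value
    return T

def hill_2f_tensor(sigma, T):
    """2f = H_ijkl s_ij s_kl - 1, evaluated on the stress deviation s."""
    sigma = np.asarray(sigma, dtype=float)
    s = sigma - np.trace(sigma) / 3.0 * np.eye(3)
    return float(np.einsum("ijkl,ij,kl->", T, s, s)) - 1.0

def hill_2f_explicit(sigma, F, G, H, L, M, N):
    """2f written out in stress components (x, y, z correspond to 1, 2, 3)."""
    sigma = np.asarray(sigma, dtype=float)
    xx, yy, zz = sigma[0, 0], sigma[1, 1], sigma[2, 2]
    yz, zx, xy = sigma[1, 2], sigma[2, 0], sigma[0, 1]
    return (F * (yy - zz)**2 + G * (zz - xx)**2 + H * (xx - yy)**2
            + 2.0 * L * yz**2 + 2.0 * M * zx**2 + 2.0 * N * xy**2 - 1.0)

if __name__ == "__main__":
    constants = (1.0, 2.0, 3.0, 4.0, 5.0, 6.0)   # arbitrary illustrative values
    sigma = np.array([[1.0, 0.3, -0.2], [0.3, -0.5, 0.4], [-0.2, 0.4, 2.0]])
    print(hill_2f_tensor(sigma, hill_tensor(*constants)),
          hill_2f_explicit(sigma, *constants))
```

With $F = G = H = 1/(6k^2)$ and $L = M = N = 3F$ the expression reduces to $J_2/k^2 - 1$, so the von Mises criterion is recovered as a special case.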

Mohr-Coulomb Criterion for Pressure-Sensitive Materials. A yield 
criterion depending on the mean stress is necessary when it applies to soils, 
rocks, concrete, or porous materials. One such criterion occurs in the Mohr- 
Coulomb theory of rupture. This criterion postulates that yield occurs in 
a body on a plane on which the normal and shear stresses reach a critical 
combination, as in dry friction between surfaces. The critical condition can 
be expressed by two bounding curves represented by the equations 

(13) $f = \tau \pm g(\sigma) = 0$, 

where $\sigma$ and $\tau$ are the normal and shear stresses on the failure plane. If the 
state of stress ($\sigma$, $\tau$) is described by points in a shaded area bounded by the 
three Mohr’s circles shown in Fig. 3.9:1 in Chapter 3, then the bounding 
curves $\tau = \mp g(\sigma)$ can be drawn on the ($\sigma$, $\tau$) plane. A material is safe if 
its state of stress lies within these bounding curves. The state of stress is 
critical if the bounding curves become tangent to the largest of the three 
Mohr’s circles. 



Fig. 6.8:1. Mohr-Coulomb yield criterion. 


A special case of the Mohr-Coulomb criterion is that the bounding 
curves are straight lines, 

(14) $g(\sigma) = c - \sigma\tan\phi$

with $c$ being a material constant and $\phi$ the angle of internal friction. On
the $(\sigma, \tau)$ plane, we can represent the criterion graphically by the straight
line BC tangent to the largest Mohr’s circle. See Fig. 6.8:1. Let $\sigma_1$, $\sigma_2$, $\sigma_3$
be the principal stresses with $\sigma_1 \ge \sigma_2 \ge \sigma_3$; then

$\dfrac{\sigma_1 - \sigma_3}{2} = \left( c\cot\phi - \dfrac{\sigma_1 + \sigma_3}{2} \right)\sin\phi$.

Hence the yield surface Eq. (13) can be expressed in terms of the maximum
and minimum principal stresses $\sigma_1$, $\sigma_3$ as follows:

(15) $f = \dfrac{\sigma_1 - \sigma_3}{2} + \dfrac{\sigma_1 + \sigma_3}{2}\sin\phi - c\cos\phi = 0$.

From the Mohr’s circle, we have

(16) $\sigma = \dfrac{\sigma_1 + \sigma_3}{2} + \dfrac{\sigma_1 - \sigma_3}{2}\sin\phi$,

(17) $\tau = \dfrac{\sigma_1 - \sigma_3}{2}\cos\phi$.

Eliminating $\sigma_1$, $\sigma_3$, we can rewrite the yield surface Eq. (15) in the form

$\tau = c - \sigma\tan\phi$,


which reduces to the Tresca criterion if $\phi = 0$ and $c = \tau_Y = \sigma_Y/\sqrt{3}$. Let

(18) $\sigma_t = \dfrac{2c\cos\phi}{1 + \sin\phi}$,

(19) $\sigma_c = \dfrac{2c\cos\phi}{1 - \sin\phi}$.

Equation (15) becomes

(20) $\dfrac{\sigma_1}{\sigma_t} - \dfrac{\sigma_3}{\sigma_c} = 1$,

in which $\sigma_t$, $\sigma_c$ are identified as the yield strengths in tension and in
compression, respectively. Note that $\sigma_t < \sigma_c$. From Eqs. (18) and (19), we can
express the parameters $c$, $\phi$ in terms of $\sigma_t$, $\sigma_c$:

$\phi = \sin^{-1}\left( \dfrac{\sigma_c - \sigma_t}{\sigma_c + \sigma_t} \right)$ with $c = \dfrac{1}{2}\sqrt{\sigma_c \sigma_t}$.

Thus $c$, $\phi$ can be obtained from measured values of $\sigma_t$, $\sigma_c$ in uniaxial tension
and compression tests.
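The conversion between $(c, \phi)$ and $(\sigma_t, \sigma_c)$ can be sketched in a few lines. This is an illustrative example only; the function names are hypothetical, the angle is taken in radians, and tension is counted positive as in Eq. (15).

```python
import math

def mc_strengths(c, phi):
    """Eqs. (18)-(19): tensile and compressive yield strengths from c and phi."""
    sigma_t = 2.0 * c * math.cos(phi) / (1.0 + math.sin(phi))
    sigma_c = 2.0 * c * math.cos(phi) / (1.0 - math.sin(phi))
    return sigma_t, sigma_c

def mc_parameters(sigma_t, sigma_c):
    """Inverse relations: cohesion c and friction angle phi from the two strengths."""
    phi = math.asin((sigma_c - sigma_t) / (sigma_c + sigma_t))
    c = 0.5 * math.sqrt(sigma_c * sigma_t)
    return c, phi

def mc_yield(s1, s3, c, phi):
    """Eq. (15), with s1 >= s3 the maximum and minimum principal stresses."""
    return 0.5 * (s1 - s3) + 0.5 * (s1 + s3) * math.sin(phi) - c * math.cos(phi)

if __name__ == "__main__":
    sigma_t, sigma_c = mc_strengths(10.0, math.radians(30.0))
    print(sigma_t, sigma_c)
    print(mc_parameters(sigma_t, sigma_c))
```

Uniaxial tension $(\sigma_1, \sigma_3) = (\sigma_t, 0)$ and uniaxial compression $(0, -\sigma_c)$ both make `mc_yield` vanish, which is the content of Eq. (20).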

Equation (20) is a straight line between the positive $\sigma_1$-axis and the
negative $\sigma_3$-axis on the octahedral plane for a fixed first stress invariant $I_1$.
The line intersects the positive $\sigma_1$-axis at

(21) $\sigma_1 = (\sigma_1)_0 = \dfrac{4c\cos\phi + I_1(1 - \sin\phi)}{3 + \sin\phi} \quad (> 0)$,

which is derived by a substitution of $2\sigma_3 = I_1 - \sigma_1$ into Eq. (15). In other
words, at the intersection, $\sigma_2 = \sigma_3$ and $\sigma_1 + \sigma_2 + \sigma_3 = I_1$. Similarly we find
the intersection of the line with the negative $\sigma_3$-axis to be

(22) $\sigma_3 = -(\sigma_3)_0 = -\dfrac{4c\cos\phi - I_1(1 + \sin\phi)}{3 - \sin\phi} \quad (< 0)$,

which is obtained by substituting $2\sigma_1 = I_1 - \sigma_3$ in Eq. (15), i.e. $\sigma_2 = \sigma_1$
and $\sigma_1 + \sigma_2 + \sigma_3 = I_1$. It can be shown that $(\sigma_3)_0 > (\sigma_1)_0$.

The yield surface is an irregular hexagon on the octahedral plane or
$\pi$-plane as shown in Fig. 6.8:2. The yield locus is symmetric about the
$\sigma_1$, $\sigma_2$, $\sigma_3$ axes and can be obtained by the symmetry condition.



Fig. 6.8:2. Mohr-Coulomb and Drucker-Prager yield surfaces.

Using the results of Problem 6.3 at the end of this section, we can write
the yield surface Eq. (15) in the form

(23) $f = \dfrac{I_1}{3}\sin\phi + \sqrt{J_2}\,\sin(\theta + 60^\circ) + \sqrt{\dfrac{J_2}{3}}\,\cos(\theta + 60^\circ)\sin\phi - c\cos\phi = 0$,

where $I_1$ is the first stress invariant, $J_2$ is the second deviatoric stress invariant
given by Eq. (3.10:8), and $\theta$ is the angle between the principal deviatoric
stress axis $\sigma'_1$ and the projection vector of the stress state on the octahedral
plane. Equation (23) is valid only for $0 \le \theta \le 60^\circ$ (in the region between
the positive $\sigma'_1$-axis and the negative $\sigma'_3$-axis where $\sigma_1 \ge \sigma_2 \ge \sigma_3$). For
other values of $\theta$, the yield locus is obtained by the symmetric condition
about the principal axes $\sigma'_1$, $\sigma'_2$, $\sigma'_3$.
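The equivalence of the principal-stress form, Eq. (15), and the invariant form, Eq. (23), can be checked numerically. The sketch below is illustrative only: the function names are hypothetical, and the principal stresses are generated from $(I_1, J_2, \theta)$ with the representation stated in Problem 6.3(b), taking the common factor as $2\sqrt{J_2}/\sqrt{3}$.

```python
import math

def principal_stresses(I1, J2, theta_deg):
    """sigma_1, sigma_2, sigma_3 from (I1, J2, theta), following Problem 6.3(b)."""
    r = 2.0 * math.sqrt(J2 / 3.0)      # common factor 2*sqrt(J2)/sqrt(3)
    t = math.radians(theta_deg)
    a = math.radians(120.0)
    p = I1 / 3.0                       # mean stress
    return p + r * math.cos(t), p + r * math.cos(a - t), p + r * math.cos(a + t)

def f_principal(s1, s3, c, phi):
    """Eq. (15): Mohr-Coulomb function of the extreme principal stresses."""
    return 0.5 * (s1 - s3) + 0.5 * (s1 + s3) * math.sin(phi) - c * math.cos(phi)

def f_invariant(I1, J2, theta_deg, c, phi):
    """Eq. (23): the same function in terms of I1, J2 and theta (0 to 60 degrees)."""
    t = math.radians(theta_deg + 60.0)
    return (I1 / 3.0 * math.sin(phi) + math.sqrt(J2) * math.sin(t)
            + math.sqrt(J2 / 3.0) * math.cos(t) * math.sin(phi) - c * math.cos(phi))

if __name__ == "__main__":
    c, phi = 5.0, math.radians(25.0)
    s1, s2, s3 = principal_stresses(-12.0, 9.0, 20.0)
    print(f_principal(s1, s3, c, phi), f_invariant(-12.0, 9.0, 20.0, c, phi))
```

For $\theta$ outside the range $0$ to $60^\circ$ the ordering $\sigma_1 \ge \sigma_2 \ge \sigma_3$ no longer holds, so `f_invariant` should then be applied only after relabeling the axes by symmetry.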

Betten’s Form for Pressure Sensitive Materials. Betten (1982) 
introduced the following form of yield function for pressure sensitive 
materials, 

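(24) $f = H_{ijkl}(\sigma_{ij} - \delta_{ij} b)(\sigma_{kl} - \delta_{kl} b) - k^2$.

If the material is isotropic, $H_{ijkl}$ can be written as

(25) $H_{ijkl} = a_1\delta_{ij}\delta_{kl} + a_2(\delta_{ik}\delta_{jl} + \delta_{il}\delta_{jk})$.

A substitution of $H_{ijkl}$ above into Eq. (24) yields

(26) $f = 4a_2 J_2 + (3a_1 + 2a_2)\left(\dfrac{I_1^2}{3} - 2bI_1 + 3b^2\right) - k^2 = 4a_2(J_2 + b_1 I_1^2 + b_2 I_1 + b_3) = 0$,

where $b_1$, $b_2$, and $b_3$ are material constants and $I_1$ is the first stress invariant.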

Drucker-Prager Criterion. Equation (26) reduces to the Drucker-Prager
yield function

(27) $f = \sqrt{J_2} + c_1 I_1 - c_2 = 0$,

if $b_1 = -c_1^2$, $b_2 = 2c_1 c_2$, and $b_3 = -c_2^2$. The yield locus on any octahedral
plane (fixed $I_1$) is a circle. If the radius of the circle is $(\sigma_3)_0$ as given in
Eq. (22) on all octahedral planes, we have

$c_2 = \dfrac{4c\cos\phi}{\sqrt{3}(3 - \sin\phi)}$, $c_1 = \dfrac{1}{\sqrt{3}}\,\dfrac{\sin\phi + 1}{3 - \sin\phi}$.

If the radius equals $(\sigma_1)_0$ as defined in Eq. (21) on all octahedral planes,
we find

$c_2 = \dfrac{4c\cos\phi}{\sqrt{3}(3 + \sin\phi)}$, $c_1 = \dfrac{1}{\sqrt{3}}\,\dfrac{\sin\phi - 1}{3 + \sin\phi}$.

The yield functions discussed up to this point contain no Bauschinger 
effect. However, the yield function 

(28) $f(H_{ijkl}\,\sigma_{ij}\,\sigma_{kl}) = H_{ijkl}(\sigma_{ij} - m e^{(p)}_{ij})(\sigma_{kl} - m e^{(p)}_{kl}) - k^2$

exhibits the Bauschinger effect, which is controlled by the constant m. The 
initial anisotropy is not preserved during deformation. Yield functions such 
as this will be discussed more fully in Sec. 6.12. 

All these examples use a single analytic function to represent the entire
yield surface. If a yield surface is composed of piecewise smooth surfaces,
which meet to form corners, it would be convenient to use a separate
expression for each of these piecewise smooth surfaces. This concept leads to
Koiter’s generalization (see Sec. 6.10).

Problem 6.3. Let $\sigma'_1$, $\sigma'_2$, $\sigma'_3$ be the projections of the principal-stress axes
$\sigma_1$, $\sigma_2$, $\sigma_3$ on an octahedral plane and $\mathbf{n}'_1$ be the unit vector in the $\sigma'_1$ direction. The
components of $\mathbf{n}'_1$ with respect to the principal axes can be expressed in the form
$[\cos(90^\circ - \alpha), -a, -a]$, where $90^\circ - \alpha$ is the angle between the $\sigma_1$- and $\sigma'_1$-axes
with $\cos\alpha = 1/\sqrt{3}$ and “$a$” is a constant. (An octahedral plane is the plane
whose normal makes equal angles $\alpha$ with each of the principal axes. Also $\mathbf{n}'_1$
makes equal angles with the $\sigma_2$, $\sigma_3$-axes.)

(a) Show that $\mathbf{n}'_1 = \dfrac{1}{\sqrt{6}}(2, -1, -1)$.

(b) Let $S_i$ be the principal deviatoric stress corresponding to $\sigma_i$. Show that

$\begin{bmatrix} \sigma_1 \\ \sigma_2 \\ \sigma_3 \end{bmatrix} = \dfrac{I_1}{3}\begin{bmatrix} 1 \\ 1 \\ 1 \end{bmatrix} + \begin{bmatrix} S_1 \\ S_2 \\ S_3 \end{bmatrix} = \dfrac{I_1}{3}\begin{bmatrix} 1 \\ 1 \\ 1 \end{bmatrix} + \dfrac{2\sqrt{J_2}}{\sqrt{3}}\begin{bmatrix} \cos\theta \\ \cos(120^\circ - \theta) \\ \cos(120^\circ + \theta) \end{bmatrix}$,

where $I_1$ is the first stress invariant, $J_2 = (S_1^2 + S_2^2 + S_3^2)/2$ is the second
deviatoric-stress invariant, and $\theta$ is the angle between the $\sigma'_1$-axis and the
vector representing the projection of the stress state on the octahedral
plane. This result shows that one can choose $I_1$, $J_2$, $\theta$ as the coordinate
axes to represent a stress state. [Hint: The components of the projection
of a stress state on the octahedral plane with respect to the $\sigma_1$, $\sigma_2$, $\sigma_3$-axes
are $(S_1, S_2, S_3)$ and the magnitude of the projection is $\sqrt{S_1^2 + S_2^2 + S_3^2}$
$(= \sqrt{2J_2})$. Also $\sqrt{2J_2}\cos\theta = (S_1, S_2, S_3)\cdot\mathbf{n}'_1$.]

Problem 6.4. Show that the line given by Eq. (20) intersects the $\sigma_1$, $\sigma_3$-axes
at $(\sigma_1)_0$, $(\sigma_3)_0$ as given in Eqs. (21) and (22), respectively. [Hint: At $(\sigma_1)_0$,
the deviatoric stress $S_3 = -S_1/2$. Thus, $\sigma_1 = \dfrac{I_1}{3} + S_1$ and $\sigma_3 = \dfrac{I_1}{3} - \dfrac{S_1}{2}$ at
$\sigma_1 = (\sigma_1)_0$. Show that $S_1 = \dfrac{4c\cos\phi}{3 + \sin\phi} - \dfrac{4I_1\sin\phi}{3(3 + \sin\phi)}$ from (6.8:15). Similarly
at $(\sigma_3)_0$, $S_1 = -\dfrac{S_3}{2}$ $(S_3 < 0)$, $\sigma_1 = \dfrac{I_1}{3} - \dfrac{S_3}{2}$ and $\sigma_3 = \dfrac{I_1}{3} + S_3$. Show that
$S_3 = -\dfrac{4c\cos\phi}{3 - \sin\phi} + \dfrac{4I_1\sin\phi}{3(3 - \sin\phi)}$.]

Problem 6.5. Show that the yield stresses for the Drucker-Prager criterion
in simple shear, tension and compression are $c_2$, $\sqrt{3}c_2/(1 + \sqrt{3}c_1)$ and
$\sqrt{3}c_2/(1 - \sqrt{3}c_1)$, respectively. For this criterion to be physically meaningful,
we must have $c_2 > 0$ and $\sqrt{3}c_1 < 1$.

Problem 6.6. The Mises-Schleicher criterion is

$f(\sigma_{ij}) = 3J_2 + (\sigma_c - \sigma_t)I_1 - \sigma_c\sigma_t = 0$.

Show that $\sigma_c$ and $\sigma_t$ are, respectively, the compressive and tensile yield stresses
in uniaxial tests.

6.9. WORK HARDENING — DRUCKER’S HYPOTHESIS 
AND DEFINITION 

Work hardening in a simple tension experiment means that the stress
is a monotonically increasing function of increasing strain. To generalize
this concept, D. C. Drucker^{6.3} (1951) considers the work done on a material
element in equilibrium by an external agency, which slowly applies a
set of self-equilibrating forces and then slowly removes them. This external
agency is to be understood as entirely separate and distinct from the agency
which causes the existing state of stress. This process of application and
removal of the additional stress is called a stress cycle. Removing the additional
stress enables the stress of the body to return to the original stress
state, but the strain state can be different if plastic deformation occurred
during the stress cycle. Work hardening is then defined to mean that, for
all such added sets of stresses, positive work is done by the external agency
during the application of the stresses, and the net work performed by it over
the cycle of application and removal is either zero or positive. Rephrased,
Drucker’s definition of work hardening means that useful net energy over
and above the elastic energy cannot be extracted from the material and the
system of forces acting upon it.

Consider a volume of material in which there is a homogeneous state of
stress $\sigma_{ij}$ and strain $e_{ij}$. Suppose that an external agency applies a small
surface traction, which alters the stress at each point by $d\sigma_{ij}$, and the strain
by $de_{ij}$. On removal of the small traction, $d\sigma_{ij}$ returns to zero, and a strain
$de^{(e)}_{ij}$ is recovered. Then, according to the definition above, the material is
said to be work-hardening if the following two conditions hold true.

(1) $d\sigma_{ij}\,de_{ij} > 0$, upon loading;

(2) $d\sigma_{ij}\,(de_{ij} - de^{(e)}_{ij}) \ge 0$, on completing a cycle.

Let $de^{(p)}_{ij}$ denote the plastic strain increment, which is not recovered by the
process named above. Then, since $de^{(p)}_{ij} = de_{ij} - de^{(e)}_{ij}$, the second condition
may be written as

(3) $d\sigma_{ij}\,de^{(p)}_{ij} \ge 0$.

Drucker extended the definition of work-hardening to allow for a finite
$d\sigma_{ij}$ produced by the external agency. In fact, the initial stress, say $\bar\sigma_{ij}$, may
at any point be inside or on the yield surface far away from $\sigma_{ij}$. The work
per unit volume done by the external agency is $(\sigma_{ij} - \bar\sigma_{ij})\,de^{(p)}_{ij}$. Equation (3)
is replaced by

(4) $(\sigma_{ij} - \bar\sigma_{ij})\,de^{(p)}_{ij} \ge 0$.

Drucker’s hypothesis Eq. (3) or Eq. (4) can be satisfied only by materials
whose subsequent yield strength increases with deformation. Any material
on which an external agency does positive work during an elastic-plastic
stress cycle is called a hardening material. Otherwise, it is considered as
nonhardening or work-softening material. It can be shown that Eq. (3) or
Eq. (4) holds for work-softening and perfectly plastic materials [von Mises
(1928), Bishop and Hill (1951)] under Il’iushin’s postulate of plasticity in
strain space to be considered in Sec. 6.14.

Equation (4) is also called the principle of maximum plastic dissipation.
It can be written in the form

(5) $\sigma_{ij}\,\dot e^{(p)}_{ij} = D(\dot e^{(p)}_{ij}, \xi_i) \ge \bar\sigma_{ij}\,\dot e^{(p)}_{ij}$,

where $D(\dot e^{(p)}_{ij}, \xi_i)$ depends on the plastic strain-rates $\dot e^{(p)}_{ij}$ and the internal
variables $\xi_i$ only. Equation (5) will be used for limit analysis to be discussed
in Chapter 10.

6.10. IDEAL PLASTICITY 

According to Drucker’s definition of strain hardening, we can define 
ideal plasticity as a plastic deformation without strain hardening. It is 
mathematically specified by the condition that, when plastic deformation 
occurs, the equality sign prevails in Eq. (6.9:3): 

(1) $d\sigma_{ij}\,de^{(p)}_{ij} = 0$,





and that the yield function is unaffected by $e^{(p)}_{ij}$. The differentials $d\sigma_{ij}$ and
$de^{(p)}_{ij}$ must be interpreted as in Sec. 6.9.

As an application of this definition, we shall derive the flow rule during 
an ideal plastic deformation. We notice that the yield function furnishes a 
criterion to tell whether yielding occurs or not. If yielding does occur, we 
need further information concerning the increment or rate of deformation in 
order to complete the description of the material behavior. In other words, 
we need a flow rule. 

Now, for an ideal plastic material, we assume that a yield function $f(\sigma_{ij})$
exists, which is a function of the stresses $\sigma_{ij}$ and not of the strains $e^{(p)}_{ij}$,
such that $f(\sigma_{ij}) \le 0$ prevails; and

(2) $\dot e^{(p)}_{ij} \ne 0$ only if $f(\sigma_{ij}) = 0$.

Since $f$ is assumed to be a function of $\sigma_{ij}$ only, any change in stresses
during plastic flow must satisfy the relation

(3) $df = \dfrac{\partial f}{\partial\sigma_{ij}}\,d\sigma_{ij} = 0$.


Equation (3) is often called the consistency condition for ideal plasticity.
When plastic flow occurs, Eq. (1) holds. A comparison of Eqs. (1) and (3)
shows that

(4) ▲ $de^{(p)}_{ij} = d\lambda\,\dfrac{\partial f}{\partial\sigma_{ij}}$,

where $d\lambda$ is an arbitrary constant of proportionality. We can also write
Eq. (4) in the rate form:

(5) $\dot e^{(p)}_{ij} = \dfrac{1}{\mu}\,\dfrac{\partial f}{\partial\sigma_{ij}}$.

If $\partial f/\partial\sigma_{ij}$ has the dimension of stress, then $\mu$ has the physical dimensions
of the coefficient of viscosity, but it is certainly not a material constant,
and may vary during the deformation. The sign of $d\lambda$ or $\mu$ is restricted
by the condition that plastic flow always involves dissipation of mechanical
energy, a condition which may be written as

(6) $\dot W = \sigma_{ij}\,\dot e^{(p)}_{ij} > 0$.

Equation (4) gives the rule of plastic flow (plastic strain increments) in
ideal plasticity. It is a prototype of the theory of plastic potential developed
by Richard von Mises^{6.3} (1928). The general theory is discussed in Sec. 6.11
et seq.


Prager’s Geometric Interpretation. An interesting geometric interpretation
of Eqs. (4) and (5) was given by Prager. The formula $f(\sigma_{ij}) = 0$
defines a surface in the nine-dimensional stress space with $\sigma_{ij}$ $(i, j = 1, 2, 3)$
as coordinates. The outward normal vector to this surface has the components
$\partial f/\partial\sigma_{ij}$. Equation (5) states thus that the vector of plastic deformation
rate is normal to the surface $f = 0$ in the stress space. So, during
unloading,

$f = 0$, $\dfrac{\partial f}{\partial\sigma_{ij}}\,d\sigma_{ij} < 0$,

the stress increment $d\sigma_{ij}$ is pointing inward from the yield surface, while
during loading or neutral loading,

$f = 0$, $\dfrac{\partial f}{\partial\sigma_{ij}}\,d\sigma_{ij} = 0$,

i.e., the stress increment $d\sigma_{ij}$ is on the tangential plane at a stress point
on the yield surface. Because the yield surface is fixed for ideal plasticity,
$d\sigma_{ij}$ cannot point outward.


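Incremental Stress-Strain Relationship in Plastic Flow. Using the
basic assumption that the total strain increment can be decomposed into
an elastic and a plastic component,

(7) $de_{ij} = de^{(e)}_{ij} + de^{(p)}_{ij}$,

together with the flow rule and the consistency condition, we can determine
$d\lambda$ in Eqs. (4) and (5) in terms of the total strain increment and the current
state of stress. We have

(8) $d\sigma_{ij} = D_{ijkl}\,[de_{kl} - de^{(p)}_{kl}] = D_{ijkl}\,de_{kl} - D_{ijkl}\,\dfrac{\partial f}{\partial\sigma_{kl}}\,d\lambda$.

Multiplying both sides of Eq. (8) by $\partial f/\partial\sigma_{ij}$, we obtain

(9) $\dfrac{\partial f}{\partial\sigma_{ij}}\,d\sigma_{ij} = \dfrac{\partial f}{\partial\sigma_{ij}}\,D_{ijkl}\,de_{kl} - \dfrac{\partial f}{\partial\sigma_{ij}}\,D_{ijkl}\,\dfrac{\partial f}{\partial\sigma_{kl}}\,d\lambda$.

Since $(\partial f/\partial\sigma_{ij})\,d\sigma_{ij} = 0$ from the consistency condition, we can solve for $d\lambda$:

(10) ▲ $d\lambda = \dfrac{\dfrac{\partial f}{\partial\sigma_{ij}}\,D_{ijkl}\,de_{kl}}{\dfrac{\partial f}{\partial\sigma_{mn}}\,D_{mnrs}\,\dfrac{\partial f}{\partial\sigma_{rs}}}$.

A substitution of Eq. (10) into Eq. (8) establishes the general stress-strain
increment relationship

(11) $d\sigma_{ij} = D_{ijkl}\,[de_{kl} - de^{(p)}_{kl}] = D^{ep}_{ijkl}\,de_{kl}$,

where

(12) $D^{ep}_{ijkl} = D_{ijkl} - \dfrac{D_{ijtu}\,\dfrac{\partial f}{\partial\sigma_{tu}}\,\dfrac{\partial f}{\partial\sigma_{qp}}\,D_{qpkl}}{\dfrac{\partial f}{\partial\sigma_{mn}}\,D_{mnrs}\,\dfrac{\partial f}{\partial\sigma_{rs}}}$.

For isotropic materials, $D_{ijkl} = \lambda\delta_{ij}\delta_{kl} + G(\delta_{ik}\delta_{jl} + \delta_{il}\delta_{jk})$, Eq. (11) reduces
to

(13) $d\sigma_{ij} = \lambda\,de_{\alpha\alpha}\,\delta_{ij} + 2G\,de_{ij} - \dfrac{(\lambda n_{kk}\delta_{ij} + 2G n_{ij})(\lambda n_{\beta\beta}\,de_{\alpha\alpha} + 2G n_{kl}\,de_{kl})}{\lambda n_{\gamma\gamma} n_{\beta\beta} + 2G}$,

where $\lambda$ is the Lamé constant, and

(14) $n_{ij} = \dfrac{\partial f}{\partial\sigma_{ij}}\left(\dfrac{\partial f}{\partial\sigma_{kl}}\dfrac{\partial f}{\partial\sigma_{kl}}\right)^{-1/2}$

is a unit outward normal to the yield surface. For materials with plastic
deformation insensitive to hydrostatic pressure, we have $n_{kk} = 0$ and Eq. (13)
reduces to

(15) $d\sigma_{ij} = \lambda\,de_{\alpha\alpha}\,\delta_{ij} + 2G\,de_{ij} - 2G\,n_{ij} n_{kl}\,de_{kl}$.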
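The elastic-plastic moduli of Eq. (12) can be assembled directly as a fourth-order array. The sketch below is illustrative and makes its own assumptions: the function names are hypothetical, the elastic moduli are isotropic, and the yield function is taken to be of von Mises type so that $\partial f/\partial\sigma_{ij}$ is proportional to the stress deviator.

```python
import numpy as np

def isotropic_moduli(lam, G):
    """D_ijkl = lam d_ij d_kl + G (d_ik d_jl + d_il d_jk)."""
    d = np.eye(3)
    return (lam * np.einsum("ij,kl->ijkl", d, d)
            + G * (np.einsum("ik,jl->ijkl", d, d) + np.einsum("il,jk->ijkl", d, d)))

def mises_normal(sigma):
    """Unit normal n_ij of Eq. (14), assuming df/dsigma is proportional to the deviator."""
    s = sigma - np.trace(sigma) / 3.0 * np.eye(3)
    return s / np.linalg.norm(s)

def elastoplastic_moduli(D, dfds):
    """Eq. (12): D - (D:g)(g:D) / (g:D:g), with g = df/dsigma."""
    Dg = np.einsum("ijkl,kl->ij", D, dfds)
    gD = np.einsum("ij,ijkl->kl", dfds, D)
    return D - np.einsum("ij,kl->ijkl", Dg, gD) / np.einsum("ij,ij->", dfds, Dg)

if __name__ == "__main__":
    D = isotropic_moduli(80.0, 60.0)
    sigma = np.array([[120.0, 30.0, 0.0], [30.0, -40.0, 10.0], [0.0, 10.0, 5.0]])
    n = mises_normal(sigma)
    D_ep = elastoplastic_moduli(D, n)
    de = 1e-3 * np.array([[1.0, 0.2, -0.3], [0.2, -0.5, 0.4], [-0.3, 0.4, 0.7]])
    d_sigma = np.einsum("ijkl,kl->ij", D_ep, de)
    # Consistency: the stress increment has no component along the normal.
    print(np.einsum("ij,ij->", n, d_sigma))
```

Since Eq. (12) is unchanged when $\partial f/\partial\sigma_{ij}$ is rescaled, passing the unit normal instead of the raw gradient gives the same moduli.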

W. T. Koiter^{6.3} (1953) generalized this theory of plasticity by allowing
the yield limit to be specified by a set of yield functions,

$f_1(\sigma_{ij})$, $f_2(\sigma_{ij})$, $\ldots$, $f_n(\sigma_{ij})$.

A state of stress is said to be below the yield limit if all these functions
are negative. For a state of stress at the yield limit, at least one yield
function vanishes, while none has a value greater than zero.

If the functions $f_h = \cdots = f_m = 0$, $(1 \le h \le m \le n)$ whereas all other
$f$’s are negative, then the Koiter generalization of the flow rule given in
Eq. (5) is

(16) $de^{(p)}_{ij} = d\lambda_h\,\dfrac{\partial f_h}{\partial\sigma_{ij}} + \cdots + d\lambda_m\,\dfrac{\partial f_m}{\partial\sigma_{ij}}$,

where $d\lambda_h, \ldots, d\lambda_m$ are nonnegative proportional factors. Thus, in the case
of ideal plasticity, the basic concept leads at once to a general incremental
stress-strain relationship.


Example 1. Under the von Mises yield condition, Eq. (6.5:1) and the
assumption that the constant $k$ is independent of plastic deformation, we
can derive the flow rule below according to Eq. (5):

(17) $\dot e^{(p)}_{ij} = \dfrac{1}{\mu}\,\sigma'_{ij}$,

which is the rule presented in Sec. 6.3.

Example 2. Tresca’s yield condition can be expressed in terms of Koiter’s
generalized plastic potential by defining the $f$’s as follows:

(18) $f_1 = \sigma_2 - \sigma_3 - 2k$, $f_2 = \sigma_3 - \sigma_1 - 2k$,
     $f_3 = \sigma_1 - \sigma_2 - 2k$, $f_4 = -(\sigma_2 - \sigma_3) - 2k$,
     $f_5 = -(\sigma_3 - \sigma_1) - 2k$, $f_6 = -(\sigma_1 - \sigma_2) - 2k$,

where $\sigma_1$, $\sigma_2$, $\sigma_3$ are the principal stresses. A state of stress is below the
yield limit if $f_1, \ldots, f_6$ are all negative. Yielding occurs when one or more
of the $f$’s are equal to zero. None of the $f$’s can have a positive value.
When yielding occurs, the flow rule is given by Eq. (16). For example, if
$f_1 = 0$, while all the other $f$’s are negative, then

(19) $\dot e^{(p)}_2 = \lambda_1$, $\dot e^{(p)}_3 = -\lambda_1$, $\dot e^{(p)}_1 = 0$,

where $\lambda_1 > 0$, and $e_1$, $e_2$, $e_3$ are the principal strains corresponding to
$\sigma_1$, $\sigma_2$, $\sigma_3$. The principal axes of the strain tensor coincide with those of the
stress tensor under Tresca’s condition.
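The bookkeeping of Example 2 (which of the six functions are active, and what flow direction results) can be sketched as follows. This is an illustration under stated assumptions: the function names are hypothetical, stresses are given directly as principal values, and the list index `r` runs from 0 to 5 for $f_1$ to $f_6$.

```python
import numpy as np

# Row r holds the gradient of f_{r+1} with respect to (sigma_1, sigma_2, sigma_3).
GRADS = np.array([[0, 1, -1], [-1, 0, 1], [1, -1, 0],
                  [0, -1, 1], [1, 0, -1], [-1, 1, 0]], dtype=float)

def tresca_functions(s, k):
    """Values of the six Tresca functions for principal stresses s and constant k."""
    return GRADS @ np.asarray(s, dtype=float) - 2.0 * k

def active_set(s, k, tol=1e-9):
    """Indices of the functions that vanish; a positive value is inadmissible."""
    f = tresca_functions(s, k)
    if np.any(f > tol):
        raise ValueError("stress state lies outside the yield limit")
    return [r for r in range(6) if abs(f[r]) <= tol]

def plastic_strain_rate(s, k, multipliers):
    """Koiter's sum over active functions; multipliers maps index r to lambda_r >= 0."""
    active = active_set(s, k)
    rate = np.zeros(3)
    for r, lam in multipliers.items():
        if lam < 0.0 or r not in active:
            raise ValueError("multipliers must be nonnegative and belong to active functions")
        rate += lam * GRADS[r]
    return rate

if __name__ == "__main__":
    print(active_set([0.0, 1.0, -1.0], 1.0))                    # smooth face
    print(plastic_strain_rate([0.0, 1.0, -1.0], 1.0, {0: 2.0}))
    print(active_set([1.0, 1.0, -1.0], 1.0))                    # corner of the hexagon
```

At a corner two functions are active, and the strain rate is any nonnegative combination of their two gradients, which is the sense in which the direction of flow is no longer unique there.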

Remark: Formal application of the method of derivation above to a simple
load-deflection experiment may appear difficult. If we twist a tube of ideal
plastic material in torsion, the yield condition according to von Mises’
criterion is reached when the shearing stress $\tau = k$, the yield stress. Plastic flow
will continue with no possibility of increasing $\tau$ and $k$. Hence, if we limit
ourselves to torsion and apply Eqs. (1) and (2), we would have obtained
$d\tau = 0$, which yields no useful information. To deduce anything significant
we would have to consider adding other loads, for example, tension or
internal pressure in the tube. These additional varieties of loads will alter
the plastic flow, thus providing nontrivial changes $d\sigma_{ij}$ and $de^{(p)}_{ij}$ to which
the derivation above applies.

Problem 6.7. A plane strain condition is defined as $e_{33} = e_{31} = e_{32} = 0$.
This condition requires $de^{(p)}_{3i} = -de^{(e)}_{3i}$, for $i = 1, 2, 3$. Consider the von Mises
criterion with the associated flow rule $de^{(p)}_{ij} = d\lambda\,\sigma'_{ij}$. Show that

$\sigma_3 = \tfrac{1}{2}(\sigma_1 + \sigma_2)$,

where $\sigma_i$ are the principal stresses.

6.11. FLOW RULE FOR WORK HARDENING MATERIALS 

In this section, Drucker’s hypothesis discussed in Sec. 6.9 is taken to
define work-hardening, and von Mises’s plastic potential theory is taken as
the framework to derive the flow rule. Von Mises^{6.3} (1928) suggested that
there exists a plastic potential function $h(\sigma_{ij})$ so that the plastic strain rate
could be derived from

(1) $\dot e^{(p)}_{ij} = \Lambda\,\dfrac{\partial h}{\partial\sigma_{ij}}$,

where $\Lambda$ is a positive scalar factor. If the plastic potential is the same as
the yield function, $h = f$, Eq. (1) is called the associated flow rule. On the
other hand, if $h \ne f$, the flow rule is called non-associated. In the preceding
section we have seen that $h = f$ for an ideally plastic body. Now, we shall
consider the more general case of work hardening materials by allowing the
plastic potential to be a function of not only stresses but also temperature
$T$ and internal variables $\xi_i$. We shall show that under Drucker’s hypothesis
and the assumption that the elastic moduli $D_{ijkl}$ are independent of
the plastic deformation, the yield function $f$ itself is the plastic potential.
Experimental observations show that the associated flow rule characterizes
the plastic deformation of metals quite well, but the nonassociated flow
rule provides a better representation for the plastic deformation of porous
materials such as rocks, concrete, and soils.


Consequences of Drucker’s Hypothesis. We shall prove the following
consequences of Drucker’s hypothesis from which the term work-hardening
is defined:

A. The yield surface and all subsequent loading surfaces must be 
convex. 

B. The plastic strain increment vector must be normal to the loading 
surface at a regular point, and it must lie between the adjacent 
normals to the loading surface at a corner of the surface. 

C. The rate of change of plastic strain must be a linear function of the 
rate of change of the stress. 

Proof of A. To facilitate the proof,^3 let us think of an increment of stress
$d\sigma_{ij}$ as the components of a vector $d\mathbf{S}$ in the nine-dimensional stress space,
and the corresponding plastic strains $de^{(p)}_{ij}$ as the components of a vector
$d\mathbf{E}$ in the same (stress) space. Then, by Eq. (6.9:3), we have

(2a) $d\mathbf{S}\cdot d\mathbf{E} = |d\mathbf{S}|\,|d\mathbf{E}|\cos\psi \ge 0$,

which implies that

(2b) $-\dfrac{\pi}{2} \le \psi \le \dfrac{\pi}{2}$,

i.e., the angle between $d\mathbf{S}$ and $d\mathbf{E}$ must be acute.

Now let P be a regular (smooth) point on the yield surface and $d\mathbf{E}$ be a
plastic strain vector at P. According to Eqs. (2a) and (2b), all stress
increments $d\mathbf{S}$ that will produce $d\mathbf{E}$, i.e., vectors that represent loading in
the stress space and end at P, and corresponding to $d\mathbf{E}$, must form an
acute angle with $d\mathbf{E}$. If we represent a hyper-plane normal to $d\mathbf{E}$ by AB
in Fig. 6.11:1, then the vectors $d\mathbf{S}$ $(= d\sigma_{ij})$ must all originate on one side of
the hyper-plane AB. The initial points of the $d\mathbf{S}$ vectors, represented by
stress states inside or on the yield surface, must thus all lie on one side of AB.
However, $d\mathbf{S}$ are loading vectors (since $d\mathbf{E}$ exists); they are outward vectors
whose directions are bounded by the tangent plane of the yield surface
(see Sec. 6.6). Hence, the hyper-plane AB must be tangent to the loading
surface. Since $d\mathbf{E}$ is normal to AB, it is also normal to the yield surface at
the point of tangency. Furthermore, since the yield surface lies on one side
of the tangent plane, the yield surface is convex at P. Finally, since P is an
arbitrary point on the yield surface, the convexity of the entire yield surface
is established.

^3 The following explanation follows that of P. M. Naghdi^{6.3} (1960).

Proof of B. Since at a regular point on a surface there is a unique tangent
plane, the hyper-plane is unique at a regular point of the yield surface.
Thus, the direction of $d\mathbf{E}$ normal to the hyper-plane is also unique. In
other words, at a smooth point of the yield surface the direction of $d\mathbf{E}$
(i.e. $de^{(p)}_{ij}$) is independent of the direction of $d\sigma_{ij}$. At a corner on the
yield surface, there can be more than one limiting tangent plane;
convexity must still hold, but the direction of the strain increment $de^{(p)}_{ij}$ may
depend on the direction of the loading vector $d\sigma_{ij}$.

Proof of C. The truth of C is exhibited by Eqs. (14), (18) and (19) below. 
To derive these equations, we need the following formulas for normality and 
consistency. 

Normality Condition. Under Drucker’s hypothesis, the normality of the
plastic strain rate vector to the loading surface at a smooth point of the
loading surface requires that

(3) ▲ $\dot e^{(p)}_{ij} = \Lambda n_{ij}$,

where $\Lambda$ is a proportional scalar factor, which can be a function of stress,
strain, strain history, temperature and internal variables, and

(4) $n_{ij} = \dfrac{\partial f}{\partial\sigma_{ij}}\left(\dfrac{\partial f}{\partial\sigma_{kl}}\dfrac{\partial f}{\partial\sigma_{kl}}\right)^{-1/2}$

is a unit normal to the yield surface at the loading point. Since the work
done by an external agency during loading must be positive, it is easy to
show that $\Lambda$ must be nonnegative. Equation (1) is similar to Eq. (6.10:5);
thus, the loading function $h$ plays the role of plastic potential. We transform
Eq. (1) to the form of Eq. (3) so that the flow equation takes on the form

(5) $\dot e^{(p)}_{ij} = \Lambda h_{ij}$,

where $h_{ij}$ is a unit normal to a plastic potential $h(\sigma_{ij}, T, \xi_i)$ at a smooth
point of the surface,

(6) $h_{ij} = \dfrac{\partial h}{\partial\sigma_{ij}}\left(\dfrac{\partial h}{\partial\sigma_{kl}}\dfrac{\partial h}{\partial\sigma_{kl}}\right)^{-1/2}$.

Comparing Eqs. (3) and (5), we obtain

(7) $h_{ij} = n_{ij}$,

(8) $h = f(\sigma_{ij}, T, \xi_i)$.

In other words, Eq. (5) is an associated flow rule.

Fig. 6.11:1. Possible stress increments corresponding to a plastic strain increment $d\mathbf{E}$.


Consistency Condition. As plastic flow proceeds, the subsequent loading
surfaces pass through the loading point, and we must have

(9) $f(\sigma_{ij}, T, \xi_i) = 0$.

Expressing the internal variable-rates in the form

(10) $\dot\xi_i = \Lambda h_i$,

in which $h_i$ is a known function of $\sigma_{ij}$, $T$, and $\xi_i$, we must have, during
loading,

where $K_p$ $(> 0)$ is called the plastic modulus. Prager named Eq. (11) the
consistency condition, which means that loading from a plastic state must
lead to another plastic state. Combining Eq. (12) with Eq. (5), we find the
flow rule in the form

(14) ▲ $\dot e^{(p)}_{ij} = \dfrac{h_{ij}\,n_{kl}\,\dot\sigma_{kl}}{K_p}$.

Since $\dot e^{(p)}_{ij} = 0$ during unloading [$n_{ij}\dot\sigma_{ij} < 0$, see Eq. (6.6:3)], one may write
the flow rule in a slightly different form

(15) $\dot e^{(p)}_{ij} = \dfrac{h_{ij}\,n_{kl}\,\dot\sigma_{kl}}{K_p}\,H(n_{mn}\dot\sigma_{mn})$,

where $H$ is the Heaviside function defined as

$H(x) = 1$, if $x > 0$,
$H(x) = 0$, if $x < 0$.

In other words, it is the unit-step function denoted by $\mathbf{1}(x)$ elsewhere in
this book.


Incremental Strain-Stress Relation. Consider the yield surface

(16) $f(\sigma_{ij}, e^{(p)}_{ij}, \kappa) = 0$.

The internal variables are $e^{(p)}_{ij}$ and $\kappa$, with the latter being a function of $e^{(p)}_{ij}$
also. The flow rule is given by Eq. (5) and the equations for the internal
variable-rates are

$\dot\kappa = \dfrac{\partial\kappa}{\partial e^{(p)}_{kl}}\,\dot e^{(p)}_{kl} = \Lambda\,\dfrac{\partial\kappa}{\partial e^{(p)}_{kl}}\,h_{kl}$.

The consistency equation is

$\dfrac{\partial f}{\partial\sigma_{ij}}\,\dot\sigma_{ij} + \left(\dfrac{\partial f}{\partial e^{(p)}_{rs}} + \dfrac{\partial f}{\partial\kappa}\dfrac{\partial\kappa}{\partial e^{(p)}_{rs}}\right) h_{rs}\,\Lambda = 0$.

We obtain $\Lambda$ in the form of Eq. (12) with

(17) $K_p = -\left(\dfrac{\partial f}{\partial e^{(p)}_{rs}} + \dfrac{\partial f}{\partial\kappa}\dfrac{\partial\kappa}{\partial e^{(p)}_{rs}}\right) h_{rs}\left(\dfrac{\partial f}{\partial\sigma_{kl}}\dfrac{\partial f}{\partial\sigma_{kl}}\right)^{-1/2}$

for the plastic modulus. These results were first given by Prager^{6.3} (1948)
and Drucker^{6.3} (1959). Equation (14) proves the linearity statement (item
C) given in Drucker’s hypothesis. Using

$de_{ij} = D^{-1}_{ijkl}\,d\sigma_{kl} + de^{(p)}_{ij}$

and Eq. (14) leads to the incremental strain-stress relation (the constitutive
equations) for plastic deformation

(18) $de_{ij} = [D^{-1}_{ijkl} + h_{ij} n_{kl}/K_p]\,d\sigma_{kl}$, or
     $d\sigma_{kl} = [D^{-1}_{klrs} + h_{kl} n_{rs}/K_p]^{-1}\,de_{rs} = D^{ep}_{klrs}\,de_{rs}$,

where $D^{-1}_{ijkl}$, often denoted by $C_{ijkl}$ and called the elastic flexibility tensor,
are the components of the inverse of the fourth-order elastic modulus tensor
$D_{ijkl}$. Equation (18) breaks down for ideal plasticity because

$\dfrac{\partial f}{\partial\xi_i} = 0$, i.e., $\dfrac{\partial f}{\partial e^{(p)}_{mn}} = \dfrac{\partial\kappa}{\partial e^{(p)}_{mn}} = 0$,

and $K_p$ becomes zero. In this case we have to use the incremental stress-strain
relation given in Eqs. (6.10:11) and (6.10:12).
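The inversion in Eq. (18) can be carried out numerically or in closed form. The sketch below is an illustration under assumptions not made in the text: second-order tensors are stored as 6-component Mandel vectors (shear components scaled by $\sqrt{2}$), the elastic moduli are isotropic, the function names are hypothetical, and the closed form used is the Sherman-Morrison rank-one update.

```python
import numpy as np

def to_mandel(t):
    """Symmetric 3x3 tensor -> 6-vector whose dot product equals the double contraction."""
    r2 = np.sqrt(2.0)
    return np.array([t[0, 0], t[1, 1], t[2, 2], r2 * t[1, 2], r2 * t[0, 2], r2 * t[0, 1]])

def isotropic_moduli_mandel(lam, G):
    """Isotropic elastic moduli as a 6x6 matrix in Mandel notation."""
    m = np.array([1.0, 1.0, 1.0, 0.0, 0.0, 0.0])
    return lam * np.outer(m, m) + 2.0 * G * np.eye(6)

def hardening_tangent(D, n, h, Kp):
    """Invert the bracket of Eq. (18) numerically: [D^-1 + h n^T / Kp]^-1."""
    return np.linalg.inv(np.linalg.inv(D) + np.outer(h, n) / Kp)

def hardening_tangent_closed_form(D, n, h, Kp):
    """Sherman-Morrison form of the same inverse: D - (D h)(n D) / (Kp + n D h)."""
    return D - np.outer(D @ h, n @ D) / (Kp + n @ D @ h)

if __name__ == "__main__":
    D = isotropic_moduli_mandel(80.0, 60.0)
    s = np.array([[2.0, 0.5, 0.0], [0.5, -1.0, 0.3], [0.0, 0.3, -1.0]])  # trace-free
    n = to_mandel(s / np.linalg.norm(s))
    print(np.allclose(hardening_tangent(D, n, n, 25.0),
                      hardening_tangent_closed_form(D, n, n, 25.0)))
```

The closed form stays finite as $K_p \to 0$, where it goes over into the ideal-plasticity moduli of Eq. (6.10:12), whereas the bracket in Eq. (18) itself cannot be formed in that limit.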


Koiter’s Generalization. For a loading surface with corners, one can use
Koiter’s generalization to define the flow rule. Such a surface is composed
of a number of individual smooth loading surfaces $f_r$, which meet to form
corners. Koiter^{6.3} (1953) has shown that if the loading surfaces described
by $f_r = 0$ act independently, the total plastic deformation can be written
as the sum of contributions from certain of the $f_r$’s, as follows:

(19) $\dot e^{(p)}_{ij} = \sum_{r=1}^{n} C_r \Lambda_r (n_{ij})_r (n_{kl})_r\,\dot\sigma_{kl}$,

where $\Lambda_r$ are positive functions associated with $f_r$ as defined in Eq. (12)
and

(20) $C_r = 0$ if $f_r < 0$, or $(n_{kl})_r\,\dot\sigma_{kl} < 0$,
     $C_r = 1$ if $f_r = 0$, and $(n_{kl})_r\,\dot\sigma_{kl} \ge 0$,

in which $(n_{kl})_r$ are the components of the unit outward normal to $f_r$ at the
loading point. Equations (19) and (20) specify, of course, the condition of
yielding and loading.

It should be remarked that the properties deduced above, namely, the
convexity, normality, and linearity, follow from Drucker’s hypothesis, which is
often interpreted as a statement that the material is stable. Hence, these
properties hold only for the class of stable materials. Unstable materials
do exist. Mild steel at its upper yield point (see the point A* in Fig. 6.4:1) is
a well-known example. Other engineering materials such as rocks, concrete,
and soils exhibit softening phenomena for which

$d\sigma_{ij}\,de_{ij} < 0$ and $d\sigma_{ij}\,de^{(p)}_{ij} < 0$.

A formulation of the plasticity theory in the strain space is needed to
describe the softening behavior.

The normality of $de^{(p)}_{ij}$ and the convexity of the yield surface do not
hold under Drucker’s postulate, if $D_{ijkl}$ is a function of $e^{(p)}_{ij}$, i.e., there is
elastic-plastic coupling during the process of plastic deformation. This is
also the case for softening materials. Il’iushin (1960) showed that Drucker’s
postulate results in

daij[dD^haki + de$] > 0 
de\f = - (TkidD-} kl {e^ n ). 


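Example. If we choose $f = J_2 - \kappa$, where $J_2$ is the second invariant of the
stress deviation, and $\kappa$ is a hardening parameter, which depends on plastic
deformation, then the flow rule at yielding is

$\dot e^{(p)}_{ij} = G\,\dfrac{\partial f}{\partial\sigma_{ij}}\dfrac{\partial f}{\partial\sigma_{kl}}\,\dot\sigma_{kl} = G\,\sigma'_{ij}\,\sigma'_{kl}\,\dot\sigma_{kl} = G\,\sigma'_{ij}\,\dot J_2$, $\quad \dot J_2 > 0$,

where from Eqs. (5), (12) and (17),

$\dot J_2 = \dfrac{\partial f}{\partial\sigma'_{kl}}\dfrac{d\sigma'_{kl}}{dt}$, $\quad G = \dfrac{1}{K_p}\left(\dfrac{\partial f}{\partial\sigma_{mn}}\dfrac{\partial f}{\partial\sigma_{mn}}\right)^{-1}$.

If we set $f = F(J_2, J_3) - \kappa$, and assume $\partial\kappa/\partial e^{(p)}_{mn} \ne 0$ for some $m$ and $n$,
then the flow rule is

$\dot e^{(p)}_{ij} = G\left(\dfrac{\partial F}{\partial J_2}\,\sigma'_{ij} + \dfrac{\partial F}{\partial J_3}\,t_{ij}\right)\dot F$, $\quad \dot F > 0$,

where $t_{ij} = \sigma'_{ik}\sigma'_{kj} - \tfrac{2}{3}J_2\,\delta_{ij}$. Note that $\partial\sigma_{kk}/\partial\sigma_{ij} = \delta_{ij}$.

Note that the yield function of a work-hardening material depends on
the plastic strain $e^{(p)}_{ij}$ in a significant manner, in contrast to an ideal plastic
material, whose yield function is independent of the plastic strain.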


6.12. SUBSEQUENT LOADING SURFACES — ISOTROPIC AND 
KINEMATIC HARDENING RULES 

We have discussed yield surfaces and flow rules in previous sections.
Now we must consider the third aspect: the determination of subsequent
loading surfaces as plastic flow proceeds, i.e., to determine how the
internal variables including the plastic deformation $e^{(p)}_{ij}$ enter into the
loading surface

(1) $f(\sigma_{ij}, T, \xi_i) = 0$.

Laws governing this aspect are called hardening rules.

Fig. 6.12:1. Several hardening rules. (a) Initial yield condition (Tresca), (b) isotropic
hardening, (c) kinematic hardening (Prager), (d) kinematic hardening (Ziegler’s
modification), (e) independently acting plane loading surfaces, (f) interdependent plane
loading surfaces. (From Naghdi,^{6.3} 1960.)

If we assume that plastic deformation is independent of hydrostatic
pressure and that the plastic flow is incompressible, then the yield surfaces
in the principal stress space $(\sigma_1, \sigma_2, \sigma_3)$ are cylinders (not necessarily with
circular cross-section) of infinite length with axis $\sigma_1 = \sigma_2 = \sigma_3$ as illustrated
in Fig. 6.12:1. The plane $\sigma_1 + \sigma_2 + \sigma_3 = 0$ is called the $\pi$-plane, which is
perpendicular to the axis. Hence, the yield surfaces can be represented by
their cross sections on this plane (see Sec. 6.7). The cross-sectional curves
are closed, convex, and piecewise smooth; but they change in size and shape
during plastic deformation. For materials involving volume change in the
plastic deformation, the yield surface is not parallel to the $\sigma_1 = \sigma_2 = \sigma_3$
axis.

We illustrate several proposed hardening rules in Fig. 6.12:1: 

Isotropic Hardening. Isotropic hardening assumes that the material
remains isotropic during the process of plastic loading and that the subsequent
yield surface is a uniform expansion of the initial yield surface. The initial
and subsequent yield surfaces have the same center. Figure 6.12:1 shows
an example of isotropic hardening with a Tresca’s initial yield surface of a
regular hexagon on the $\pi$-plane $\sigma_1 + \sigma_2 + \sigma_3 = 0$ [Fig. 6.12:1(a)] and its
uniform expansion as a subsequent yield surface [Fig. 6.12:1(b)].

In the general case, if the effect of hydrostatic pressure on yield can be
neglected, the subsequent yield surfaces can be expressed in the form

(2) $f = f^*(J_2, J_3) - \kappa = 0$,

where $\kappa$ is an internal variable that characterizes the hardening of the
material. The strain-hardening hypothesis assumes that $\kappa$ is a monotonically
increasing function, which depends only on the effective plastic strain but
not the strain path. An effective or equivalent plastic strain, which is an
internal variable, can be defined as

(3) $\bar e^p = \int d\bar e^p$,

where

(4) ▲ $d\bar e^p = \sqrt{\tfrac{2}{3}\,de^{(p)}_{ij}\,de^{(p)}_{ij}}$.

Then $\kappa$ is a monotonically increasing function of $\bar e^p$. For uniaxial loading,
we have

(5) 


where e p is the total uniaxial plastic strain. 

Use of Eq. (6.11:12) for $\Lambda$ and the flow rule Eq. (6.11:14) for
$\dot e^p_{ij}$ in Eq. (4) leads to

(6) $\dot e^p_e = \sqrt{\tfrac{2}{3}}\,\frac{n_{kl}\dot\sigma_{kl}}{K_p} = \Lambda\sqrt{\tfrac{2}{3}}.$

The internal variable-rate defined in Eq. (6.11:10) becomes

(6a) $\dot\xi = \dot e^p_e = \Lambda\sqrt{\tfrac{2}{3}}.$

For the yield function as defined in Eq. (2), Eq. (6.11:17) gives

(7) $K_p = -\sqrt{\tfrac{2}{3}}\,\frac{\partial f}{\partial\kappa}\frac{\partial\kappa}{\partial e^p_e}\left(\frac{\partial f}{\partial\sigma_{kl}}\frac{\partial f}{\partial\sigma_{kl}}\right)^{-1/2},$

where $\partial f/\partial\kappa = -1$. Once $K_p$ is determined, we can
calculate $\dot e^p_e$ and $\dot e^p_{ij}$ from Eqs. (6) and (6.11:14). The
calculation of $K_p$ will be discussed later.

Another commonly used parameter for characterizing isotropic hardening
is the total plastic work defined by

(8) $W_p = \int\sigma_{ij}\,de^p_{ij} = \int\sigma'_{ij}\,de^p_{ij}.$

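Assuming $\kappa$ as a function of $W_p$ is called the work-hardening
hypothesis. The function $\kappa(W_p)$ or $\sigma_Y(W_p)$ can also be
determined from a uniaxial tension test, in which case we have

(9) $K_p = -\sigma_{ij}h_{ij}\,\frac{\partial f}{\partial\kappa}\frac{\partial\kappa}{\partial W_p}\left(\frac{\partial f}{\partial\sigma_{kl}}\frac{\partial f}{\partial\sigma_{kl}}\right)^{-1/2}.$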


For the von Mises yield function $f = \sqrt{J_2} - \kappa$, we have
$\kappa(e^p_e) = \sigma_Y(e^p_e)/\sqrt{3}$, where $\sigma_Y(e^p_e)$ is the
tensile yield stress. The yield stress can be determined from the uniaxial
curve of $\sigma$ versus $e^p$ $(= e^p_e)$ in simple tension and compression.
For linear hardening materials,

(10) $\sigma_Y = \sigma_Y^0 + E_p e^p_e,$

where $E_p$ is a material constant, from Eq. (8) we find

(11) $W_p = \left(\sigma_Y^0 + \tfrac{1}{2}E_p e^p_e\right)e^p_e.$

Then

(12) $\sigma_Y^2 = (\sigma_Y^0)^2 + 2E_p\left(\sigma_Y^0 + \tfrac{1}{2}E_p e^p_e\right)e^p_e = (\sigma_Y^0)^2 + 2E_p W_p.$

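In this case we have, from Eqs. (7) and (9),

$\frac{\partial f}{\partial\kappa}\frac{\partial\kappa}{\partial e^p_e} = -\frac{1}{\sqrt{3}}\frac{d\sigma_Y}{de^p_e} = -\frac{E_p}{\sqrt{3}},$

$\frac{\partial f}{\partial\kappa}\frac{\partial\kappa}{\partial W_p} = -\frac{1}{\sqrt{3}}\frac{d\sigma_Y}{dW_p} = -\frac{E_p}{\sqrt{3}\,\sigma_Y}.$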
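The linear-hardening relations (10)-(12) and the two routes to the plastic modulus can be checked numerically. The Python sketch below is illustrative only: the values of `sigma_y0` and `Ep` are hypothetical, the factor $\sqrt{2}$ for $(\partial f/\partial\sigma_{kl}\,\partial f/\partial\sigma_{kl})^{-1/2}$ with $f = \sqrt{J_2} - \kappa$ is worked out here rather than quoted from the text, and Eq. (9) is read with the factor $\sigma_{ij}h_{ij}$ (plastic work rate per unit $\Lambda$) under associated flow.

```python
import math

def yield_stress(ep_eff, sigma_y0, Ep):
    """Eq. (10): tensile yield stress for linear hardening."""
    return sigma_y0 + Ep * ep_eff

def plastic_work(ep_eff, sigma_y0, Ep, steps=1000):
    """Eq. (8) in uniaxial tension: W_p = integral of sigma_Y de^p (midpoint rule)."""
    h = ep_eff / steps
    return sum(yield_stress((i + 0.5) * h, sigma_y0, Ep) * h for i in range(steps))

# Hypothetical material data (stress in MPa), chosen only for illustration.
sigma_y0, Ep, ep_eff = 250.0, 2000.0, 0.05

Wp_numeric = plastic_work(ep_eff, sigma_y0, Ep)
Wp_closed = (sigma_y0 + 0.5 * Ep * ep_eff) * ep_eff            # Eq. (11)
sigma_y = yield_stress(ep_eff, sigma_y0, Ep)                   # Eq. (10)
sigma_y_work = math.sqrt(sigma_y0**2 + 2.0 * Ep * Wp_closed)   # Eq. (12)

# Plastic modulus for f = sqrt(J2) - kappa, once from Eq. (7) and once from Eq. (9).
inv_norm = math.sqrt(2.0)                   # (df/dsigma : df/dsigma)^(-1/2), derived here
Kp_strain = math.sqrt(2.0 / 3.0) * (Ep / math.sqrt(3.0)) * inv_norm
sigma_h = math.sqrt(2.0 / 3.0) * sigma_y    # sigma_ij h_ij on the yield surface, associated flow
Kp_work = (Ep / (math.sqrt(3.0) * sigma_y)) * sigma_h * inv_norm

print(Wp_numeric, Wp_closed, sigma_y, sigma_y_work, Kp_strain, Kp_work)
```

Under these assumptions both hypotheses give the same plastic modulus, $K_p = 2E_p/3$, which is what one would expect for a material whose uniaxial curve is the same straight line in either description.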

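Example of Isotropic Hardening. Consider a thin-walled tube subjected
to stretching and torsion, with tensile stress $\sigma\,(> 0)$ and shear
stress $\tau$. The Tresca and von Mises yield functions can be expressed in
the form $f = \bar\sigma - \sigma_Y$, with

(13a) $\bar\sigma = \sigma_1 - \sigma_3 = \sqrt{(\sigma_{zz} - \sigma_{\theta\theta})^2 + 4\sigma_{z\theta}^2}$ (Tresca),

and

(13b) $\bar\sigma = \sqrt{\sigma_{zz}^2 + \sigma_{\theta\theta}^2 + \sigma_{rr}^2 - \sigma_{zz}\sigma_{\theta\theta} - \sigma_{\theta\theta}\sigma_{rr} - \sigma_{rr}\sigma_{zz} + 3(\sigma_{z\theta}^2 + \sigma_{\theta r}^2 + \sigma_{rz}^2)}$ (von Mises).

For $\sigma_{zz} = \sigma$, $\sigma_{z\theta} = \tau$,
$\sigma_{rr} = \sigma_{\theta\theta} = \sigma_{rz} = \sigma_{r\theta} = 0$,
Eqs. (13a) and (13b) reduce to Eqs. (6.5:8-9). The yield surface becomes

$f = \bar\sigma - \sigma_Y = \sqrt{\sigma^2 + a\tau^2} - \sigma_Y = 0,$

where $a = 4$ for the case of Tresca, and $a = 3$ for the case of von Mises.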
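As to the flow rule, we obtain

(14) $\dot e^p_{zz} = \alpha\Lambda\frac{\partial f}{\partial\sigma_{zz}} = \Lambda\frac{\sigma}{b}, \quad \dot e^p_{z\theta} = \frac{\alpha\Lambda}{2}\frac{\partial f}{\partial\sigma_{z\theta}} = \Lambda\frac{a\tau}{2b}, \quad \dot e^p_{rz} = \dot e^p_{r\theta} = 0,$

where $\alpha$ is a normalization factor, $b = \sqrt{\sigma^2 + a^2\tau^2/2}$ and

$\dot e^p_{\theta\theta} = -\dot e^p_{zz}$, $\dot e^p_{rr} = 0$ for the Tresca criterion,

$\dot e^p_{\theta\theta} = \dot e^p_{rr} = -\tfrac{1}{2}\dot e^p_{zz}$ for the von Mises criterion.

Note that the factor 1/2 for $\dot e^p_{z\theta}$ in Eq. (14) is due to the
fact that we have treated $\sigma_{z\theta}$ and $\sigma_{\theta z}$ as one
rather than two independent variables in the yield function.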

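From Eq. (7), we have

$K_p = \sqrt{\tfrac{2}{3}}\,\frac{d\sigma_Y}{de^p_e}\,\frac{\bar\sigma}{b}.$

Then, $\Lambda$ can be derived from Eqs. (6) and (7):

$\Lambda = \sqrt{\tfrac{3}{2}}\left(\frac{d\sigma_Y}{de^p_e}\right)^{-1}\frac{\sigma\dot\sigma + a\tau\dot\tau}{\bar\sigma} = \sqrt{\tfrac{3}{2}}\left(\frac{d\sigma_Y}{de^p_e}\right)^{-1}\dot{\bar\sigma},$

where $d\sigma_Y/de^p_e$ is the slope of the uniaxial tension $\sigma$-$e^p$
(stress-plastic strain) curve evaluated at $e^p = e^p_e$. The equivalent
plastic strain $e^p_e$ is obtained from Eqs. (3) and (4),

$e^p_e = \int de^p_e = \sqrt{\tfrac{2}{3}}\int d\Lambda = \int\left(\frac{d\sigma_Y}{de^p_e}\right)^{-1}\frac{\sigma\,d\sigma + a\tau\,d\tau}{\bar\sigma} = \int\left(\frac{d\sigma_Y}{de^p_e}\right)^{-1}d\bar\sigma.$

For linear hardening materials, Eq. (10) gives

$\frac{d\sigma_Y}{de^p_e} = E_p.$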


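Finally, for isotropic materials, we have

$\dot e_{zz} = \frac{\dot\sigma}{E} + \dot e^p_{zz} = \frac{\dot\sigma}{E} + \Lambda\frac{\sigma}{b}, \quad 2\dot e_{z\theta} = \frac{\dot\tau}{G} + 2\dot e^p_{z\theta} = \frac{\dot\tau}{G} + \Lambda\frac{a\tau}{b}.$

When the stress path is given, we can obtain the strain history by
integrating these equations. This solution is complete because the system is
statically determinate, so the stress field can be determined directly from
the applied load.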
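As a rough illustration of integrating the strain history along a prescribed stress path, the following Python sketch treats the von Mises case ($a = 3$) with linear isotropic hardening. It uses the relation $de^p_e = d\bar\sigma/(d\sigma_Y/de^p_e)$ from above together with the standard Prandtl-Reuss components $de^p_{zz} = de^p_e\,\sigma/\bar\sigma$ and $2\,de^p_{z\theta} = 3\,de^p_e\,\tau/\bar\sigma$; that normalization is an assumption of the sketch, chosen so that the effective plastic strain agrees with Eq. (4) in uniaxial tension, and is not taken from the factor $b$ in Eq. (14). The function name and all material values are hypothetical.

```python
import math

def integrate_tube(path, E, G, sigma_y0, Ep):
    """Integrate strains along a (sigma, tau) stress path for a von Mises tube.

    Returns (e_zz, gamma_ztheta, ep_eff), where gamma_ztheta = 2 e_ztheta is the
    engineering shear strain. Linear isotropic hardening, Eq. (10), is assumed.
    """
    a = 3.0                        # von Mises: sigma_bar = sqrt(sigma^2 + 3 tau^2)
    e_zz = gamma = ep_eff = 0.0
    sigma_y = sigma_y0
    s_prev, t_prev = path[0]
    for s, t in path[1:]:
        # Elastic part of the increment.
        e_zz += (s - s_prev) / E
        gamma += (t - t_prev) / G
        sigma_bar = math.sqrt(s * s + a * t * t)
        if sigma_bar > sigma_y:
            # Plastic loading: d(ep_eff) = d(sigma_bar) / (d sigma_Y / d ep_eff).
            dep = (sigma_bar - sigma_y) / Ep
            sm, tm = 0.5 * (s + s_prev), 0.5 * (t + t_prev)   # midpoint stress
            sbar_m = math.sqrt(sm * sm + a * tm * tm)
            e_zz += dep * sm / sbar_m          # Prandtl-Reuss axial component
            gamma += dep * a * tm / sbar_m     # Prandtl-Reuss shear component
            ep_eff += dep
            sigma_y = sigma_bar                # yield surface expands to the load point
        s_prev, t_prev = s, t
    return e_zz, gamma, ep_eff

# Hypothetical data (MPa): a pure-tension path and a separate pure-torsion path.
E, G, sigma_y0, Ep = 200000.0, 80000.0, 250.0, 2000.0
n = 200
tension = [(350.0 * i / n, 0.0) for i in range(n + 1)]
tau_max = 350.0 / math.sqrt(3.0)
torsion = [(0.0, tau_max * i / n) for i in range(n + 1)]
print(integrate_tube(tension, E, G, sigma_y0, Ep))
print(integrate_tube(torsion, E, G, sigma_y0, Ep))
```

Both paths end at the same effective stress, so they accumulate the same effective plastic strain; only the distribution between axial and shear strain differs.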

Lee and Zaverl$^{6.5}$ (1978) and Chaboche$^{6.5}$ (1977) introduced a
nonlinear evolution equation for the isotropic hardening parameter $\kappa$:

$d\kappa = b(\kappa_s - \kappa)\,de^p_e,$

where $b$ and $\kappa_s$ are material constants. This equation can be
integrated to obtain

$\kappa(e^p_e) = \kappa_s + (\kappa_0 - \kappa_s)e^{-b e^p_e}.$

For cyclic loading, $e^p_e$ increases monotonically and $\kappa$ approaches
$\kappa_s$ after a number of cycles. The number of cycles required for
$\kappa$ to reach the steady-state value depends on the cyclic strain
magnitude.
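The saturation behavior of this evolution law is easy to check numerically. The short Python sketch below compares the closed-form solution with a forward-Euler integration of $d\kappa = b(\kappa_s - \kappa)\,de^p_e$; the parameter values are hypothetical and serve only as an illustration.

```python
import math

def kappa_closed(ep_eff, kappa0, kappa_s, b):
    """Closed-form solution kappa = kappa_s + (kappa0 - kappa_s) exp(-b ep_eff)."""
    return kappa_s + (kappa0 - kappa_s) * math.exp(-b * ep_eff)

def kappa_euler(ep_eff, kappa0, kappa_s, b, steps=20000):
    """Forward-Euler integration of d(kappa) = b (kappa_s - kappa) d(ep_eff)."""
    kappa = kappa0
    h = ep_eff / steps
    for _ in range(steps):
        kappa += b * (kappa_s - kappa) * h
    return kappa

# Hypothetical constants: initial value, saturation value, rate constant.
kappa0, kappa_s, b = 100.0, 200.0, 20.0
for ep in (0.05, 0.1, 0.2):
    print(ep, kappa_closed(ep, kappa0, kappa_s, b), kappa_euler(ep, kappa0, kappa_s, b))
```

Because $e^p_e$ only accumulates, the same curve is traversed under cyclic loading; larger strain amplitudes simply move along it faster per cycle.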


Kinematic Hardening. The kinematic hardening model assumes that
during plastic loading the yield surface translates in the stress space
without rotation and without change in size and shape. Figure 6.12:1(c)
illustrates Prager's kinematic hardening, showing that the initial yield
surface translates in the $\pi$-plane. To explain this rule, Prager$^{6.3}$
(1954) used a mechanical model, which can be represented as in Fig. 6.12:2.
The initial yield surface is regarded as a planar rigid frame lying on the
$\pi$-plane. The loading point on the $\pi$-plane is represented by a small,
frictionless pin. If the pin engages the frame, it may push the frame around.
Under the assumption of frictionlessness, any motion imparted by the pin to
the frame must be normal to the edge in contact. However, when a corner of
the frame is caught by the pin, the pin may carry the frame if the direction
of motion lies within a certain angle. Rotation of the frame is supposed to
be prevented by some mechanism. If the pin disengages and moves away from
the frame, the frame stays put, and the change represents an unloading. It
is obvious that none of the flow rules deduced from Drucker's hypothesis
is violated, and no theoretical objection can be raised against interpreting
the motion of the rigid frame as a hardening rule. In fact, the Bauschinger
effect is represented very simply, and the development of anisotropy due to
plastic deformation appears most naturally.



Fig. 6.12:2. A mechanical model used in explaining the kinematic hardening rule. 

With some variation in the model, Prager$^{6.3}$ (1954) was able to represent
various models of plasticity: rigid, perfectly plastic, rigid work-hardening
elastic, etc. Almost simultaneously, a similar concept was introduced
independently by Ishlinski$^{6.3}$ (1954). Further developments were made by
Prager,$^{6.3}$ Boyce,$^{6.3}$ Hodge,$^{6.3}$ Novozhilov, and others.
Hodge$^{6.3}$ (1956) points out that the concept of kinematic hardening can
be applied in the nine-dimensional stress space. An example of Prager's
kinematic hardening is given in Eq. (6.8:28).

Suppose that the initial yield surface is $f(\sigma_{ij}) = 0$. The
subsequent yield surface can be expressed as

(15) $f(\sigma_{ij} - \alpha_{ij}) = 0,$

where $\alpha_{ij}$, usually called the back stress, represents the
translation of the center of the initial yield surface during the process of
plastic loading. The formulation of a kinematic hardening model is to specify
the evolution of $\alpha_{ij}$ in terms of $e^p_{ij}$, $\sigma_{ij}$, and/or
$\alpha_{ij}$ itself. Prager's linear kinematic hardening model assumes that

(16) $\dot\alpha_{ij} = c\,\dot e^p_{ij} = c\Lambda h_{ij},$

where $c$ is a constant (Shield and Ziegler,$^{6.3}$ 1958). According to this
model the yield surface moves in the direction of the plastic strain rate,
i.e., the direction normal to the yield surface at the loading point. From
Eqs. (15), (16), (6.11:6) and (6.11:13), with $\alpha_{ij}$ as the internal
variables, we obtain

(17) $K_p = c\,h_{ij}n_{ij}.$

If the material obeys the associated flow rule, we have

(18) $K_p = c.$


Prager's linear kinematic model does not give consistent results for
three-dimensional and two-dimensional cases. It also introduces a transverse
softening in simple tension or compression, as shown below. In a uniaxial
tension test with load in the 1-direction, the plastic strain increments are

$de^{(p)}_1 = de^p, \quad de^{(p)}_2 = de^{(p)}_3 = -\tfrac{1}{2}de^p.$

From Eq. (16), we obtain

(19) $d\alpha_1 = c\,de^p, \quad d\alpha_2 = d\alpha_3 = -\tfrac{1}{2}d\alpha_1 = -\tfrac{1}{2}c\,de^p.$

The yield surface moves toward the negative 2- and 3-directions, causing
transverse softening, although no load is applied along these directions.
Ziegler$^{6.3}$ (1959) modified Prager's rule Eq. (16) by replacing it with

(20) $\dot\alpha_{ij} = \dot\mu(\sigma_{ij} - \alpha_{ij}),$

where $\dot\mu > 0$. Geometrically, this means the yield surface moves along
the direction of $\sigma_{ij} - \alpha_{ij}$, which is the radial vector
joining the instantaneous center $\alpha_{ij}$ and the loading point
$\sigma_{ij}$. Ziegler's modified kinematic hardening is illustrated in
Fig. 6.12:1(d). The determination of $\dot\mu$ will involve the yield
criterion.
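The difference between the two translation rules in uniaxial tension can be seen with a few lines of Python. This is only a sketch in principal components: `prager_increment` and `ziegler_increment` are hypothetical helper names, and the values of `c`, `dep` and `dmu` are arbitrary (in practice $\dot\mu$ follows from the consistency condition, which is not modeled here).

```python
def prager_increment(c, dep):
    """Eq. (16) in uniaxial tension along direction 1: d(alpha_i) = c de_i^(p)."""
    dep_principal = (dep, -0.5 * dep, -0.5 * dep)   # incompressible plastic flow
    return tuple(c * d for d in dep_principal)

def ziegler_increment(dmu, sigma, alpha):
    """Eq. (20): d(alpha_i) = dmu (sigma_i - alpha_i), with dmu > 0."""
    return tuple(dmu * (s - a) for s, a in zip(sigma, alpha))

# Hypothetical values, for illustration only.
c, dep, dmu = 3000.0, 1.0e-3, 0.01
sigma = (300.0, 0.0, 0.0)     # uniaxial tension in the 1-direction
alpha = (0.0, 0.0, 0.0)       # virgin material: yield surface centered at the origin

print(prager_increment(c, dep))                # transverse components are negative
print(ziegler_increment(dmu, sigma, alpha))    # transverse components stay zero
```

With Prager's rule the center shifts toward negative values in the unloaded directions, whereas Ziegler's rule moves it only along the line joining the center and the load point.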

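Armstrong and Frederick (1966) introduced a nonlinear term in Prager's
model Eq. (16),

(21) $\dot\alpha_{ij} = c\,\dot e^p_{ij} - \gamma\alpha_{ij}\dot e^p_e = \Lambda\left(c\,h_{ij} - \sqrt{\tfrac{2}{3}}\,\gamma\alpha_{ij}\right).$

Using Eq. (21) as the internal variable-rate and noting that
$\partial f/\partial\alpha_{ij} = -\partial f/\partial\sigma_{ij}$,
Eq. (6.11:13) reduces to

(22) $K_p = \left(c\,h_{ij} - \sqrt{\tfrac{2}{3}}\,\gamma\alpha_{ij}\right)n_{ij}.$

This model was further developed by Chaboche$^{6.6}$ (1977, 1986).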


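Combined Isotropic and Kinematic Hardening. The yield surface of
a combined isotropic and kinematic hardening model can be expressed in
the form

(23) $f(\sigma_{ij} - \alpha_{ij}, \kappa) = 0,$

where $\alpha_{ij}$ and $\kappa$ are the internal variables. For
$\dot\alpha_{ij}$ in the form of Eq. (21) and $\kappa$ as a function of the
equivalent plastic strain, Eq. (6.11:13) gives

(24) $K_p = \left(c\,h_{ij} - \sqrt{\tfrac{2}{3}}\,\gamma\alpha_{ij}\right)n_{ij} - \sqrt{\tfrac{2}{3}}\,\frac{\partial f}{\partial\kappa}\frac{\partial\kappa}{\partial e^p_e}\left(\frac{\partial f}{\partial\sigma_{ij}}\frac{\partial f}{\partial\sigma_{ij}}\right)^{-1/2}.$

If the material obeys the associated flow rule ($h_{ij} = n_{ij}$), Eq. (24)
becomes

(25) $K_p = c - \sqrt{\tfrac{2}{3}}\,\gamma\alpha_{ij}n_{ij} - \sqrt{\tfrac{2}{3}}\,\frac{\partial f}{\partial\kappa}\frac{\partial\kappa}{\partial e^p_e}\left(\frac{\partial f}{\partial\sigma_{ij}}\frac{\partial f}{\partial\sigma_{ij}}\right)^{-1/2}.$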

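From Eqs. (6.11:4, 14, 18), reproduced here for clarity,

$n_{ij} = \frac{\partial f}{\partial\sigma_{ij}}\left(\frac{\partial f}{\partial\sigma_{kl}}\frac{\partial f}{\partial\sigma_{kl}}\right)^{-1/2},$

$\dot e^p_{ij} = \frac{n_{ij}n_{kl}\dot\sigma_{kl}}{K_p},$

$de_{ij} = \left[D^{-1}_{ijkl} + \frac{n_{ij}n_{kl}}{K_p}\right]d\sigma_{kl},$

we can determine the plastic strain rate and the incremental stress-strain
relation.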

More complicated hardening rules involve translation, expansion and
distortion of the yield surface simultaneously. In Fig. 6.12:1(e), the
plastic deformation causes a linear segment to move. Figure 6.12:1(f) shows
that the loading surface changes with plastic loading in some interdependent
manner. Hodge$^{6.3}$ (1957) extended the kinematic hardening to include
simultaneous expansion of the yield surface. Budiansky$^{6.3}$ and
Kliushnikov$^{6.3}$ in 1959 considered the possibility of creating corners in
subsequent yield surfaces at the point of loading. They achieved a compromise
with the Hencky-Nadai "deformation" theory or the "total strain" theory.
See Hill,$^{6.2}$ Nadai$^{6.2}$ and papers by Budiansky, Kliushnikov, Naghdi,
Sanders, Phillips, etc., in Bibliography 6.2 and 6.4-6.6.


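Example of Combined Isotropic and Kinematic Hardening. For the
von Mises criterion, the yield surface is

$f = \tfrac{1}{2}(\sigma'_{ij} - \alpha'_{ij})(\sigma'_{ij} - \alpha'_{ij}) - \tfrac{1}{3}\sigma_Y^2 = 0,$

where $\sigma'_{ij}$ and $\alpha'_{ij}$ are the deviators of $\sigma_{ij}$ and
$\alpha_{ij}$, respectively. Then

$\frac{\partial f}{\partial\kappa}\frac{\partial\kappa}{\partial e^p_e} = -\frac{2\sigma_Y}{3}\frac{d\sigma_Y}{de^p_e}, \quad n_{ij} = \sqrt{\tfrac{3}{2}}\,\frac{\sigma'_{ij} - \alpha'_{ij}}{\sigma_Y},$

(26) $de^p_e = \sqrt{\tfrac{2}{3}}\,\frac{n_{ij}\,d\sigma_{ij}}{K_p} = \frac{(\sigma'_{ij} - \alpha'_{ij})\,d\sigma_{ij}}{K_p\sigma_Y}.$

Eq. (25) becomes

(27) $K_p = c - \gamma\frac{\alpha_{ij}(\sigma'_{ij} - \alpha'_{ij})}{\sigma_Y} + \frac{2}{3}\frac{d\sigma_Y}{de^p_e}.$

Finally we obtain the strain-stress relation for the von Mises criterion with
combination of isotropic and kinematic hardening,

$de_{ij} = \left[D^{-1}_{ijkl} + \frac{3}{2K_p}\frac{(\sigma'_{ij} - \alpha'_{ij})(\sigma'_{kl} - \alpha'_{kl})}{\sigma_Y^2}\right]d\sigma_{kl}.$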


We shall illustrate the determination of the yield stress $\sigma_Y$ (as a
function of $e^p_e$) and the material constants ($c$, $\gamma$) from data
measured from uniaxial tests for the von Mises criterion. For uniaxial
tension in the 1-direction, on the yield surface, we have

(28)
$de^p_{11} = de^p, \quad de^p_{22} = de^p_{33} = -\tfrac{1}{2}de^p, \quad de^p_e = |de^p|,$
$\alpha'_{11} = \tfrac{2}{3}\alpha, \quad \alpha'_{22} = \alpha'_{33} = -\tfrac{1}{3}\alpha, \quad \sigma_{11} = \sigma, \quad \sigma_{22} = \sigma_{33} = 0,$
$\sigma'_{11} - \alpha'_{11} = \tfrac{2}{3}(\sigma - \alpha) = \pm\tfrac{2}{3}\sigma_Y,$
$\sigma'_{22} - \alpha'_{22} = \sigma'_{33} - \alpha'_{33} = -\frac{\sigma - \alpha}{3} = \mp\frac{\sigma_Y}{3}.$

All other components of $e^p_{ij}$, $\sigma_{ij}$ and $\alpha_{ij}$ are zero.
The upper and lower signs in the last two equations of Eq. (28) correspond to
loading (tensile plastic flow) and reverse loading (compressive plastic
flow), respectively. The yield function can be simplified to become

$|\sigma - \alpha| = \sigma_Y.$

From Eqs. (21), (26) and (27), it follows that

(29) $d\alpha = \tfrac{3}{2}c\,de^p - \gamma\alpha|de^p|,$

(30) $K_p\,de^p = \tfrac{2}{3}d\sigma,$

(31) $\frac{d\sigma}{de^p} = \tfrac{3}{2}c \mp \gamma\alpha + \frac{d\sigma_Y}{de^p_e}.$

Equation (31) shows the different effect of the nonlinear term $\gamma\alpha$
on the tensile and compressive plastic flow. Since $\gamma > 0$, the reverse
plastic flow has a higher hardening modulus.

Equation (31) reduces to Prager's linear kinematic hardening if
$\gamma = (d\sigma_Y/de^p_e) = 0$. In this case, the yield function is simply

(32) $\sigma = \alpha \pm \sigma_Y = \tfrac{3}{2}c\,e^p \pm \sigma_Y.$

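In general, Eq. (29) can be integrated to give

(33) $\alpha(e^p) = \pm\frac{3c}{2\gamma} + \left(\alpha_0 \mp \frac{3c}{2\gamma}\right)e^{\mp\gamma(e^p - e^p_0)}.$

The yield condition becomes

(34) $\sigma = \alpha(e^p) \pm \sigma_Y = \pm\frac{3c}{2\gamma} + \left(\alpha_0 \mp \frac{3c}{2\gamma}\right)e^{\mp\gamma(e^p - e^p_0)} \pm \sigma_Y,$

where the upper and lower signs are, respectively, for tensile loading, in
which $e^p$ increases from $e^p_0$, and for compressive loading, in which
$e^p$ decreases from $e^p_0$, with $\alpha = \alpha_0$ at $e^p = e^p_0$.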
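The integrated form (33) can be compared against a direct step-by-step integration of Eq. (29), read here as $d\alpha = \tfrac{3}{2}c\,de^p - \gamma\alpha|de^p|$. The Python sketch below does this for a tensile branch followed by a reverse branch; the constants are hypothetical and the forward-Euler scheme is only one simple choice of integrator.

```python
import math

def alpha_closed(ep, ep0, alpha0, c, gamma, tensile):
    """Eq. (33): upper signs for tensile flow, lower signs for compressive flow."""
    sat = 1.5 * c / gamma                      # saturation value 3c / (2 gamma)
    if tensile:
        return sat + (alpha0 - sat) * math.exp(-gamma * (ep - ep0))
    return -sat + (alpha0 + sat) * math.exp(gamma * (ep - ep0))

def alpha_euler(ep0, ep1, alpha0, c, gamma, steps=20000):
    """Forward-Euler integration of Eq. (29) from plastic strain ep0 to ep1."""
    de = (ep1 - ep0) / steps
    alpha = alpha0
    for _ in range(steps):
        alpha += 1.5 * c * de - gamma * alpha * abs(de)
    return alpha

# Hypothetical constants (c in MPa, gamma dimensionless).
c, gamma = 5000.0, 50.0
a1 = alpha_closed(0.02, 0.0, 0.0, c, gamma, tensile=True)     # tension: 0 -> 0.02
a1_num = alpha_euler(0.0, 0.02, 0.0, c, gamma)
a2 = alpha_closed(0.01, 0.02, a1, c, gamma, tensile=False)    # reversal: 0.02 -> 0.01
a2_num = alpha_euler(0.02, 0.01, a1_num, c, gamma)
print(a1, a1_num, a2, a2_num)
```

The back stress is bounded by $\pm 3c/(2\gamma)$ on either branch, which is the feature that distinguishes this rule from Prager's unbounded linear translation.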



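Fig. 6.12:3. A schematic loading path for evaluation of $c$, $\gamma$ and $\sigma_Y$.

Equations (32) or (34) can be used to determine the material constants
($c$, $\gamma$) and the yield stress $\sigma_Y$ as a function of $e^p_e$.
Consider a load path OABCDE shown in Fig. 6.12:3, where unloading occurs at
points B and D, and yielding starts at points A, C and E. Equation (34) gives

(35) $\sigma_B = \alpha(e^p_1) + \sigma_Y(e^p_{e1}), \quad \sigma_E = \alpha(e^p_2) + \sigma_Y(e^p_{e2}),$

for the tensile loading curve EB,

(36) $\sigma_C = \alpha(e^p_1) - \sigma_Y(e^p_{e1}), \quad \sigma_D = \alpha(e^p_2) - \sigma_Y(e^p_{e2}),$

for the compressive loading curve CD, where $e^p_1$ and $e^p_2$ are the values
of the plastic strain along BC and DE, respectively, $e^p_{e1}$ and $e^p_{e2}$
are the corresponding effective plastic strains, and the subscripts B, C, D
and E denote the value of $\sigma$ at those points. Similarly from Eq. (33),
we find

(37) $\alpha(e^p_1) = \frac{3c}{2\gamma}\left(1 - e^{-\gamma e^p_1}\right),$

(38) $\alpha(e^p_2) = -\frac{3c}{2\gamma} + \left(\alpha(e^p_1) + \frac{3c}{2\gamma}\right)e^{\gamma(e^p_2 - e^p_1)} = -\frac{3c}{2\gamma} + \frac{3c}{2\gamma}\left(2 - e^{-\gamma e^p_1}\right)e^{\gamma(e^p_2 - e^p_1)}.$

The following relations are derived for $c$, $\gamma$ from Eqs. (35)-(38),

(39) $\frac{3c}{2\gamma}\left[1 - e^{-\gamma e^p_1}\right] = \tfrac{1}{2}(\sigma_B + \sigma_C),$

(40) $-\frac{3c}{2\gamma} + \frac{3c}{2\gamma}\left(2 - e^{-\gamma e^p_1}\right)e^{\gamma(e^p_2 - e^p_1)} = \tfrac{1}{2}(\sigma_D + \sigma_E).$

One then determines, from Eqs. (31) and (33) with $\alpha_0 = e^p_0 = 0$,

(41) $\frac{d\sigma_Y}{de^p_e} = \left.\frac{d\sigma}{de^p}\right|_{e^p = e^p_e} - \frac{3c}{2}e^{-\gamma e^p_e}$

as a function of $e^p_e$, where $d\sigma/de^p|_{e^p = e^p_e}$ is the slope of
the measured curve AB. Recall that $e^p$ is the plastic strain measured in the
uniaxial tension test. Equation (41) is used to evaluate $K_p$ in Eq. (25) or
Eq. (27) for multi-axial loading analyses.

In the derivation $c$ and $\gamma$ are assumed to be constant. In the more
general formulation, $c$ and $\gamma$ can be functions of plastic deformation.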
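One possible way to extract $c$ and $\gamma$ from the four measured stresses is sketched below in Python. It reads Eqs. (39) and (40) as $\alpha(e^p_1) = \tfrac{1}{2}(\sigma_B + \sigma_C)$ and $\alpha(e^p_2) = \tfrac{1}{2}(\sigma_D + \sigma_E)$, eliminates $c$ by taking their ratio, and solves the remaining scalar equation for $\gamma$ by bisection. The function names, the bracket `[lo, hi]`, and the synthetic "measurements" (generated with a constant $\sigma_Y$ for simplicity) are all assumptions of this sketch; the bisection also presumes that the ratio crosses the target only once inside the bracket.

```python
import math

def backstress_ratio(gamma, e1, e2):
    """alpha(e2) / alpha(e1) from Eqs. (37)-(38); independent of c."""
    x = math.exp(-gamma * e1)
    y = math.exp(gamma * (e2 - e1))
    return ((2.0 - x) * y - 1.0) / (1.0 - x)

def identify_c_gamma(sig_B, sig_C, sig_D, sig_E, e1, e2, lo=1e-3, hi=1e3):
    """Solve Eqs. (39)-(40) for (c, gamma) by bisection on gamma."""
    A1 = 0.5 * (sig_B + sig_C)     # alpha(e1), Eq. (39)
    A2 = 0.5 * (sig_D + sig_E)     # alpha(e2), Eq. (40)
    target = A2 / A1

    def h(g):
        return backstress_ratio(g, e1, e2) - target

    if h(lo) * h(hi) > 0.0:
        raise ValueError("gamma is not bracketed; widen [lo, hi]")
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if h(lo) * h(mid) <= 0.0:
            hi = mid
        else:
            lo = mid
    gamma = 0.5 * (lo + hi)
    c = 2.0 * gamma * A1 / (3.0 * (1.0 - math.exp(-gamma * e1)))
    return c, gamma

# Synthetic data generated from known (hypothetical) constants.
c_true, gamma_true, sigma_Y = 5000.0, 50.0, 250.0
e1, e2 = 0.02, 0.01
sat = 1.5 * c_true / gamma_true
alpha1 = sat * (1.0 - math.exp(-gamma_true * e1))
alpha2 = -sat + (alpha1 + sat) * math.exp(gamma_true * (e2 - e1))
sig_B, sig_C = alpha1 + sigma_Y, alpha1 - sigma_Y
sig_D, sig_E = alpha2 - sigma_Y, alpha2 + sigma_Y

c_fit, gamma_fit = identify_c_gamma(sig_B, sig_C, sig_D, sig_E, e1, e2)
print(c_fit, gamma_fit)
```

Note that $\sigma_Y$ drops out of the sums $\sigma_B + \sigma_C$ and $\sigma_D + \sigma_E$, so the kinematic constants can be identified before the isotropic part is evaluated from Eq. (41).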


6.13. MROZ’S, DAFALIAS AND POPOV’S, AND VALANIS’ 
PLASTICITY THEORIES 

Hardening is one of the most important mechanical properties in
plasticity. The theories in Secs. 6.3-6.12 characterize such behavior by the
evolution of hardening parameters (e.g., $\kappa$ and $\alpha_{ij}$). The
models discussed include the isotropic hardening and three types of kinematic
hardening (Prager, Ziegler, and Armstrong and Frederick), which are among the
simplest plasticity laws regarding this behavior. These models offer a
reasonable description of hardening properties for monotonic proportional
loading without unloading. The material behaviors under cyclic loading are
much too complex to be modeled by these hardening rules. Based on
experimental evidence, Drucker and Palgen (1981) identified five basic
features of cyclic plasticity:

(1) Plastic strain accumulates in the direction of mean stress for stress
cycles (the cyclic creep or ratcheting effect).

(2) The mean stress progressively relaxes to zero for strain cycles with
nonzero mean.

(3) The elastic-plastic transition is usually smooth.

(4) Material hardens or softens toward a stabilized state with kinematic
hardening only.

(5) Extensive plastic loading eliminates most, if not all, of the past
effects.

It can be shown, under strain cycles, that isotropic hardening will lead
to an elastic state eventually and thus is not capable of simulating the
hysteresis loop. Prager's and Ziegler's kinematic hardening models cannot
simulate the ratcheting effect.

Recent developments in plasticity theory have focused on the modeling
of cyclic loading reversal. We shall discuss some of these developments in
the following three subsections.

Mroz's Multisurface Model. Mroz$^{6.6}$ (1969) introduced a multi-surface
model to describe the cyclic effect and the nonlinearity of stress-strain
loops. The model uses a number of yield surfaces

(1) $f_i(\sigma_{kl} - \alpha^{(i)}_{kl}, \kappa_i) = 0, \quad i = 0, 1, 2, \ldots, n,$

to simulate nonlinear hardening and smooth transition from elastic to
plastic deformation. Each of the yield surfaces has its unique plastic
modulus $K_i$ and back stress $\alpha^{(i)}_{ij}$ to represent a combination
of linear isotropic hardening and linear kinematic hardening. For a virgin
material, the yield surfaces are concentric and similar but of different
size. Mroz's model approximates the nonlinear stress-strain curve of plastic
deformation by a piecewise linear approximation. During loading $f_0$ yields
first. As loading continues, $f_0$ expands and translates while other
surfaces remain stationary. When $f_0$ touches $f_1$ at a point A, $f_0$ and
$f_1$ will move and deform in unison for further loading if the load point
stays at point A. The plastic deformation continues until $f_1$ touches $f_2$
at the same contact point or the load point moves away from the contact
point. When $f_0$, $f_1$, and $f_2$ are in contact, they will move together
under further loading if the load point remains at A. This process continues.
If the load point moves away from A but remains on the surface $f_0 = 0$,
then $f_0$ may move away from $f_1$ for further loading. In this case $f_1$,
$f_2$, ... will remain stationary until $f_0$ touches $f_1$ again. Then the
process will proceed as just described. It is noted that upon unloading,
the loading vector moves away or is already away from $f_0$, and all yield
surfaces remain stationary. This means that unloading occurs simultaneously
for all yield surfaces and that the contacted surfaces will remain in
contact. During reloading $f_0$ will yield first again and the process will
continue as described before.

With a sufficient number of yield surfaces, Mroz's model can describe
nonlinear stress-strain loops, the Bauschinger effect, and cyclic hardening
or softening of the material. The model requires extensive computational
effort in implementation. For more details interested readers are referred
to Mroz (1969, 1976), Prevost (1978), Mroz et al. (1979).
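For monotonic uniaxial loading of a virgin material, the piecewise linear character of the multisurface idea can be sketched in a few lines of Python. This is a deliberately reduced illustration, not Mroz's full model: it tracks no surface translation or unloading, `sizes` stands for the uniaxial stress levels at which successive surfaces are reached, and `moduli` are hypothetical uniaxial tangent moduli $d\sigma/de^p$ active between consecutive levels.

```python
def mroz_uniaxial_plastic_strain(sigma, sizes, moduli):
    """Plastic strain at stress sigma for monotonic uniaxial loading.

    sizes  : increasing stress levels at which surfaces f_0, f_1, ... are reached
    moduli : tangent modulus d(sigma)/d(e^p) active once the matching surface yields
    """
    ep = 0.0
    for i, (size, modulus) in enumerate(zip(sizes, moduli)):
        if sigma <= size:
            break                              # this surface has not been reached
        upper = sizes[i + 1] if i + 1 < len(sizes) else float("inf")
        ep += (min(sigma, upper) - size) / modulus
    return ep

# Hypothetical three-surface description (MPa).
sizes = [200.0, 300.0, 400.0]
moduli = [10000.0, 5000.0, 2000.0]
for s in (150.0, 350.0, 450.0):
    print(s, mroz_uniaxial_plastic_strain(s, sizes, moduli))
```

Adding more surfaces refines the piecewise linear curve toward a smooth one, at the cost of more state to track in a full implementation.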


Dafalias and Popov's Two-Surface Model. Mroz's model cannot simulate
ratcheting and mean stress relaxation in strain cyclic loading with
nonzero mean. Uniaxial experimental data indicate that, even under complex
loading, the stress-strain curves tend to converge with definite bounding
lines. This suggests the existence of bounding surface(s). The plastic
modulus $E_p\,(= d\sigma/de^p)$ depends on the distance from the stress-strain
curve to the bounding line as well as the recent loading history. Dafalias
and Popov$^{6.6}$ (1975) proposed a two-surface model to account for these
effects. One surface is similar to the conventional yield surface and the
other is a bounding surface. For uniaxial loading, the yield function $f$ and
bounding function $\bar f$ are in the following forms

(2) $f = (\sigma - \alpha)^2 - \sigma_Y^2 = 0, \quad \bar f = (\bar\sigma - \bar\alpha)^2 - \bar\sigma_Y^2 = 0,$

where $\sigma$ and $\bar\sigma$ are the stresses on $f$ and $\bar f$, $\alpha$
and $\bar\alpha$ their centers, while $\sigma_Y$ and $\bar\sigma_Y$ their
respective sizes. Their incremental relations are


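(3) $de^p = \frac{d\sigma}{K_p}$ (flow rule),

(4) $d\bar\sigma = \bar K_p\,de^p = \frac{\bar K_p}{K_p}\,d\sigma,$

(5a) $d\alpha = K_\alpha\,de^p = \frac{K_\alpha}{K_p}\,d\sigma$ (kinematic hardening),

(5b) $d\bar\alpha = \bar K_\alpha\,de^p = \frac{\bar K_\alpha}{K_p}\,d\sigma.$

Using the consistency equations $df = 0$ and $d\bar f = 0$, we obtain

(6) $K_\alpha = K_p - \frac{d\sigma_Y}{de^p}, \quad \bar K_\alpha = \bar K_p - \frac{d\bar\sigma_Y}{de^p}.$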

Dafalias and Popov$^{6.6}$ assumed that $K_p$ varies continuously, as opposed
to the piecewise-constant plastic moduli in Mroz's model, in the form

(7) $K_p = K_p(\delta_{in}, \delta),$

where $\delta = [(\bar\sigma - \sigma)(\bar\sigma - \sigma)]^{1/2} = |\bar\sigma - \sigma|$
is the distance between the present state $\sigma$ (on the yield surface) and
the stress state $\bar\sigma$ on the bounding surface with the same outward
normal. The quantity $\delta_{in}$ is the value of $\delta$ at the initiation
of a new loading process and measures how far the material state is from the
state represented by the bounds. This is used to reflect the effect of
loading history. To specify $K_p$, Dafalias and Popov$^{6.6}$ (1976)
proposed that

(8) = + 

where $a$ and $b$ are constants and $\bar K_p$ is the limiting value at the
bounds, defined as $d\bar\sigma/de^p$. These quantities are determined
experimentally. Note that $K_p(\delta_{in}, \delta_{in}) = \infty$ is
specified to give a smooth elastic-plastic transition. We can now calculate
$de^p$, $d\alpha$ and $d\bar\alpha$ for given $d\sigma$ if the isotropic
hardening characteristics $d\sigma_Y/de^p$ and $d\bar\sigma_Y/de^p$ are known.

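To generalize into the multiaxial case, the yield surface $f$ and the
bounding surface $\bar f$ are written in the following forms:

(9) $f(\sigma_{ij} - \alpha_{ij}, \kappa) = F(\sigma_{ij} - \alpha_{ij}) - \kappa = 0,$

(10) $\bar f(\bar\sigma_{ij} - \bar\alpha_{ij}, \bar\kappa) = \bar F(\bar\sigma_{ij} - \bar\alpha_{ij}) - \bar\kappa = 0,$

where $\bar\sigma_{ij}$ is a stress on the bounding surface corresponding to
the stress state $\sigma_{ij}$ on the yield surface. When unloading occurs,
$\sigma_{ij}$ and $\bar\sigma_{ij}$ move away from the yield and bounding
surfaces in unison. If $f$ and $\bar f$ are similar, then
$\bar\sigma_{ij} - \bar\alpha_{ij} = \eta(\sigma_{ij} - \alpha_{ij})$ with a
scalar factor $\eta$. The flow rules are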

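(11) $de^p_{ij} = \frac{n_{ij}n_{kl}\,d\sigma_{kl}}{K_p} = \frac{n_{ij}n_{kl}\,d\bar\sigma_{kl}}{\bar K_p},$

which gives the relation between $d\sigma_{ij}$ and $d\bar\sigma_{ij}$. The
model assumes that

$d\alpha_{ij} - d\bar\alpha_{ij} = d\mu(\bar\sigma_{ij} - \sigma_{ij}), \quad n_{ij} \propto \frac{\partial f}{\partial\sigma_{ij}} \propto \frac{\partial\bar f}{\partial\bar\sigma_{ij}}.$

Using Eq. (8), the flow rules Eq. (11), and the consistency equations
derived from Eqs. (9) and (10), we can express $d\bar\sigma_{ij}$, $d\mu$,
$d\alpha_{ij}$, $d\bar\alpha_{ij}$ in terms of $d\sigma_{ij}$, $K_p$,
$d\kappa/de^p$ and $d\bar\kappa/de^p$. The last three quantities are usually
determined by experiments. In Eq. (8), $\delta$ is defined as

$\delta = [(\bar\sigma_{ij} - \sigma_{ij})(\bar\sigma_{ij} - \sigma_{ij})]^{1/2}.$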

The two-surface model is smooth at the elastic-plastic transition point,
but produces an inconsistency in the uniaxial load-unload-reload situation
if the unloading step is very small (Chaboche 1986). This model, like the
multiple-surface model, requires extensive computational effort in
implementation. Interested readers are referred to papers by Dafalias and
Popov (1975, 1976), Dafalias (1984), and Chaboche (1986).


Valanis's Endochronic Theory. The plasticity theories discussed so far
are based on the concept of yield surface, which divides the elastic and
plastic domains. It implies a sharp demarcation between the two domains. As
pointed out before, many materials do not exhibit a sharp yield point. Some
materials such as aluminum and stainless steel have a nonlinear uniaxial
stress-strain curve almost from the start. Valanis$^{6.6}$ (1971, 1980)
proposed a plasticity theory in a convolution integral form for infinitesimal
deformation. The concept is that the present state of the material depends on
the present values and the past history of observable variables, giving rise
to hereditary theories. The theory is called the endochronic theory, which
assumes

(12) $\sigma'_{ij} = 2G\int_0^z \rho(z - z')\frac{de^p_{ij}}{dz'}\,dz',$

in which the yield surface is not defined a priori but is rather a derivable
result of the theory. The kernel function $\rho(z)$ is a material function
and $z$ is the intrinsic time defined as

(13) $dz = \frac{d\zeta}{f(\zeta)},$

where $f(\zeta)$ is a nonnegative function, called the intrinsic time scale,
with $f(0) = 1$ and $\zeta$ defined as

(14) $d\zeta = \sqrt{de^p_{ij}\,de^p_{ij}}.$

Let

(15) $\rho(z) = \rho_0\delta(z) + \rho_1(z),$

where $\delta(z)$ is the Dirac delta function, $\rho_0$ is a material
constant and $\rho_1(z)$ is a nonsingular function. A substitution of
Eqs. (13) and (15) into Eq. (12) gives

(16) $\sigma'_{ij} = S_Y^0\frac{de^p_{ij}}{d\zeta}f(\zeta) + \alpha_{ij},$

where

(17) $S_Y^0 = 2G\rho_0,$

(18) $\alpha_{ij} = 2G\int_0^z \rho_1(z - z')\frac{de^p_{ij}}{dz'}\,dz'.$

The assumption of the kernel function $\rho(z)$ in the form of Eq. (15)
effectively divides the constitutive equation, Eq. (16), into two parts: the
part represented by the first term on the right-hand side of the equation
depends on the present values of observable variables, such as $S_Y^0$, and
a set of internal-state variables, which characterize both $S_Y^0$ and
$de^p_{ij}/dz$. The second term $\alpha_{ij}$ is equivalent to the back
stress introduced previously, which depends on the past histories of these
variables.

From Eqs. (14) and (16), we obtain

(19) $(\sigma'_{ij} - \alpha_{ij})(\sigma'_{ij} - \alpha_{ij}) - [S_Y^0 f(\zeta)]^2 = 0,$

(20) $de^p_{ij} = \frac{\sigma'_{ij} - \alpha_{ij}}{S_Y^0 f(\zeta)}\,d\zeta.$

Note that

(21) $(\sigma'_{ij} - \alpha_{ij})\,de^p_{ij} = S_Y^0 f(\zeta)\,d\zeta > 0.$

Equations (19) and (20) correspond to the yield function and the flow
rule, respectively. Equation (19) shows the characteristics of combined
isotropic and kinematic hardening. If $f(\zeta) = 1$, Eq. (19) corresponds to
the von Mises yield surface. In this case, the plastic strain increment
$de^p_{ij}$ is normal to the yield surface.

Different plasticity theories can be derived through different choices of
the kernel function $\rho_1(z)$ and the intrinsic time scale $f(\zeta)$.
Valanis (1980) proved that the general form for $\rho_1(z)$ is given by

(22) $\rho_1(z) = \sum_{i=1}\rho_{1i}\,e^{-\xi_i z},$

where $\rho_{1i}$, $\xi_i$ are material constants to be determined
experimentally. Two commonly used forms of $f(\zeta)$ for isotropic hardening
are

(23) $f(\zeta) = 1 + \gamma\zeta,$

and

(24) $f(\zeta) = a + (1 - a)e^{-\beta\zeta},$

where $a$, $\beta$, $\gamma$ are material constants. The second form reaches
a limit asymptotically as $\zeta$ increases and is therefore called the
saturated form.
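To see how the two intrinsic time scales stretch the intrinsic time, one can integrate $dz = d\zeta/f(\zeta)$ numerically. This relation between $z$ and $\zeta$ is the usual endochronic definition and is taken here as an assumption; the closed-form expressions used for comparison are worked out for this sketch and are not quoted from the text, and the parameter values are hypothetical.

```python
import math

def f_linear(zeta, gamma):
    """Linear intrinsic time scale, f = 1 + gamma zeta."""
    return 1.0 + gamma * zeta

def f_saturated(zeta, a, beta):
    """Saturated intrinsic time scale, f = a + (1 - a) exp(-beta zeta)."""
    return a + (1.0 - a) * math.exp(-beta * zeta)

def intrinsic_time(zeta, f, steps=2000):
    """z(zeta) = integral from 0 to zeta of d(zeta') / f(zeta'), midpoint rule."""
    h = zeta / steps
    return sum(h / f((i + 0.5) * h) for i in range(steps))

# Hypothetical constants.
gamma, a, beta, zeta = 10.0, 2.0, 30.0, 0.1

z_lin = intrinsic_time(zeta, lambda s: f_linear(s, gamma))
z_lin_closed = math.log(1.0 + gamma * zeta) / gamma
z_sat = intrinsic_time(zeta, lambda s: f_saturated(s, a, beta))
z_sat_closed = math.log(a * math.exp(beta * zeta) + 1.0 - a) / (a * beta)
print(z_lin, z_lin_closed, z_sat, z_sat_closed)
```

Both forms satisfy $f(0) = 1$; with $a > 1$ the saturated form levels off at $f = a$, so the intrinsic time eventually grows in proportion to $\zeta$.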
Many features of the endochronic model are described in detail in Watanabe
and Atluri$^{6.6}$ (1986).

6.14. STRAIN SPACE FORMULATIONS

We have thus far discussed the plasticity theories formulated in the
stress space only. This approach has two inherent disadvantages: for ideal
plastic materials, a different approach from that for hardening materials
is needed to derive the incremental stress-strain relation; and for softening
materials, confusion can arise between further plastic loading and elastic
unloading (Naghdi and Trapp$^{6.7}$ 1975a, 1975b). Parallel to the stress
space approach discussed in Sec. 6.11, Il'iushin$^{6.7}$ (1961) introduced a
strain space formulation to eliminate these drawbacks. Similar to Drucker's
postulate, Il'iushin hypothesized that the work done by external forces on a
material over a closed strain cycle is nonnegative. In other words, plastic
deformation occurs during a closed strain cycle if the work of external
forces over the strain cycle is positive. A closed strain cycle is defined as
a closed path in the strain space in which the strain starts from an
equilibrium and compatible state $e_0$, experiences a change due to external
loading, and then reverts to the original $e_0$. Under Il'iushin's
hypothesis, it can be shown that the work done by external forces over a
closed strain cycle (the area of ABCDE) is larger than or equal to that done
over a closed stress cycle (the area of ABCD). See Fig. 6.14:1.



Fig. 6.14:1. Closed strain and stress cycles. 


Let $\sigma^p_{ij}$ denote the plastic stress defined in

(1) $\sigma_{ij} = D_{ijkl}e_{kl} - \sigma^p_{ij} = D_{ijkl}(e_{kl} - e^p_{kl}),$

where $\sigma_{ij}$, $e_{kl}$, $e^p_{kl}$ are the stress, total strain, and
plastic strain tensors, respectively, and $D_{ijkl}$ is the fourth-order
elastic modulus tensor. Then the plastic stress tensor is defined in terms of
the plastic strain tensor $e^p_{kl}$ in the form

$\sigma^p_{ij} = D_{ijkl}e^p_{kl},$

or in an incremental form

(2) $d\sigma^p_{ij} = D_{ijkl}\,de^p_{kl},$

if the elastic modulus tensor remains constant in the course of plastic
deformation. Based on his hypothesis, Il'iushin showed that

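(3) $d\sigma^p_{ij} = \eta\frac{\partial f}{\partial e_{ij}}$ and $d\sigma^p_{ij} = \Lambda\bar n_{ij},$

where $f(e, \sigma^p, \alpha_e, \kappa) = 0$ is the yield surface, $\eta$,
$\Lambda$ are positive scalar factors and $\bar n_{ij}$ is a unit normal to
the yield surface defined as

(4) $\bar n_{ij} = \frac{\partial f}{\partial e_{ij}}\left(\frac{\partial f}{\partial e_{kl}}\frac{\partial f}{\partial e_{kl}}\right)^{-1/2}.$

$\alpha_e$ is called the back strain tensor governing kinematic hardening,
while $\kappa$ is a parameter associated with isotropic hardening. Equation
(3) is the flow rule in the strain space formulation, equivalent to
Eq. (6.11:5) in the stress space formulation.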

The yield function forms a closed surface in strain space enclosing an
elastic region with the following features.

• No change in plastic deformation occurs as long as $f < 0$.

• Change in plastic deformation occurs when $f = 0$.

• No meaning is associated with $f > 0$.

Yield surfaces in stress space are more readily available. We can obtain
the yield surfaces in the strain space by substituting Eq. (1) into the yield
functions in the stress space:

(5) $f(\sigma, \alpha, \kappa) = f(D_{ijkl}e_{kl} - \sigma^p_{ij}, \alpha_{ij}, \kappa) = f(e, \sigma^p, \alpha_e, \kappa) = 0,$

where $(\alpha_e)_{ij} = D^{-1}_{ijkl}\alpha_{kl}$, and $\alpha_e$ and $\kappa$
can be functions of $\sigma^p$. For linear kinematic hardening with
$g_{ijkl}$ being a 4th-order tensor,

(6) $d(\alpha_e)_{ij} = g_{ijkl}\,d\sigma^p_{kl}.$


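If $\kappa$ is a function of the equivalent plastic stress $\sigma^p_e$ with
increment

(7) $d\sigma^p_e = \sqrt{\tfrac{2}{3}\,d\sigma^p_{ij}\,d\sigma^p_{ij}} = \Lambda\sqrt{\tfrac{2}{3}},$

then

(8) $d\kappa = \frac{\partial\kappa}{\partial\sigma^p_e}\,d\sigma^p_e.$

If $g_{ijkl} = c\,D^{-1}_{ijpq}D^{-1}_{pqkl}$, where
$D^{-1}_{ijrs}D_{rskl} = (\delta_{ik}\delta_{jl} + \delta_{il}\delta_{jk})/2$,
that is, $D^{-1}$ is the inverse of $D$, Eq. (6) is Prager's model (6.12:16).
For isotropic materials

$D_{ijkl} = 2G\left(\frac{\delta_{ik}\delta_{jl} + \delta_{il}\delta_{jk}}{2} + \frac{\nu}{1 - 2\nu}\delta_{ij}\delta_{kl}\right), \quad D^{-1}_{ijkl} = \frac{1}{2G}\left(\frac{\delta_{ik}\delta_{jl} + \delta_{il}\delta_{jk}}{2} - \frac{\nu}{1 + \nu}\delta_{ij}\delta_{kl}\right),$

$g_{ijkl} = \frac{c}{4G^2}\left(\frac{\delta_{ik}\delta_{jl} + \delta_{il}\delta_{jk}}{2} - \frac{\nu(2 - \nu)}{(1 + \nu)^2}\delta_{ij}\delta_{kl}\right), \quad d(\alpha_e)_{ij} = \frac{1}{2G}\,d\alpha_{ij} = \frac{c}{4G^2}\,d\sigma^p_{ij}.$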
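The isotropic tensors can be checked by brute force over all index combinations. The Python sketch below assumes the standard isotropic forms $D_{ijkl} = 2G[I^s_{ijkl} + \nu/(1 - 2\nu)\,\delta_{ij}\delta_{kl}]$ and $D^{-1}_{ijkl} = [I^s_{ijkl} - \nu/(1 + \nu)\,\delta_{ij}\delta_{kl}]/(2G)$, with $I^s$ the symmetric identity, and compares $c\,D^{-1}D^{-1}$ with a closed form worked out for this sketch; the function names and material values are hypothetical.

```python
def delta(i, j):
    return 1.0 if i == j else 0.0

def sym_identity(i, j, k, l):
    return 0.5 * (delta(i, k) * delta(j, l) + delta(i, l) * delta(j, k))

def D(i, j, k, l, G, nu):
    """Isotropic elastic modulus tensor."""
    return 2.0 * G * (sym_identity(i, j, k, l) + nu / (1.0 - 2.0 * nu) * delta(i, j) * delta(k, l))

def D_inv(i, j, k, l, G, nu):
    """Isotropic compliance tensor (inverse of D on symmetric tensors)."""
    return (sym_identity(i, j, k, l) - nu / (1.0 + nu) * delta(i, j) * delta(k, l)) / (2.0 * G)

def contract(A, B):
    """Return the function (i, j, k, l) -> A_ijrs B_rskl."""
    R = range(3)
    return lambda i, j, k, l: sum(A(i, j, r, s) * B(r, s, k, l) for r in R for s in R)

G, nu, c = 80000.0, 0.3, 3000.0            # hypothetical values
Dt = lambda i, j, k, l: D(i, j, k, l, G, nu)
Di = lambda i, j, k, l: D_inv(i, j, k, l, G, nu)
prod = contract(Di, Dt)                     # should equal the symmetric identity
DiDi = contract(Di, Di)
g = lambda i, j, k, l: c * DiDi(i, j, k, l)
g_closed = lambda i, j, k, l: c / (4.0 * G * G) * (
    sym_identity(i, j, k, l) - nu * (2.0 - nu) / (1.0 + nu) ** 2 * delta(i, j) * delta(k, l))

R = range(3)
idx = [(i, j, k, l) for i in R for j in R for k in R for l in R]
max_err_inverse = max(abs(prod(*t) - sym_identity(*t)) for t in idx)
max_err_g = max(abs(g(*t) - g_closed(*t)) for t in idx)
print(max_err_inverse, max_err_g)
```

Looping over all 81 index combinations is slow compared with array libraries, but it keeps the index notation of the text visible in the code.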


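The consistency equation can be derived from Eq. (5), i.e.,

(9) $df = \frac{\partial f}{\partial e_{ij}}de_{ij} + \frac{\partial f}{\partial\sigma^p_{ij}}d\sigma^p_{ij} + \frac{\partial f}{\partial(\alpha_e)_{ij}}d(\alpha_e)_{ij} + \frac{\partial f}{\partial\kappa}d\kappa = 0.$

A substitution of Eqs. (6)-(8) into Eq. (9) yields

(10) $\Lambda = -\frac{\partial f}{\partial e_{kl}}de_{kl}\left(\frac{\partial f}{\partial\sigma^p_{ij}}\bar n_{ij} + g_{ijkl}\frac{\partial f}{\partial(\alpha_e)_{ij}}\bar n_{kl} + \frac{\partial f}{\partial\kappa}\frac{\partial\kappa}{\partial\sigma^p_e}\sqrt{\tfrac{2}{3}}\right)^{-1}.$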

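The flow rule Eq. (3), together with Eq. (2), can now be written as

$d\sigma^p_{ij} = \Lambda\bar n_{ij}$ or $de^p_{ij} = D^{-1}_{ijrs}\,\Lambda\bar n_{rs},$

with $\Lambda$ given by Eq. (10). The incremental stress-strain relation
becomes

$d\sigma_{ij} = D_{ijkl}\,de_{kl} - \Lambda\bar n_{ij}.$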

Using the yield surface Eq. (5) and noting that

$\frac{\partial f}{\partial e_{ij}} = D_{ijkl}\frac{\partial f}{\partial\sigma_{kl}}, \quad \frac{\partial f}{\partial\sigma^p_{ij}} = -\frac{\partial f}{\partial\sigma_{ij}}, \quad \frac{\partial f}{\partial(\alpha_e)_{ij}} = D_{ijkl}\frac{\partial f}{\partial\alpha_{kl}},$

we can express quantities in strain space in terms of quantities in stress
space:

(14) $\bar n_{ij} = \frac{D_{ijkl}n_{kl}}{(D_{rspq}n_{pq}D_{rsuv}n_{uv})^{1/2}}$ (unit normal to yield surface),

(15) $\Lambda = \frac{D_{ijrs}n_{rs}\,de_{ij}\sqrt{D_{abkl}n_{kl}D_{abuv}n_{uv}}}{L^p}$ (scalar factor),

(16) $d\sigma^p_{ij} = \frac{D_{ijrs}n_{rs}D_{kluv}n_{uv}\,de_{kl}}{L^p}$ (incremental plastic stress: flow rule),

(17) $d\sigma_{ij} = \left(D_{ijkl} - \frac{D_{ijuv}n_{uv}D_{klrs}n_{rs}}{L^p}\right)de_{kl} = D^{ep}_{ijkl}\,de_{kl}$ (incremental stress-strain relation),

where

(18) $L^p = D_{ijkl}n_{ij}n_{kl} + K_p,$

(19) $K_p = n_{ij}D_{ijkl}g_{klrs}D_{rsuv}n_{uv} - \sqrt{\tfrac{2}{3}}\,\frac{\partial f}{\partial\kappa}\frac{\partial\kappa}{\partial e^p_e}\left(\frac{\partial f}{\partial\sigma_{ij}}\frac{\partial f}{\partial\sigma_{ij}}\right)^{-1/2},$

where $n_{ij}$ is a unit normal to the yield surface in stress space defined
in Eq. (6.11:4) and $d\sigma^p_e = de^p_e\sqrt{D_{abkl}n_{kl}D_{abuv}n_{uv}}$.
For isotropic materials,


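$\Lambda = \frac{4G^2 n_{ij}\,de_{ij}}{L^p},$

$d\sigma_{ij} = \lambda\,de_{kk}\delta_{ij} + 2G\left(de_{ij} - \frac{2G\,n_{ij}n_{kl}\,de_{kl}}{L^p}\right),$

$L^p = 2G + K_p,$

$K_p = c - \sqrt{\tfrac{2}{3}}\,\frac{\partial f}{\partial\kappa}\frac{\partial\kappa}{\partial e^p_e}\left(\frac{\partial f}{\partial\sigma_{ij}}\frac{\partial f}{\partial\sigma_{ij}}\right)^{-1/2}.$

It can be shown that $K_p$ has the same definition as Eq. (6.11:13) in the
stress space formulation. For ideal plasticity, $K_p = 0$ causes the
breakdown of the flow rule in the stress space formulation. Since $L^p$ is
nonzero, there is no such problem in the strain space formulation.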
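The point about ideal plasticity can be illustrated numerically. The Python sketch below evaluates the isotropic strain-space relation, read here as $d\sigma_{ij} = \lambda\,de_{kk}\delta_{ij} + 2G\,(de_{ij} - 2G\,n_{ij}n_{kl}\,de_{kl}/L^p)$ with $L^p = 2G + K_p$, for a von Mises material under plastic loading ($n_{kl}\,de_{kl} > 0$). The function names, the stress state, the strain increment and the elastic constants are hypothetical choices for illustration.

```python
import math

def ddot(a, b):
    """Double contraction a_ij b_ij of two 3x3 tensors stored as nested lists."""
    return sum(a[i][j] * b[i][j] for i in range(3) for j in range(3))

def unit_normal_von_mises(sigma):
    """Unit normal n_ij to the von Mises surface: the normalized stress deviator."""
    mean = (sigma[0][0] + sigma[1][1] + sigma[2][2]) / 3.0
    s = [[sigma[i][j] - (mean if i == j else 0.0) for j in range(3)] for i in range(3)]
    norm = math.sqrt(ddot(s, s))
    return [[s[i][j] / norm for j in range(3)] for i in range(3)]

def stress_increment(de, n, G, lam, Kp):
    """Stress increment from a strain increment, with L^p = 2G + Kp."""
    Lp = 2.0 * G + Kp
    n_de = ddot(n, de)
    tr = de[0][0] + de[1][1] + de[2][2]
    return [[(lam * tr if i == j else 0.0)
             + 2.0 * G * (de[i][j] - 2.0 * G * n[i][j] * n_de / Lp)
             for j in range(3)] for i in range(3)]

G, lam = 80000.0, 120000.0                       # hypothetical Lame constants (MPa)
sigma = [[250.0, 0.0, 0.0], [0.0, 0.0, 0.0], [0.0, 0.0, 0.0]]
de = [[1.0e-4, 2.0e-5, 0.0], [2.0e-5, -3.0e-5, 0.0], [0.0, 0.0, -3.0e-5]]

n = unit_normal_von_mises(sigma)
dsig_ideal = stress_increment(de, n, G, lam, Kp=0.0)      # ideal plasticity
dsig_hard = stress_increment(de, n, G, lam, Kp=2000.0)    # hardening
print(ddot(n, dsig_ideal), ddot(n, dsig_hard))
```

With $K_p = 0$ the stress increment has no component along the normal, so the stress point slides along the fixed yield surface, and the formula remains well defined because the denominator is $L^p = 2G$ rather than $K_p$.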


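Example. For isotropic von Mises materials, the yield surface in stress
space is

(20) $f = \tfrac{1}{2}(\sigma'_{ij} - \alpha_{ij})(\sigma'_{ij} - \alpha_{ij}) - \tfrac{1}{3}\sigma_Y^2 = 0,$

with $\sigma'_{ij}$ being the deviatoric stress and $\alpha_{ij} = c\,e^p_{ij}$
for Prager's linear kinematic hardening model defined by Eq. (6.12:16). For
infinitesimal deformation

$\sigma'_{ij} = 2G(e'_{ij} - e^p_{ij}) = 2G e'_{ij} - \sigma^p_{ij},$

where $e'_{ij}$ is the deviatoric strain, the corresponding yield surface in
strain space can be written as

(21) $f(e'_{ij} - \bar\alpha_{ij}, \kappa) = f(\sigma'_{ij} - \alpha_{ij}, \kappa) = 2G^2(e'_{kl} - \bar\alpha_{kl})(e'_{kl} - \bar\alpha_{kl}) - \tfrac{1}{3}\sigma_Y^2 = 0,$

where $\alpha_{kk} = \sigma^p_{kk} = 0$ and

$\bar\alpha_{ij} = \frac{\alpha_{ij} + \sigma^p_{ij}}{2G} = \left(1 + \frac{c}{2G}\right)\frac{\sigma^p_{ij}}{2G} = \left(1 + \frac{c}{2G}\right)e^p_{ij}.$

Note that the material exhibits linear kinematic hardening in strain space
even if there is no kinematic hardening in the stress space, i.e.,
$\alpha_{kl} = 0$. Then

$\frac{\partial f}{\partial\sigma_{ij}}\frac{\partial f}{\partial\sigma_{ij}} = \tfrac{2}{3}\sigma_Y^2, \quad \frac{\partial f}{\partial\kappa}\frac{\partial\kappa}{\partial e^p_e} = -\tfrac{2}{3}\sigma_Y\frac{d\sigma_Y}{de^p_e},$

$\frac{\partial f}{\partial e_{ij}} = 4G^2(e'_{ij} - \bar\alpha_{ij}), \quad \bar n_{ij} = \frac{\sqrt{6}\,G}{\sigma_Y}(e'_{ij} - \bar\alpha_{ij}),$

$d\sigma_{ij} = 2G\,de_{ij} + \lambda\,de_{kk}\delta_{ij} - \frac{24G^4}{L^p\sigma_Y^2}(e'_{ij} - \bar\alpha_{ij})(e'_{kl} - \bar\alpha_{kl})\,de_{kl},$

$d\bar\alpha_{ij} = \left(1 + \frac{c}{2G}\right)\frac{d\sigma^p_{ij}}{2G}, \quad L^p = \left(1 + \frac{c}{2G} + \frac{1}{3G}\frac{d\sigma_Y}{de^p_e}\right)2G.$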

These equations are valid for both hardening and ideal plastic materials. 
For further details, readers are referred to references such as Il'iushin
(1960, 1961), Naghdi and Trapp (1975a, 1975b, 1981), Yoder and Iwan (1981),
and Yoder (1981), listed in Bibliography 6.7.


6.15. FINITE DEFORMATION 

Considerable attention has been given to the study of finite elastic-plastic
deformation in recent years. For finite deformation, it is necessary
to distinguish between stress and strain measures defined with reference
to the deformed and the undeformed states of a continuum. In formulating
a plasticity model, stress and strain rates are required. Unfortunately,
depending on its definition, a rate may or may not be objective
(independent of coordinate transformation) in large deformation. A key
element in finite plasticity formulation is the choice of appropriate objective
rates that will correctly reflect the underlying physics of the plastic flow.
The choice of rates also strongly affects the form of the constitutive equations.
In general, finite deformation will also produce elastic-plastic coupling, and
thus may make the yield surface nonconvex. The best choice of rates is
still a topic of research. Detailed discussion of finite plasticity theory
is beyond the scope of this book.

6.16. PLASTIC DEFORMATION OF CRYSTALS 

The underlying physics of plastic flow for metals is the micro-slip of 
crystallographic planes. Micro-slip in crystals or grains is highly directional 
and nonuniform. Dislocation theories and the plasticity theories for single- 
and poly-crystals have provided the basis to quantify plastic deformation at 
the microscopic level and explain how and why plastic deformation occurs. 
The approach is based on the movements of atoms and the deformation 
of crystals and grains along slip planes. The macroscopic movements of 
metals are the aggregates of single- and poly-crystal responses to applied 
loads. This physical theory is a very important aspect of plasticity theory. 
Interested readers are referred to the references listed in Bibliography 6.9 
at the end of the book for details. 


PROBLEMS 

6.8. A solid is assumed to obey the von Mises criterion with isotropic hard¬ 
ening. If the virgin curve in uniaxial tension can be described in the infinitesimal 
deformation range by ay = F(e p ), determine 


def- = A 


df__ 

dan 


when k is assumed to depend on either e p , or W p . 
6.9. If the yield function is 


/ = \Ah -k = 0 , 


with 


•I‘2 'T ( a l j Otij) , Otij ce , 


where k and c are constant, show that 


Qtij 


c ( a ij ) 


{a' kl — ciki)a'ki 


2 K 2 



2 dn \ 

Tide?) ' 

6.10. Generalize the preceding result to the case where $k$ depends on $e^p_e$, and
determine the rate equations for both $\alpha_{ij}$ and $e^p_{ij}$.

6.11. Consider testing a certain metallic material in two ways: by torsion of a
hollow cylindrical specimen as shown in Fig. P6.11(a), and by uniaxial stretching
of a tension specimen as shown in Fig. P6.11(b). The torsional test shows that
the load-deflection curve is linear for a shearing stress below 25,000 lb/sq-in and
that yielding occurs at the stress 25,000 lb/sq-in. If the criterion of yielding for
this material is von Mises' $J_2 = k^2$, what is the expected value of the tensile
stress at which yielding will occur in the tension test?



Fig. P6.11. (a) Torsion of a hollow cylindrical specimen. (b) Uniaxial stretching of a tension specimen.


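6.12. Consider an isotropic, incompressible, but nonlinearly elastic material.
Let $e'_{ij}$, $\sigma'_{ij}$ be the strain deviation and stress deviation, respectively. If the
material were linearly elastic, then

    $e'_{ij} = \frac{1}{2G}\sigma'_{ij}$.

This can be written, of course, as a matrix equation,

    $\begin{pmatrix} e'_{11} & e'_{12} & e'_{13} \\ e'_{21} & e'_{22} & e'_{23} \\ e'_{31} & e'_{32} & e'_{33} \end{pmatrix} = \frac{1}{2G}\begin{pmatrix} \sigma'_{11} & \sigma'_{12} & \sigma'_{13} \\ \sigma'_{21} & \sigma'_{22} & \sigma'_{23} \\ \sigma'_{31} & \sigma'_{32} & \sigma'_{33} \end{pmatrix}$,

or, in short,

    $\mathbf{e}' = \frac{1}{2G}\boldsymbol{\sigma}'$.

To generalize such a relation to a nonlinear elastic material, we assume that
the matrix $\mathbf{e}'$ can be represented as a power series in $\boldsymbol{\sigma}'$,

    $\mathbf{e}' = C_1\boldsymbol{\sigma}' + C_2\boldsymbol{\sigma}'^2 + C_3\boldsymbol{\sigma}'^3 + \cdots$.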

6.13. Prove, by an application of the Cayley-Hamilton theorem, that the
most general nonlinear elasticity law of this type can be reduced to the form

    $\mathbf{e}' = P\boldsymbol{\sigma}' + Q\boldsymbol{\sigma}'^2$.    (Prager)


6.14. In soil mechanics, Coulomb's yield condition is widely used. Consider
a two-dimensional problem. If $\sigma_1$ and $\sigma_2$ are the principal stresses in the plane
of concern, Coulomb's yield condition is specified by the yield functions

    $f_1 = \sigma_1(1 + \sin\phi) - \sigma_2(1 - \sin\phi) - 2c\cos\phi$,

    $f_2 = -\sigma_1(1 - \sin\phi) + \sigma_2(1 + \sin\phi) - 2c\cos\phi$,

where $c$ = cohesion and $\phi$ = angle of internal friction. The soil is elastic if both
$f_1$ and $f_2$ are negative, and it is plastic when $f_1$ or $f_2$, or both, vanish.

(a) Show that the flow rule derived from this condition by means of Koiter's
    theory of generalized plastic potential requires that the rate of the planar
    dilatation $\dot e_1^{(p)} + \dot e_2^{(p)}$ and the maximum shear rate $|\dot e_1^{(p)} - \dot e_2^{(p)}|$ have the
    constant ratio $\sin\phi$ for all states of stress at the yield limit, except the
    state $\sigma_1 = \sigma_2 = c\cot\phi$, at which $f_1$ and $f_2$ vanish simultaneously.

(b) Generalize the results above to obtain the generalized plastic potentials
    for three-dimensional problems.

(c) According to your generalization of (b), is there any general relationship
    between the three-dimensional dilatation and the maximum shear rate
    for stress states at the yield limit?
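
The ratio claimed in part (a) of Problem 6.14 can be checked numerically with a short sketch. It assumes an associated flow rule in which each active yield function contributes a plastic strain rate proportional to its gradient with a nonnegative multiplier; the function names and the multipliers `lam1`, `lam2` are illustrative and do not come from the text.

```python
import math

def coulomb_flow_rates(phi, lam1=0.0, lam2=0.0):
    """Principal plastic strain rates from the assumed associated flow rule
    e_i = lam1 * df1/dsigma_i + lam2 * df2/dsigma_i, with lam1, lam2 >= 0."""
    s = math.sin(phi)
    # Gradients of f1 and f2 with respect to (sigma_1, sigma_2).
    df1 = (1.0 + s, -(1.0 - s))
    df2 = (-(1.0 - s), 1.0 + s)
    e1 = lam1 * df1[0] + lam2 * df2[0]
    e2 = lam1 * df1[1] + lam2 * df2[1]
    return e1, e2

def dilatation_to_shear_ratio(phi, lam1=0.0, lam2=0.0):
    """Ratio of planar dilatation rate to maximum shear rate."""
    e1, e2 = coulomb_flow_rates(phi, lam1, lam2)
    return (e1 + e2) / abs(e1 - e2)

phi = math.radians(30.0)
# Only one yield function active: the ratio equals sin(phi).
ratio_f1 = dilatation_to_shear_ratio(phi, lam1=2.0)
ratio_f2 = dilatation_to_shear_ratio(phi, lam2=0.7)
# Both active (the corner state): the ratio is no longer fixed.
ratio_corner = dilatation_to_shear_ratio(phi, lam1=1.0, lam2=0.5)
```

With only one yield function active the ratio does not depend on the multiplier, which is the content of part (a); at the corner both multipliers are free, so the ratio is not determined.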

6.15. For the yield surface as defined in Eq. (6.14:5), show that 

DijklTlkl Dijpqll-pq — 1 / s/DijlfiklDijlg flpq . 


6.16. For the equivalent plastic stress ($d\sigma^p_e$) and strain ($de^p_e$) as defined in
Eqs. (6.14:7) and (6.12:4), respectively, show that

    $d\sigma^p_e = de^p_e\sqrt{D_{ijkl}n_{kl}D_{ijpq}n_{pq}}$.


6.17. For an = Dijki{oie)ki,d(a e )ij = gijkida% n del as defined in Eq. (6.12:4), 
and the yield function as defined in Eq. (6.14:5), show that 


(a) docij — DijkiQktraDrspqdGpq! and 

(b) Equation (6.14:19) becomes 


K p = 



Dijkigklrs DrspqTlpq 


+ 


t l~2 \ f df df \ 

8k del V 3 J V dak 1 daki ) 


- 1/2 


which is the same as that given in Eq. (6.11:13) in the stress space 
formulation. 

LINEARIZED THEORY OF ELASTICITY


In this chapter we shall discuss the classical theory of elasticity. We shall 
discuss the general structure of the theory and illustrate the applications 
of the linearized theory by a few examples. 

Rectangular Cartesian coordinates of reference will be used throughout.
The coordinates will be denoted by $x_1, x_2, x_3$ or $x, y, z$ unless stated
otherwise.


7.1. BASIC EQUATIONS OF ELASTICITY FOR 
HOMOGENEOUS ISOTROPIC BODIES 


An elastic body has a unique zero-stress state, to which the body returns
when all stresses vanish. All stresses, strains, and particle displacements are
measured from this zero-stress state: their values are counted as zero in
that state.

There are two ways to describe a deformed body: the material and the
spatial (see Sec. 5.2). Consider the spatial description. The motion of a
continuum is described by the instantaneous velocity field $v_i(x_1, x_2, x_3, t)$.
To describe the strain in the body, a displacement field $u_i(x_1, x_2, x_3, t)$ is
specified which describes the displacement of a particle located at $x_1, x_2, x_3$
at time $t$ from its position in the natural state. Various strain measures
can be defined for the displacement field. The Almansi strain tensor is
expressed in terms of $u_i(x_1, x_2, x_3, t)$ according to Eq. (4.2:4),

(1)    $e_{ij} = \frac{1}{2}\left[\frac{\partial u_j}{\partial x_i} + \frac{\partial u_i}{\partial x_j} - \frac{\partial u_k}{\partial x_i}\frac{\partial u_k}{\partial x_j}\right]$.

The particle displacements $u_i$ are functions of time and position. The
particle velocity is given by the material derivative of the displacement,

(2)    $v_i = \frac{\partial u_i}{\partial t} + v_j\frac{\partial u_i}{\partial x_j}$.


The particle acceleration is given by the material derivative of the velocity
(5.2:7),

(3)    $\alpha_i = \frac{\partial v_i}{\partial t} + v_j\frac{\partial v_i}{\partial x_j}$.

The motion of the body must obey the equation of continuity (5.4:3)

(4)    $\frac{\partial\rho}{\partial t} + \frac{\partial(\rho v_j)}{\partial x_j} = 0$

and the equation of motion (5.5:7)

(5)    $\rho\alpha_i = \frac{\partial\sigma_{ij}}{\partial x_j} + X_i$.

In addition to the field Eqs. (4) and (5), the theory of linear elasticity
is based on Hooke's law. For a homogeneous isotropic material, this is
Eq. (6.2:7),

(6)    $\sigma_{ij} = \lambda e_{kk}\delta_{ij} + 2Ge_{ij}$,

where $\lambda$ and $G$ are constants independent of the spatial coordinates.*

The famous nonlinear terms in Eqs. (1)-(3) are sources of major difficulty
in the theory of elasticity. To make some progress, we are forced to
linearize by considering small displacements and small velocities, i.e., by
restricting ourselves to values of $u_i$, $v_i$ so small that the nonlinear terms in
Eqs. (1)-(3) may be neglected. In such a linearized theory, we have

(7)    $e_{ij} = \frac{1}{2}(u_{j,i} + u_{i,j})$,

(8)    $v_i = \frac{\partial u_i}{\partial t}$,    $\alpha_i = \frac{\partial v_i}{\partial t}$.

Unless stated otherwise, all that is discussed below is subject to this
restriction of linearization. Fortunately, many useful results can be obtained
from this linearized theory.
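
To see how quickly the nonlinear term of the Almansi strain becomes negligible, the following sketch compares the full expression with its linearized form for a displacement gradient scaled by a small factor. The base gradient and the function names are made up for illustration; the point is only that the difference scales with the square of the gradient.

```python
def almansi_strain(grad_u):
    """e_ij = 1/2 (u_j,i + u_i,j - u_k,i u_k,j), with grad_u[i][j] = du_i/dx_j."""
    return [[0.5 * (grad_u[j][i] + grad_u[i][j]
                    - sum(grad_u[k][i] * grad_u[k][j] for k in range(3)))
             for j in range(3)] for i in range(3)]

def linear_strain(grad_u):
    """Linearized strain e_ij = 1/2 (u_j,i + u_i,j)."""
    return [[0.5 * (grad_u[j][i] + grad_u[i][j]) for j in range(3)]
            for i in range(3)]

def max_abs_difference(a, b):
    """Largest componentwise difference between two 3x3 arrays."""
    return max(abs(a[i][j] - b[i][j]) for i in range(3) for j in range(3))

def linearization_error(eps):
    """Error of the linearized strain for a displacement gradient of size eps."""
    base = [[1.0, 2.0, 0.0], [0.0, -1.0, 1.0], [3.0, 0.0, 2.0]]  # illustrative pattern
    grad_u = [[eps * base[i][j] for j in range(3)] for i in range(3)]
    return max_abs_difference(almansi_strain(grad_u), linear_strain(grad_u))

err_coarse = linearization_error(1e-3)
err_fine = linearization_error(1e-4)
# Reducing the gradient by a factor of 10 reduces the error by about 100.
error_ratio = err_coarse / err_fine
```

For gradients of order $10^{-3}$, typical of many metal structures in the elastic range, the neglected term is of order $10^{-6}$, three orders of magnitude below the strain itself.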

Equations (1)-(6) or (4)-(8) together are 22 equations for the 22 unknowns
$\rho$, $u_i$, $v_i$, $\alpha_i$, $e_{ij}$, $\sigma_{ij}$. In the infinitesimal displacement theory we
may eliminate $\sigma_{ij}$ by substituting Eq. (6) into Eq. (5) and using Eq. (7) to
obtain the well-known Navier's equation,

(9) ▲    $Gu_{i,jj} + (\lambda + G)u_{j,ji} + X_i = \rho\frac{\partial^2 u_i}{\partial t^2}$.

This can be written in the form

(10) ▲    $G\nabla^2 u_i + (\lambda + G)e_{,i} + X_i = \rho\frac{\partial^2 u_i}{\partial t^2}$,

where

(11)    $e = u_{j,j}$,

(12)    $\nabla^2 u_i = u_{i,jj}$.

*The corresponding equations based on the material description are the following:
velocity and acceleration, Eqs. (5.2:3) and (5.2:6); strain measure, the Green's strain
tensor, Eq. (4.2:5); the equation of continuity, Eq. (5.2:3); stress tensors, Sec. 16.7; the
equations of motion, Eqs. (16.10:1-8); the stress-strain laws, see Sec. 16.11; in particular,
Eqs. (16.11:6) and (16.11:7). It can be seen that the kinematical relations appear simpler
in the material description, but the equations of motion appear more complicated.


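The quantity $e$ is the divergence of the displacement vector $u_i$. $\nabla^2$ is the
Laplace operator. If we write $x, y, z$ instead of $x_1, x_2, x_3$, and $u, v, w$ instead
of $u_1, u_2, u_3$, we have

(13)    $e = \frac{\partial u}{\partial x} + \frac{\partial v}{\partial y} + \frac{\partial w}{\partial z}$,

(14)    $\nabla^2 = \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}$.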

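Love (Bibliography 1.2) writes Eq. (10) in the form

(15)    $G\nabla^2(u, v, w) + (\lambda + G)\left(\frac{\partial}{\partial x}, \frac{\partial}{\partial y}, \frac{\partial}{\partial z}\right)e + (X, Y, Z) = \rho\frac{\partial^2}{\partial t^2}(u, v, w)$,

which is a shorthand for three equations of the type

(16) ▲    $G\nabla^2 u + (\lambda + G)\frac{\partial e}{\partial x} + X = \rho\frac{\partial^2 u}{\partial t^2}$.

This can also be written as

(17)    $G\left(\nabla^2 u + \frac{1}{1 - 2\nu}\frac{\partial e}{\partial x}\right) + X = \rho\frac{\partial^2 u}{\partial t^2}$.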
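
The passage from Eq. (16) to Eq. (17) rests on the identity $\lambda + G = G/(1 - 2\nu)$. A minimal sketch, assuming the standard conversion from Young's modulus $E$ and Poisson's ratio $\nu$ to the Lamé constants (the function name and the numerical values are illustrative), confirms it:

```python
import math

def lame_constants(E, nu):
    """Return (lambda, G) for an isotropic material from Young's modulus E
    and Poisson's ratio nu, using the standard conversion formulas."""
    G = E / (2.0 * (1.0 + nu))
    lam = E * nu / ((1.0 + nu) * (1.0 - 2.0 * nu))
    return lam, G

E, nu = 200.0e9, 0.3           # illustrative values, roughly those of steel (Pa, dimensionless)
lam, G = lame_constants(E, nu)
coefficient_eq16 = lam + G               # coefficient of de/dx in Eq. (16)
coefficient_eq17 = G / (1.0 - 2.0 * nu)  # the same coefficient as written in Eq. (17)
```

The factor $1/(1 - 2\nu)$ grows without bound as $\nu$ approaches 1/2, which is why the incompressible limit needs separate treatment.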

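If we introduce the rotation vector

(18)    $(\omega_x, \omega_y, \omega_z) = \frac{1}{2}\,\mathrm{curl}\,(u, v, w) = \frac{1}{2}\left(\frac{\partial w}{\partial y} - \frac{\partial v}{\partial z},\ \frac{\partial u}{\partial z} - \frac{\partial w}{\partial x},\ \frac{\partial v}{\partial x} - \frac{\partial u}{\partial y}\right)$

and use the identity

(19)    $\nabla^2(u, v, w) = \left(\frac{\partial}{\partial x}, \frac{\partial}{\partial y}, \frac{\partial}{\partial z}\right)e - 2\,\mathrm{curl}\,(\omega_x, \omega_y, \omega_z)$,

then Eq. (10) may be written as

(20)    $(\lambda + 2G)\left(\frac{\partial}{\partial x}, \frac{\partial}{\partial y}, \frac{\partial}{\partial z}\right)e - 2G\,\mathrm{curl}\,(\omega_x, \omega_y, \omega_z) + (X, Y, Z) = \rho\frac{\partial^2}{\partial t^2}(u, v, w)$.
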
7.2. EQUILIBRIUM OF AN ELASTIC BODY UNDER
ZERO BODY FORCE

Consider the conditions of static equilibrium of an elastic body. If the
body force vanishes, $X_i = 0$, then by taking the divergence of Eq. (7.1:9),
we have

    $Gu_{i,jji} + (\lambda + G)u_{j,jii} = 0$,

or

(1)    $(u_{j,j})_{,ii} = 0$.

In unabridged notation for rectangular Cartesian coordinates, this is

(2)    $\left(\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}\right)e = 0$,    or    $\nabla^2 e = 0$.

Equation (2) is a Laplace equation. A function satisfying Eq. (2) is called
a harmonic function. Thus, the dilatation $e$ is a harmonic function when the
body force vanishes.

But

    $(3\lambda + 2G)e = \sigma_{xx} + \sigma_{yy} + \sigma_{zz} = 3\sigma$,

where $\sigma$ is the mean stress. Hence, the mean stress is also a harmonic
function:

(3)    $\nabla^2\sigma = 0$.
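
The proportionality between dilatation and mean stress follows from taking the trace of the isotropic Hooke's law, $\sigma_{ij} = \lambda e_{kk}\delta_{ij} + 2Ge_{ij}$. The sketch below checks it for one arbitrary strain state; the strain values and material constants are illustrative only.

```python
import math

def hooke_stress(e, lam, G):
    """Isotropic Hooke's law: sigma_ij = lam * e_kk * delta_ij + 2 G e_ij."""
    tr = e[0][0] + e[1][1] + e[2][2]
    return [[lam * tr * (1.0 if i == j else 0.0) + 2.0 * G * e[i][j]
             for j in range(3)] for i in range(3)]

def mean_stress(sigma):
    """One third of the trace of the stress tensor."""
    return (sigma[0][0] + sigma[1][1] + sigma[2][2]) / 3.0

lam, G = 1.5, 1.0  # illustrative constants in arbitrary consistent units
strain = [[2.0e-4, 1.0e-4, 0.0],
          [1.0e-4, -1.0e-4, 3.0e-4],
          [0.0, 3.0e-4, 5.0e-4]]
dilatation = strain[0][0] + strain[1][1] + strain[2][2]
stress = hooke_stress(strain, lam, G)
lhs = (3.0 * lam + 2.0 * G) * dilatation  # (3 lambda + 2G) e
rhs = 3.0 * mean_stress(stress)           # sigma_xx + sigma_yy + sigma_zz
```

Because the two quantities differ only by the constant factor $3\lambda + 2G$, any differential equation satisfied by $e$, such as Laplace's equation, is satisfied by the mean stress as well.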

If we put $X = 0$, $\partial^2 u/\partial t^2 = 0$, and operate on Eq. (7.1:16) with the
Laplacian $\nabla^2$, we have

    $(\lambda + G)\frac{\partial}{\partial x}\nabla^2 e + G\nabla^2\nabla^2 u = 0$.

With Eq. (2), this implies that

(4)    $\nabla^4 u = 0$,

where in rectangular Cartesian coordinates,

(5)    $\nabla^4 = \frac{\partial^4}{\partial x^4} + \frac{\partial^4}{\partial y^4} + \frac{\partial^4}{\partial z^4} + 2\frac{\partial^4}{\partial x^2\partial y^2} + 2\frac{\partial^4}{\partial y^2\partial z^2} + 2\frac{\partial^4}{\partial z^2\partial x^2}$.

Equation (4) is called a biharmonic equation, and its solution is called a
biharmonic function. Hence, the displacement component $u$ is biharmonic.
Similarly, the components $v$, $w$ are biharmonic. It follows that when the
body force is zero, each of the strain components and each of the stress
components, being linear combinations of the first derivatives of $u$, $v$, $w$, are
all biharmonic functions:

(6)    $\nabla^4\sigma_{ij} = 0$,

(7)    $\nabla^4 e_{ij} = 0$.

7.3. BOUNDARY VALUE PROBLEMS 

Navier's Eq. (7.1:9) combines Hooke's law and the equation of motion.
It is to be solved for appropriate boundary and initial conditions. The
boundary conditions that occur are usually one of two kinds:

A. Specified displacements. The components of displacement $u_i$ are
prescribed on the boundary.

B. Specified surface tractions. The components of surface traction $\overset{\nu}{T}_i$
are assigned on the boundary.

In most problems of elasticity, the boundary conditions are such that over
part of the boundary displacements are specified, whereas over another part
the surface tractions are specified. Let the region occupied by an elastic
body be denoted by $V$. Let the boundary surface of $V$ be denoted by $S$.
We separate $S$ into two parts: $S_u$, where displacements are specified; and
$S_\sigma$, where surface tractions are specified. Therefore, on $S_\sigma$,

    $\overset{\nu}{T}_i = \sigma_{ij}\nu_j$ = a prescribed function,

where $\nu_j$ is a unit vector along the outer normal to the surface $S_\sigma$. By
Hooke's law, this may be written as

    $[\lambda u_{k,k}\delta_{ij} + G(u_{i,j} + u_{j,i})]\nu_j$ = a prescribed function.

Hence, over the entire surface, the boundary conditions are that either
$u_i$, or a combination of the first derivatives of $u_i$, are prescribed.
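
As a small illustration of the traction condition $\overset{\nu}{T}_i = \sigma_{ij}\nu_j$, the sketch below evaluates the traction vector on two planes cut through a body in uniaxial tension. The helper name and the stress values are illustrative.

```python
import math

def traction(sigma, normal):
    """Traction T_i = sigma_ij nu_j on a surface whose outer normal is `normal`.
    The normal is rescaled to unit length before use."""
    length = math.sqrt(sum(c * c for c in normal))
    nu = [c / length for c in normal]
    return [sum(sigma[i][j] * nu[j] for j in range(3)) for i in range(3)]

# Uniaxial tension of magnitude 100 (arbitrary units) along the x-axis.
sigma = [[100.0, 0.0, 0.0],
         [0.0, 0.0, 0.0],
         [0.0, 0.0, 0.0]]

t_inclined = traction(sigma, (1.0, 1.0, 0.0))  # plane inclined at 45 degrees
t_lateral = traction(sigma, (0.0, 1.0, 0.0))   # lateral surface of the bar
```

On the lateral surface the traction vanishes, as a stress-free boundary requires; on the inclined plane it is still directed along $x$ but reduced by the factor $\cos 45^\circ$.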

In dynamic problems, a set of initial conditions on $u_i$ or $\sigma_{ij}$ must be
specified in the region $V$ and on the boundary $S$.

The question arises whether a boundary-value problem posed in this
way has a solution, and whether the solution is unique or not. The question
has two parts. First, do we expect a unique solution on physical grounds?
Second, does the specific mathematical problem have a unique solution? In
continuum mechanics, there are many occasions in which we do not expect a
unique solution to exist. For example, when a thin-walled spherical shell is
subjected to a uniform external pressure, the phenomenon of buckling may
occur when the pressure exceeds a certain specific value. At the buckling
load, the shell may assume several different forms of deformation, some
stable, some unstable. On the other hand, our everyday experience about
the physical world tells us that the vast majority of mechanical
cause-and-effect relationships are unique. Theoretically, the physical question is
partly answered by thermodynamics. But the mathematical question must
be answered by the theory of partial differential equations. A satisfactory
theory must bring harmony between the mathematical formulation and the
physical world.

In the preceding discussions we have taken the displacements $u_i$ as the
basic unknown variables. In problems of static equilibrium, however, it is
customary to use an alternate procedure. The equations of equilibrium are
first solved for the stresses $\sigma_{ij}$. We then use Hooke's law to obtain the strain
$e_{ij}$. This solution will not be unique. In fact, an infinite set of solutions
will be found. The correct one is then singled out by the conditions of
compatibility. Only the one solution that satisfies both the equations of
equilibrium and the equations of compatibility corresponds to a continuous
displacement field. This procedure becomes very attractive when stress
functions are introduced, which yield general solutions of the equations of
equilibrium (see Sec. 9.2).

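By means of Hooke's law, the compatibility equation

(1)    $e_{ij,kl} + e_{kl,ij} - e_{ik,jl} - e_{jl,ik} = 0$

can be expressed directly in terms of stress components. On substituting

    $e_{ij} = \frac{1 + \nu}{E}\sigma_{ij} - \frac{\nu}{E}\delta_{ij}\theta$,    $\theta = \sigma_{kk}$,

into Eq. (1), we obtain

(2)    $\sigma_{ij,kl} + \sigma_{kl,ij} - \sigma_{ik,jl} - \sigma_{jl,ik} = \frac{\nu}{1 + \nu}\left(\delta_{ij}\theta_{,kl} + \delta_{kl}\theta_{,ij} - \delta_{ik}\theta_{,jl} - \delta_{jl}\theta_{,ik}\right)$.

Since only six of the 81 equations represented by Eq. (1) are linearly
independent, the same must be true for Eq. (2). If we combine Eqs. (2) linearly
by setting $k = l$ and summing, we obtain

(3)    $\sigma_{ij,kk} + \sigma_{kk,ij} - \sigma_{ik,jk} - \sigma_{jk,ik} = \frac{\nu}{1 + \nu}\left(\delta_{ij}\theta_{,kk} + \delta_{kk}\theta_{,ij} - \delta_{ik}\theta_{,jk} - \delta_{jk}\theta_{,ik}\right)$,
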
which is a set of nine equations of which six are independent because of
the symmetry in $i$ and $j$. Hence, the number of independent equations
is not reduced, and Eqs. (2) and (3) are equivalent. Since $\sigma_{kk} = \theta$ and
$\sigma_{ij,kk} = \nabla^2\sigma_{ij}$, if we use the equation of equilibrium to replace, say, $\sigma_{ik,kj}$
by $-X_{i,j}$, we can write Eq. (3) as

(4)    $\nabla^2\sigma_{ij} + \frac{1}{1 + \nu}\theta_{,ij} - \frac{\nu}{1 + \nu}\delta_{ij}\nabla^2\theta = -(X_{i,j} + X_{j,i})$,

where $X_i$ is the body force per unit volume. In dynamic problems the
inertia force should be included in $X_i$. With a contraction $i = j$, Eq. (4)
furnishes a relation between $\nabla^2\theta$ and $X_{i,i}$. If this is used to transform the
third term in Eq. (4), we obtain

(5) ▲    $\nabla^2\sigma_{ij} + \frac{1}{1 + \nu}\theta_{,ij} = -\frac{\nu}{1 - \nu}\delta_{ij}X_{k,k} - (X_{i,j} + X_{j,i})$.

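Written out in extenso, these are

(7)    $\nabla^2\sigma_{xx} + \frac{1}{1 + \nu}\frac{\partial^2\theta}{\partial x^2} = -\frac{\nu}{1 - \nu}\left(\frac{\partial X}{\partial x} + \frac{\partial Y}{\partial y} + \frac{\partial Z}{\partial z}\right) - 2\frac{\partial X}{\partial x}$,

       $\nabla^2\sigma_{yy} + \frac{1}{1 + \nu}\frac{\partial^2\theta}{\partial y^2} = -\frac{\nu}{1 - \nu}\left(\frac{\partial X}{\partial x} + \frac{\partial Y}{\partial y} + \frac{\partial Z}{\partial z}\right) - 2\frac{\partial Y}{\partial y}$,

       $\nabla^2\sigma_{zz} + \frac{1}{1 + \nu}\frac{\partial^2\theta}{\partial z^2} = -\frac{\nu}{1 - \nu}\left(\frac{\partial X}{\partial x} + \frac{\partial Y}{\partial y} + \frac{\partial Z}{\partial z}\right) - 2\frac{\partial Z}{\partial z}$,

       $\nabla^2\sigma_{yz} + \frac{1}{1 + \nu}\frac{\partial^2\theta}{\partial y\,\partial z} = -\left(\frac{\partial Y}{\partial z} + \frac{\partial Z}{\partial y}\right)$,

       $\nabla^2\sigma_{zx} + \frac{1}{1 + \nu}\frac{\partial^2\theta}{\partial z\,\partial x} = -\left(\frac{\partial Z}{\partial x} + \frac{\partial X}{\partial z}\right)$,

       $\nabla^2\sigma_{xy} + \frac{1}{1 + \nu}\frac{\partial^2\theta}{\partial x\,\partial y} = -\left(\frac{\partial X}{\partial y} + \frac{\partial Y}{\partial x}\right)$.
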


These equations were obtained by Michell in 1900, and, for the case in 
which body forces are absent, by Beltrami in 1892. They are known as the 
Beltrami-Michell compatibility equations. 

For a simply connected region, satisfaction of the Beltrami-Michell
equations implies that the stress system $\sigma_{ij}$ is derivable from a continuous
displacement field. If the region concerned is multiply connected,
additional conditions in the form of certain line integrals must be satisfied
(see Sec. 4.7).

7.4. EQUILIBRIUM AND UNIQUENESS OF SOLUTIONS

Consider the problem of determining the state of stress and strain in a
body of a given shape which is held strained by body forces $X_i$ and surface
tractions $\overset{\nu}{T}_i$. Let us assume that a function $W(e_{11}, e_{12}, \ldots, e_{33})$, called the
strain energy function of the elastic material, exists and has the property

(1)    $\frac{\partial W}{\partial e_{ij}} = \sigma_{ij}$.

For example, if the material obeys Hooke's law [Eq. (6.1:1)], then

(2)    $W = \frac{1}{2}C_{ijkl}e_{ij}e_{kl}$.


The existence and the positive definiteness of the strain energy function 
for an elastic body are discussed in Chapter 12. In Sec. 12.4, it is shown 
that W is positive definite in the neighborhood of the natural state. A 
natural state of a material is a stable state in which the material can exist 
by itself in thermodynamic equilibrium. Limiting ourselves to consider 
materials which have a unique natural state in the theory of elasticity, we 
are assured of a positive definite strain energy function in the neighborhood 
of the natural state. 

The equations of equilibrium $\sigma_{ij,j} + X_i = 0$ may be written in terms of
$W$ as

(3)    $\left(\frac{\partial W}{\partial e_{ij}}\right)_{,j} + X_i = 0$.

The boundary conditions over the boundary surface $S_u + S_\sigma$ are:

(4a)   Over $S_u$, the values of $u_i$ are given,

(4b)   Over $S_\sigma$, the tractions $\overset{\nu}{T}_i = \frac{\partial W}{\partial e_{ij}}\nu_j$ are specified

(see Sec. 7.3). If $S_\sigma$ constitutes the entire surface of the body, it is obvious
that equilibrium would be impossible unless the system of body forces and
surface tractions satisfy the conditions of static equilibrium for the body as
a whole. In the same case, the displacement will be indeterminate to the
extent of a possible rigid-body motion.

We shall prove the following theorem due to Kirchhoff. If either the
surface displacements or the surface tractions are given, the solution of
the problem of equilibrium of an elastic body as specified by Eqs. (1)-(4)
is unique in the sense that the state of stress (and strain) is determinate
without ambiguity, provided that the magnitude of the stress (and strain) is
so small that the strain energy function exists and remains positive definite.

Proof. Since a strained state must be either unique or nonunique, a proof
can be constructed by showing that an assumption of nonuniqueness leads
to contradiction. Assume that there exist two sets of displacements $u'_i$
and $u''_i$ which define two states of strain, both satisfying Eq. (3) and the
boundary conditions (4a) and (4b). Then the difference $u_i = u'_i - u''_i$ satisfies
the equation

(5)    $\left(\frac{\partial W}{\partial e_{ij}}\right)_{,j} = 0$

and the boundary conditions that

(6)    $u_i = 0$ on $S_u$    and    $\frac{\partial W}{\partial e_{ij}}\nu_j = 0$ on $S_\sigma$.

From Eq. (5) we have

    $\int_V u_i\left(\frac{\partial W}{\partial e_{ij}}\right)_{,j} dV = 0$,

which, on integrating by parts, becomes

(7)    $\int_S u_i\frac{\partial W}{\partial e_{ij}}\nu_j\,dS - \int_V \frac{\partial W}{\partial e_{ij}}u_{i,j}\,dV = 0$.

The first surface integral vanishes because of the boundary conditions (6).
The second volume integral may be written as

    $\int_V \frac{\partial W}{\partial e_{ij}}e_{ij}\,dV$.

Now, when $W$ is a homogeneous quadratic function of $e_{ij}$ [Eq. (2)], the
integral above is equal to $\int 2W\,dV$. Since $W$ is assumed to be positive
definite, the integral $\int W\,dV$ cannot vanish unless $W$ vanishes, which in
turn implies that $e_{ij} = 0$ everywhere. Hence, $u_i = u'_i - u''_i$ corresponds to
the natural, unstrained state of the body. Therefore, the states of strain
(and stress) defined by $u'_i$ and $u''_i$ are the same, contrary to the assumption.
Hence, the state of strain (and stress) is unique. Q.E.D.
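
The step from $\int (\partial W/\partial e_{ij})e_{ij}\,dV$ to $\int 2W\,dV$ is Euler's theorem for a homogeneous quadratic form. A hedged numerical sketch, assuming the isotropic special case $W = \frac{1}{2}\lambda e_{kk}^2 + Ge_{ij}e_{ij}$ with illustrative constants, checks both the identity $\sigma_{ij}e_{ij} = 2W$ and the positivity of $W$ on random strain states:

```python
import random

def hooke_stress(e, lam, G):
    """Isotropic Hooke's law: sigma_ij = lam * e_kk * delta_ij + 2 G e_ij."""
    tr = e[0][0] + e[1][1] + e[2][2]
    return [[lam * tr * (1.0 if i == j else 0.0) + 2.0 * G * e[i][j]
             for j in range(3)] for i in range(3)]

def strain_energy(e, lam, G):
    """W = (lam / 2) e_kk^2 + G e_ij e_ij, a homogeneous quadratic form."""
    tr = e[0][0] + e[1][1] + e[2][2]
    return 0.5 * lam * tr * tr + G * sum(e[i][j] ** 2
                                         for i in range(3) for j in range(3))

def random_symmetric_strain(rng, scale=1.0e-3):
    """Random symmetric 3x3 strain with components of order `scale`."""
    a = [[rng.uniform(-scale, scale) for _ in range(3)] for _ in range(3)]
    return [[0.5 * (a[i][j] + a[j][i]) for j in range(3)] for i in range(3)]

def check_euler_identity(samples=100, lam=1.5, G=1.0, seed=0):
    """Return the worst violation of sigma_ij e_ij = 2W and the smallest W."""
    rng = random.Random(seed)
    worst_gap, smallest_w = 0.0, float("inf")
    for _ in range(samples):
        e = random_symmetric_strain(rng)
        sigma = hooke_stress(e, lam, G)
        contraction = sum(sigma[i][j] * e[i][j]
                          for i in range(3) for j in range(3))
        w = strain_energy(e, lam, G)
        worst_gap = max(worst_gap, abs(contraction - 2.0 * w))
        smallest_w = min(smallest_w, w)
    return worst_gap, smallest_w

worst_gap, smallest_w = check_euler_identity()
```

For this isotropic form, positivity for every nonzero strain requires $G > 0$ and $3\lambda + 2G > 0$; the constants used here satisfy both.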


We must note that the uniqueness theorem is proved only in the
neighborhood of the natural state. In fact, when the strain energy function fails
to remain positive definite, a multi-valued solution or several solutions may
be possible.

Problem 7.1. Prove Neumann's theorem that the solution $u_i(x, t)$, for $x$ in
$V + S_u + S_\sigma$ and $t > 0$, of the following system of equations, is unique.

(8)    $\frac{\partial}{\partial x_j}\left(\frac{\partial W}{\partial e_{ij}}\right) + X_i = \rho\frac{\partial^2 u_i}{\partial t^2}$,    for $x$ in $V$, $t > 0$,

(9)    $u_i = f_i(x, t)$,    for $x$ on $S_u$, $t > 0$,

(10)   $\frac{\partial W}{\partial e_{ij}}\nu_j = g_i(x, t)$,    for $x$ on $S_\sigma$, $t > 0$,

(11)   $u_i = h_i(x)$,  $\dot u_i = k_i(x)$,    when $t = 0$, $x$ in $V + S$,

(12)   $e_{ij} = \frac{1}{2}(u_{i,j} + u_{j,i})$,

where $X_i$, $f_i$, $g_i$, $h_i$, $k_i$ are preassigned functions and $W(e_{ij})$ is a positive definite
quadratic form. Hint: Multiply Eq. (8) by $\partial u_i/\partial t$ and integrate over $V$ and $(0, t)$.
Note that the kinetic energy is positive definite. (Reference: Love, Elasticity,
Bibliography 1.2, p. 176.)

Notes on Possible Loss of Uniqueness 

The uniqueness theorem of Kirchhoff is the foundation for the method 
of potentials (Chapters 8 and 9). For, when uniqueness of solution is estab¬ 
lished, one needs to find only a solution of a given boundary-value problem: 
that solution is the solution. 

But it is essential for our theory to be able to violate the uniqueness of 
solution in one way or another. We know that elastic columns can buckle, 
thin shells may collapse, airplane wings can flutter, machinery can become 
unstable in one sense or other. The word “stability” has many meanings; 
to define a stability problem we must define the sense of the word stability. 
A large class of practical stability problems is connected with the loss of 
uniqueness of solution. Under certain circumstances two or more solutions 
may become possible; some of these may be dangerous from an engineering
point of view or undesirable for the function of the machinery; and the
circumstances are said to cause instability.

The uniqueness theorem can be violated by violating any one of its as¬ 
sumptions. Referring to Kirchhoff’s theorem, we have two possibilities: 

(a) Loss of positive-definiteness of the strain energy function, W(eij), 

(b) Basic changes in the equations of equilibrium (or of motion). 

The first possibility arises when the material becomes unstable, as yielding
or flow occurs (cf. Sec. 12.5). It is relevant to the plastic buckling of
columns, plates, and shells.

The second possibility may arise in a variety of forms. The most im¬ 
portant are those due to 

(1) finite deformation,

(2) nonconservative forces, and

(3) forces that are functionals of the deformation or history of
deformation.

Most buckling problems can be understood only if we realize the basic
changes in the equation of equilibrium introduced by finite deformation.
For finite deformations the equation of equilibrium (or of motion) is given
by Eq. (16.10:7) or (16.10:8). These equations are basically nonlinear
because the strains and rotations depend on the stresses. The corresponding
equations of equilibrium or motion for a plate are given in Sec. 16.16,
e.g., Eqs. (16.16:10) to (16.16:14). The situation with a column is similar.
The linearized versions of these equations retain the basic features of these
large-deflection equations, so that these problems generally become
eigenvalue problems or bifurcation problems.

Nonconservative forces generally depend on the deformation of the 
structure in a certain specific manner. For example, consider an axial load 
applied to the end of a cantilever column. If this load is fixed in direction, 
it is conservative. If this load is not fixed in direction, but may rotate in the 
process of buckling, then it is nonconservative. The case (3) listed above is 
a special but broad class of nonconservative forces. It occurs commonly if a 
solid body is placed in a flowing fluid. The aerodynamic or hydrodynamic 
pressure acting on the body depends on the deformation of the body, and 
on the local and whole body velocity and acceleration. If the wake and 
vorticity are important, the aerodynamic pressure will depend also on the 
history of deformation. This is commonly the case of an aircraft lifting
surface. Under these loadings the terms $X_i$ and $\overset{\nu}{T}_i$ in Eqs. (7.4:3), (7.4:4),
and (7.4:8), (7.4:10) are functions or functionals of $u_i(x_1, x_2, x_3, t)$.

In any of the cases mentioned above the basic equations differ from those 
assumed in the Kirchhoff theorem. Loss of uniqueness does not necessarily 
follow, but it becomes a possibility and must be investigated. 

7.5. SAINT-VENANT’S THEORY OF TORSION 

To illustrate the applications of the theory of elasticity, we shall consider
the problem of torsion of a cylindrical body. A cylindrical shaft, with an
axis $z$, is acted on at its ends by a distribution of shearing stresses whose
resultant force is zero but whose resultant moment is a torque $T$. The
lateral surface of the shaft is stress free. See Fig. 7.5:1.

Fig. 7.5:1. Torsion of bars of elliptic and square cross section, as drawn by Saint-Venant.


If the shaft is a circular cylinder, it is very easy to show that all plane
cross sections normal to the $z$-axis remain plane, and the deformation
consists of relative rotation $\theta$ of the cross sections. The torque $T$ is
proportional to the rate of rotation per unit axial length $d\theta/dz$, with a
proportionality constant equal to the product of the shear modulus $G$ of the
shaft material and the polar moment of inertia $J$ of the shaft cross-sectional area:

(1)    $GJ\frac{d\theta}{dz} = T$.

The only nonvanishing component of stress is the shear in cross sections
perpendicular to $z$, whose magnitude is

(2)    $\tau = \frac{Tr}{J}$,

where $r$ is the radius vector from the central axis $z$.
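
As a worked instance of Eqs. (1) and (2), the sketch below evaluates the twist rate and the maximum shear stress of a solid circular shaft. It assumes the standard result $J = \pi R^4/2$ for a solid circle of radius $R$; the dimensions, modulus, and torque are illustrative values in SI units.

```python
import math

def polar_moment_solid_circle(radius):
    """Polar moment of inertia of a solid circular section, J = pi R^4 / 2."""
    return math.pi * radius ** 4 / 2.0

def twist_rate(torque, shear_modulus, J):
    """Rate of twist d(theta)/dz from Eq. (1): G J d(theta)/dz = T."""
    return torque / (shear_modulus * J)

def shear_stress(torque, r, J):
    """Shear stress at radius r from Eq. (2): tau = T r / J."""
    return torque * r / J

R = 0.025        # shaft radius, m (illustrative)
G = 80.0e9       # shear modulus, Pa (illustrative, roughly steel)
T = 1000.0       # applied torque, N m (illustrative)

J = polar_moment_solid_circle(R)
theta_prime = twist_rate(T, G, J)   # rad per metre of shaft length
tau_max = shear_stress(T, R, J)     # largest stress, at the outer surface
```

The stress grows linearly with $r$, so the maximum is at the surface, where Eq. (2) reduces to $\tau_{\max} = 2T/(\pi R^3)$.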

If the cross section of the shaft is not circular, a plane cross section does
not remain plane; it warps, as is shown in Fig. 7.5:1. The problem is to
calculate the stress distribution and the deformation of the shaft.

This is an important problem in engineering, for shafts are used to
transmit torques and they are seen everywhere. The celebrated solution
to the problem is due to Barré de Saint-Venant (1855), who used the
so-called semi-inverse method, i.e., a method in which one guesses at part of
the solution and tries to determine the rest rationally so that all the differential
equations and boundary conditions are satisfied. The torsion problem is not
simple. Saint-Venant, guided by the solution of the circular shaft, made a
brilliant guess and showed that an exact solution to a well-defined problem
can be obtained.

We shall consider, then, a cylindrical shaft with an axis $z$, with the ends
at $z = 0$ and $z = L$. A set of rectangular Cartesian coordinates $x, y, z$ will
be used, with the $x, y$-plane perpendicular to the axis of the shaft. The
displacement components in the $x$, $y$, $z$-direction will be written as $u$, $v$,
$w$, respectively. Saint-Venant assumed that, as the shaft twists, the plane
cross sections are warped but the projections on the $x, y$-plane rotate as a
rigid body; i.e.,

(3)    $u = -\alpha zy$,    $v = \alpha zx$,    $w = \alpha\varphi(x, y)$,

where $\varphi(x, y)$ is some function of $x$ and $y$, called the warping function, and
$\alpha$ is the angle of twist per unit length of the bar and is assumed to be
very small ($\ll 1$). We rely on the function $\varphi(x, y)$ to satisfy the differential
equations of equilibrium (without body force)

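       $\frac{\partial\sigma_{xx}}{\partial x} + \frac{\partial\sigma_{xy}}{\partial y} + \frac{\partial\sigma_{xz}}{\partial z} = 0$,

(4)    $\frac{\partial\sigma_{yx}}{\partial x} + \frac{\partial\sigma_{yy}}{\partial y} + \frac{\partial\sigma_{yz}}{\partial z} = 0$,

       $\frac{\partial\sigma_{zx}}{\partial x} + \frac{\partial\sigma_{zy}}{\partial y} + \frac{\partial\sigma_{zz}}{\partial z} = 0$,

the boundary conditions on the lateral surface of the cylinder

       $\sigma_{xx}\nu_x + \sigma_{xy}\nu_y = 0$,

(5)    $\sigma_{yx}\nu_x + \sigma_{yy}\nu_y = 0$,

       $\sigma_{zx}\nu_x + \sigma_{zy}\nu_y = 0$,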
and the boundary conditions at the ends $z = 0$ and $z = L$:

(6)    $\sigma_{zz} = 0$,
       $\sigma_{zx}$, $\sigma_{zy}$ equipollent to a torque $T$.

The constants $\nu_x$, $\nu_y$ are the direction cosines of the exterior normal to the
lateral surface ($\nu_z = 0$).

From Eq. (3), we obtain the stresses according to Hooke's law:

(7)    $\sigma_{yz} = \alpha G\left(\frac{\partial\varphi}{\partial y} + x\right)$,    $\sigma_{xz} = \alpha G\left(\frac{\partial\varphi}{\partial x} - y\right)$,

       $\sigma_{xy} = \sigma_{xx} = \sigma_{yy} = \sigma_{zz} = 0$.


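A substitution of these values into Eqs. (4) shows that the equilibrium
equations will be satisfied if $\varphi(x, y)$ satisfies the equation

(8)    $\frac{\partial^2\varphi}{\partial x^2} + \frac{\partial^2\varphi}{\partial y^2} = 0$

throughout the cross section of the cylinder. To satisfy the boundary
conditions (5), we must have

(9)    $\left(\frac{\partial\varphi}{\partial x} - y\right)\cos(x, \nu) + \left(\frac{\partial\varphi}{\partial y} + x\right)\cos(y, \nu) = 0$    on $C$,

where $C$ is the boundary of the cross section $R$ (Fig. 7.5:2). But

    $\frac{\partial\varphi}{\partial x}\cos(x, \nu) + \frac{\partial\varphi}{\partial y}\cos(y, \nu) = \frac{\partial\varphi}{\partial\nu}$;

hence, the boundary condition (9) can be written as

(10)   $\frac{\partial\varphi}{\partial\nu} = y\cos(x, \nu) - x\cos(y, \nu)$    on $C$.
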
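The boundary conditions (6) are satisfied at the ends $z = 0$ and $z = L$ if

(11)   $\iint_R \sigma_{zx}\,dx\,dy = 0$,    $\iint_R \sigma_{zy}\,dx\,dy = 0$,

(12)   $\iint_R (x\sigma_{zy} - y\sigma_{zx})\,dx\,dy = T$.

We can show that Eqs. (11) are readily satisfied if $\varphi$ satisfies Eqs. (8) and
(10); because

    $\iint_R \sigma_{zx}\,dx\,dy = \alpha G\iint_R \left(\frac{\partial\varphi}{\partial x} - y\right)dx\,dy = \alpha G\iint_R \left\{\frac{\partial}{\partial x}\left[x\left(\frac{\partial\varphi}{\partial x} - y\right)\right] + \frac{\partial}{\partial y}\left[x\left(\frac{\partial\varphi}{\partial y} + x\right)\right]\right\}dx\,dy$,

since $\varphi$ satisfies Eq. (8). On applying Gauss' theorem to the last integral,
it becomes a line integral on the boundary $C$ of the region $R$:

    $\alpha G\oint_C x\left[\frac{\partial\varphi}{\partial\nu} - y\cos(x, \nu) + x\cos(y, \nu)\right]ds$,

which vanishes on account of Eq. (10). Similarly, the second of Eqs. (11) is
satisfied. Finally, the last condition (12) requires that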


(13)   $T = \alpha G\iint_R \left(x^2 + y^2 + x\frac{\partial\varphi}{\partial y} - y\frac{\partial\varphi}{\partial x}\right)dx\,dy$.

Writing $J$ for the integral

(14)   $J = \iint_R \left(x^2 + y^2 + x\frac{\partial\varphi}{\partial y} - y\frac{\partial\varphi}{\partial x}\right)dx\,dy$,

we have

(15)   $T = \alpha GJ$.

This merely shows that the torque $T$ is proportional to the angle of twist per
unit length $\alpha$, with a proportionality constant $GJ$, which is usually called
the torsional rigidity of the shaft. The $J$ represents the polar moment of
inertia of the section when the cross section is circular and the warping
function is zero. However, it is conventional to retain the notation $GJ$
for torsional rigidity, even for noncircular cylinders.
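
To make Eq. (14) concrete, the following sketch evaluates $J$ numerically for an elliptic cross section with semi-axes $a$ and $b$. It assumes the classical warping function for the ellipse, $\varphi = -\frac{a^2 - b^2}{a^2 + b^2}xy$, which is not derived in the passage above; the code also checks that this $\varphi$ meets boundary condition (10). The function names and grid sizes are illustrative.

```python
import math

def warping_gradient(x, y, a, b):
    """Gradient of the assumed warping function phi = -k x y for an ellipse."""
    k = (a * a - b * b) / (a * a + b * b)
    return -k * y, -k * x

def boundary_residual(t, a, b):
    """Residual of Eq. (10) at the boundary point (a cos t, b sin t)."""
    x, y = a * math.cos(t), b * math.sin(t)
    nx, ny = x / (a * a), y / (b * b)          # direction of the outward normal
    norm = math.hypot(nx, ny)
    nx, ny = nx / norm, ny / norm
    phi_x, phi_y = warping_gradient(x, y, a, b)
    return phi_x * nx + phi_y * ny - (y * nx - x * ny)

def torsion_constant_ellipse(a, b, n_r=200, n_t=400):
    """Midpoint-rule evaluation of Eq. (14) using x = a r cos t, y = b r sin t."""
    dr, dt = 1.0 / n_r, 2.0 * math.pi / n_t
    total = 0.0
    for i in range(n_r):
        r = (i + 0.5) * dr
        for j in range(n_t):
            t = (j + 0.5) * dt
            x, y = a * r * math.cos(t), b * r * math.sin(t)
            phi_x, phi_y = warping_gradient(x, y, a, b)
            integrand = x * x + y * y + x * phi_y - y * phi_x
            total += integrand * a * b * r * dr * dt   # area element a b r dr dt
    return total

a, b = 2.0, 1.0
J_numeric = torsion_constant_ellipse(a, b)
J_closed_form = math.pi * a ** 3 * b ** 3 / (a * a + b * b)
J_polar = math.pi * a * b * (a * a + b * b) / 4.0   # polar moment of the ellipse
worst_residual = max(abs(boundary_residual(2.0 * math.pi * m / 50, a, b))
                     for m in range(50))
```

For $a = b$ the warping function vanishes and $J$ reduces to the polar moment $\pi a^4/2$; for $a \ne b$ the value from Eq. (14) is smaller than the polar moment of inertia, which is why the latter overestimates the torsional rigidity of a noncircular shaft.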

Thus, we see that the problem of torsion is reduced to the solution of 
Eqs. (8) and (10). The solution will yield a stress system <J zx ,a zy . If the 
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end sections of the shaft are free to warp, and if the stresses prescribed on 
the end sections are exactly the same as those given by the solution, then 
an exact solution is obtained, and the solution is unique (see Sec. 7.4). If 
the distribution of stresses acting on the end sections, while equipollent to 
a torque T, does not agree exactly with that given by Eq. (7), then only 
an approximate solution is obtained. According to a principle proposed 
by Saint-Venant, the error in the approximation is significant only in the 
neighborhood of the end section (see Secs. 10.11-10.13). 

Equation (8) is a potential equation; its solutions are called harmonic
functions. The same equation appears in hydrodynamics. The boundary
condition (10) is similar to that for the velocity potential in hydrodynamics
with prescribed velocity efflux over the boundary. In the hydrodynamics
problem, the condition for the existence of a solution $\varphi$ is that the total
flux of fluid across the boundary must vanish. Translated to our problem,
the condition for the existence of a solution $\varphi$ is that the integral of the
normal derivative of the function $\varphi$, calculated over the entire boundary C,
must vanish. This follows from the identity

(16)  $$\oint_C \frac{\partial \varphi}{\partial \nu}\,ds = \iint_R \operatorname{div}(\operatorname{grad} \varphi)\,dx\,dy = \iint_R \nabla^2 \varphi\,dx\,dy$$

and from the fact that $\nabla^2 \varphi = 0$. This condition is satisfied in our case by
Eq. (10), as can be easily shown. Therefore, our problem is reduced to the
solution of a potential problem (called Neumann's problem) subjected to
the boundary condition Eq. (10).
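
The statement that Eq. (10) meets the existence condition can be checked in
one line. The sketch below assumes the direction-cosine relations
cos(x, ν) = dy/ds and cos(y, ν) = −dx/ds, with the arc length s measured
counterclockwise along C; these are the relations numbered (21) later in
this section.

```latex
\oint_C \frac{\partial \varphi}{\partial \nu}\,ds
  = \oint_C \left[\, y\cos(x,\nu) - x\cos(y,\nu) \,\right] ds
  = \oint_C \left( y\,\frac{dy}{ds} + x\,\frac{dx}{ds} \right) ds
  = \oint_C d\!\left[ \tfrac{1}{2}\left(x^2 + y^2\right) \right] = 0 .
```

The integrand is an exact differential, so its integral around any closed
boundary curve vanishes, whatever the shape of the cross section.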

An alternate approach was proposed by Prandtl, who takes the stress
components as the principal unknowns. If we assume that only $\sigma_{xz}, \sigma_{yz}$
differ from zero, then all the equations of equilibrium (4) are satisfied if

(17)  $$\frac{\partial \sigma_{xz}}{\partial x} + \frac{\partial \sigma_{yz}}{\partial y} = 0.$$

Prandtl observes that this equation is identically satisfied if $\sigma_{xz}$ and $\sigma_{yz}$
are derived from a stress function $\psi(x, y)$ so that

(18)  $$\sigma_{xz} = \frac{\partial \psi}{\partial y}, \qquad \sigma_{yz} = -\frac{\partial \psi}{\partial x}.$$

This corresponds to the stream function in hydrodynamics, if $\sigma_{xz}$ and $\sigma_{yz}$
were identified with velocity components. Although $\psi$ can be arbitrary as
far as equilibrium conditions are concerned, the stress system (18) must
satisfy the boundary conditions (5) and (6), and the compatibility
conditions. From Eq. (7.3:6), we see that compatibility requires that (in the
absence of body force),

$$\nabla^2 \sigma_{xz} = 0, \qquad \nabla^2 \sigma_{yz} = 0,$$

where $\nabla^2$ denotes $\partial^2/\partial x^2 + \partial^2/\partial y^2$. Hence

(19)  $$\frac{\partial}{\partial y}\nabla^2 \psi = 0, \qquad \frac{\partial}{\partial x}\nabla^2 \psi = 0.$$

It follows that

(20)  $$\nabla^2 \psi = \text{const.}$$


Of the boundary conditions (5), only the last equation is not identically
satisfied. If we note from Fig. 7.5:2, that

(21)  $$\nu_x = \cos(\nu, x) = \frac{dy}{ds}, \qquad \nu_y = \cos(\nu, y) = -\frac{dx}{ds},$$

we can write the last of Eq. (5) as

(22)  $$\frac{\partial \psi}{\partial y}\frac{dy}{ds} + \frac{\partial \psi}{\partial x}\frac{dx}{ds} = \frac{d\psi}{ds} = 0, \quad \text{on } C.$$

Hence $\psi$ must be a constant along the boundary curve C. For a simply
connected region, no loss of generality is involved in setting

(23)  $$\psi = 0, \quad \text{on } C.$$


If the cross section occupies a region R that is multi-connected, additional 
conditions of compatibility must be imposed (see Sec. 4.7). 

It remains to examine the boundary conditions (6). The first, $\sigma_{zz} = 0$,
follows the starting assumption. The other conditions are stated in
Eqs. (11) and (12). Now,

$$\iint_R \sigma_{zx}\,dx\,dy = \iint_R \frac{\partial \psi}{\partial y}\,dx\,dy.$$

By Gauss' theorem, this is $\oint_C \psi\,\nu_y\,ds$, and it vanishes on account of Eq. (23).
Similarly, the resultant force in the y-direction vanishes. Thus, Eqs. (11)
are satisfied. Finally, Eq. (12) requires that

$$T = \iint_R (x\sigma_{zy} - y\sigma_{zx})\,dx\,dy = -\iint_R \left(x\frac{\partial \psi}{\partial x} + y\frac{\partial \psi}{\partial y}\right) dx\,dy,$$

which can be transformed by Gauss' theorem as follows:

(24)  $$T = -\iint_R \left[\frac{\partial (x\psi)}{\partial x} + \frac{\partial (y\psi)}{\partial y} - 2\psi\right] dx\,dy = -\oint_C \{x\psi\cos(\nu, x) + y\psi\cos(\nu, y)\}\,ds + \iint_R 2\psi\,dx\,dy.$$

If R is a simply connected region, the line integral vanishes by the boundary
condition (23). Hence,

(24a) $$T = 2\iint_R \psi\,dx\,dy.$$


Thus, all differential equations and boundary conditions concerning
stresses are satisfied if $\psi$ obeys Eqs. (20), (23), and (24). But there remains
an indeterminate constant in Eq. (20). This constant has to be determined
by boundary conditions on displacements. We have, from Eqs. (3) and (7),

(25)  $$\frac{\partial w}{\partial x} = \frac{\sigma_{zx}}{G} + \alpha y, \qquad \frac{\partial w}{\partial y} = \frac{\sigma_{zy}}{G} - \alpha x.$$

Differentiating with respect to y and x, respectively, and subtracting, we get

(26)  $$\frac{1}{G}\left(\frac{\partial \sigma_{zx}}{\partial y} - \frac{\partial \sigma_{zy}}{\partial x}\right) = -2\alpha.$$

Hence, a substitution from Eq. (18) gives

(27)  $$\frac{\partial^2 \psi}{\partial x^2} + \frac{\partial^2 \psi}{\partial y^2} = -2G\alpha.$$

In this way, the problem of torsion is reduced to the solution of the Poisson
Eq. (27) with boundary condition (23).

With either of the two approaches outlined above, the problem of torsion
is reduced to standard problems in the theory of potentials in two
dimensions. Such potential problems occur also in the theory of hydrodynamics,
gravitation, static electricity, steady flow of heat, etc. A great deal is known
about these potential problems; many special solutions have been worked
out in detail and general methods of solution are available. The most
powerful tool for potential theory in two dimensions comes from the theory of
functions of a complex variable. Since this branch of applied mathematics
is probably well-known to the readers in their study of other branches of
physics, we shall not elaborate further. In any case, a detailed account of
this beautiful field will require a book in its own name, and indeed many
excellent books exist (see, for example, Courant, 51 Courant and Hilbert, 101
Kellogg, 51 etc.) The complex variable method and the associated singular
integral equations approach are developed in the monumental works
of Muskhelishvili; 1,2 a shorter account is given by Sokolnikoff. 1,2 Other
methods of solution and detailed examples can be found in the classical
books 1,2 of Love, Sokolnikoff, Southwell, Timoshenko and Goodier, Trefftz,
Sechler, Green and Zerna. Goodier's article in Flügge's 1,4 Handbook of
Engineering Mechanics contains results for various cross sections commonly
used in engineering.


Example. Bars with Elliptical Cross Section

Let the boundary of the cross section (Fig. 7.5:3) be given by the
equation

$$\frac{x^2}{a^2} + \frac{y^2}{b^2} - 1 = 0.$$

Then Eqs. (27) and (23) are satisfied by

$$\psi = -\frac{a^2 b^2 G\alpha}{a^2 + b^2}\left(\frac{x^2}{a^2} + \frac{y^2}{b^2} - 1\right).$$

Equation (24) gives the relation between the torque and the rate of twist
$\alpha = d\theta/dz$,

$$T = \frac{\pi a^3 b^3 G}{a^2 + b^2}\,\frac{d\theta}{dz}.$$

The stresses are given by Eq. (18). Note that the curves $\psi(x, y)$ =
const. have an interesting meaning. The slope dy/dx of the tangent to
such a curve is determined by the formula

$$\frac{\partial \psi}{\partial x} + \frac{\partial \psi}{\partial y}\frac{dy}{dx} = 0.$$

Hence, according to Eq. (18), we have

(28)  $$\frac{dy}{dx} = \frac{\sigma_{zy}}{\sigma_{zx}}.$$

Thus, at each point of the curve $\psi(x, y)$ = const., the stress vector $(\sigma_{zx}, \sigma_{zy})$
is directed along the tangent to the curve. The curves $\psi(x, y)$ = const. are
called the lines of shearing stress. The magnitude of the tangential stress is

(29)  $$\tau = (\sigma_{zx}^2 + \sigma_{zy}^2)^{1/2} = \left[\left(\frac{\partial \psi}{\partial x}\right)^2 + \left(\frac{\partial \psi}{\partial y}\right)^2\right]^{1/2}.$$

Hence, $\tau$ is equal to the absolute value of the gradient of the surface z =
$\psi(x, y)$. The maximum stress occurs where the gradient is the largest.

Fig. 7.5:3. Lines of constant warping, according to Saint-Venant.

In the present example, the lines of shearing stress are concentric
ellipses. It is easy to see that the spacing of the $\psi$ = const. lines is closest
at the ends of the minor axis. The maximum shearing stress occurs there
and is given by Eq. (29) to be

$$\tau_{\max} = 2G\alpha\,\frac{a^2 b}{a^2 + b^2}.$$

A general theorem can be proved that the points at which the maximum
shearing stress occurs lie on the boundary curve of the cross section.

The warping function $\varphi(x, y)$ is easily shown to be

$$\varphi = -\frac{a^2 - b^2}{a^2 + b^2}\,xy.$$

Contour lines of constant displacement along the z-axis, $w = \alpha\varphi(x, y)$ =
const., are hyperbolas as shown in Fig. 7.5:3, which was taken from
Saint-Venant's original publication. The solid lines in the figure indicate where
w is positive, the dotted lines where w is negative.
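
The closed-form results of this example can be cross-checked numerically.
The following is a minimal sketch, not part of the original text: it
integrates the stress function over the ellipse with a plain midpoint rule
and compares T = 2∬ψ dx dy, Eq. (24a), with the torque formula above. The
function names, the grid size n, and the sample values of a, b, G, α are
illustrative assumptions.

```python
import math


def psi(x, y, a, b, G, alpha):
    """Prandtl stress function for the ellipse x^2/a^2 + y^2/b^2 = 1."""
    c = a * a * b * b * G * alpha / (a * a + b * b)
    return -c * (x * x / (a * a) + y * y / (b * b) - 1.0)


def shear_magnitude(x, y, a, b, G, alpha):
    """tau = |grad psi|, Eq. (29), from the analytic derivatives of psi."""
    c = a * a * b * b * G * alpha / (a * a + b * b)
    return math.hypot(2.0 * c * x / (a * a), 2.0 * c * y / (b * b))


def torque_numeric(a, b, G, alpha, n=400):
    """Midpoint-rule estimate of T = 2 * (integral of psi over the ellipse)."""
    hx, hy = 2.0 * a / n, 2.0 * b / n
    total = 0.0
    for i in range(n):
        x = -a + (i + 0.5) * hx
        for j in range(n):
            y = -b + (j + 0.5) * hy
            # keep only cells whose centre lies inside the boundary curve C
            if x * x / (a * a) + y * y / (b * b) <= 1.0:
                total += psi(x, y, a, b, G, alpha)
    return 2.0 * total * hx * hy


def torque_closed_form(a, b, G, alpha):
    """T = pi a^3 b^3 G alpha / (a^2 + b^2), as quoted in the example."""
    return math.pi * a ** 3 * b ** 3 * G * alpha / (a * a + b * b)


def tau_max_closed_form(a, b, G, alpha):
    """Maximum shear stress at the ends of the minor axis (assumes a >= b)."""
    return 2.0 * G * alpha * a * a * b / (a * a + b * b)


if __name__ == "__main__":
    a, b, G, alpha = 2.0, 1.0, 1.0, 0.01  # illustrative values only
    print(torque_numeric(a, b, G, alpha), torque_closed_form(a, b, G, alpha))
    print(shear_magnitude(0.0, b, a, b, G, alpha),
          tau_max_closed_form(a, b, G, alpha))
```

Because ψ vanishes on C, the staircase approximation of the boundary costs
little accuracy, so a modest grid already reproduces the closed form closely.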

7.6. SOAP FILM ANALOGY 

As remarked before, Eqs. (7.5:8) and (7.5:27) occur in many other
physical theories entirely unrelated to the torsion problem. For example, if we
consider a thin film of liquid, such as that of a soap bubble, we see that
the predominant force acting in the film is the surface tension T, which may
be considered to be constant. The equation of equilibrium of an element of
soap film is

$$\frac{T}{R_1} + \frac{T}{R_2} = p,$$

where $R_1, R_2$ are the principal radii of curvature and p is the pressure
per unit area normal to the film. The derivation of this equation is simple
and can be understood from elementary considerations as illustrated in
Fig. 7.6:1. Now if we take a tube whose cross-sectional shape is the same as
that of the shaft whose torsional property is questioned, cut a plane section,
and spread a soap film over it under a small pressure p, then, if the film
deflection is sufficiently small, the mean curvature of the film is given by
the sum of the second derivatives of the deflection surface. Thus,

$$T\left(\frac{\partial^2 w}{\partial x^2} + \frac{\partial^2 w}{\partial y^2}\right) = p, \qquad w = 0 \ \text{on boundary},$$

where w denotes the deflection of the film measured from the plane of the
cross section and x, y are a set of rectangular coordinates. These equations
are identical in form with Eqs. (7.5:27) and (7.5:23). Thus, we obtain Prandtl's
soap film analogy. The gradient of the soap film is proportional to the
shear stress in torsion. The volume under the film and above the cross
section is proportional to the total torque.

Fig. 7.6:1. Equilibrium of a soap film.
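
The two statements of the analogy (slope gives stress, volume gives torque)
can be tried out on a section with no simple closed-form solution. The
sketch below is an illustration under stated assumptions, not the book's
method: it solves Eq. (7.5:27) with ψ = 0 on the edge of a square by
Gauss-Seidel iteration on a five-point grid. The function names, grid size,
and sweep count are hypothetical choices; the reference coefficients 0.1406
and 0.675 used in the comments are the values commonly tabulated for a
square section in standard torsion tables.

```python
def solve_prandtl_square(side=1.0, G=1.0, alpha=1.0, n=24, sweeps=1500):
    """Gauss-Seidel solution of psi_xx + psi_yy = -2*G*alpha, psi = 0 on the edge.

    The square [0, side] x [0, side] carries an (n+1) x (n+1) grid.
    Returns the spacing h and the nodal values psi[i][j] (i along x, j along y).
    """
    h = side / n
    source = 2.0 * G * alpha * h * h
    psi = [[0.0] * (n + 1) for _ in range(n + 1)]
    for _ in range(sweeps):
        for i in range(1, n):
            for j in range(1, n):
                # five-point stencil solved for the centre value
                psi[i][j] = 0.25 * (psi[i - 1][j] + psi[i + 1][j]
                                    + psi[i][j - 1] + psi[i][j + 1] + source)
    return h, psi


def torque(h, psi):
    """T = 2 * (volume under the psi surface); trapezoidal rule, psi = 0 on edge."""
    return 2.0 * h * h * sum(sum(row) for row in psi)


def max_boundary_shear(h, psi):
    """Largest slope of psi normal to the edge y = 0.

    Uses the one-sided second-order difference (4*psi_1 - psi_2) / (2h),
    which is valid because psi vanishes on the edge itself.
    """
    n = len(psi) - 1
    return max((4.0 * psi[i][1] - psi[i][2]) / (2.0 * h) for i in range(1, n))


if __name__ == "__main__":
    h, psi = solve_prandtl_square()
    # For side = G = alpha = 1 the tabulated values are roughly
    # T = 0.1406 and tau_max = 0.675 (mid-side of the square).
    print("torque      :", torque(h, psi))
    print("max. shear  :", max_boundary_shear(h, psi))
```

The steepest slope appears at the middle of each side, which is where a
soap film over a square hole is visibly steepest.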

The value of an analogy lies in its power of suggestion. Most people can
visualize the shape of a soap film, perhaps because of their long experience
with it. Thus, with the soap film analogy it is very easy to explain the
stress concentration at a re-entrant corner in a cross section, such as the
points marked by P in Fig. 7.6:2.

Fig. 7.6:2. Re-entrant corners suggest stress concentration.

Economical and efficient use of materials to transmit forces is an
important objective in engineering; and the problem of avoiding the weakest
links (points where stress concentrations occur) is obviously of great
interest.


7.7. BENDING OF BEAMS 

When a shaft is used to transmit bending moments and transverse shear,
it is called a beam. Since beams are used in every engineering structure, the
theory of beams is of great importance. The long history of the development
of man's understanding of the action of the beam is a fascinating subject
well-recorded in Timoshenko's book.^{1.1} Modern investigation began with
Galileo, but it is again to the credit of Saint-Venant that the problem is
solved within the general theory of elasticity.

Consider first the pure bending of a prismatic beam (Fig. 7.7:1). Let the
beam be subjected to two equal and opposite couples M acting in one of its
principal planes.¹ Let the origin of the coordinates be taken at the centroid
of a cross section, and let the x, z-plane be the principal plane of bending.
The usual elementary theory of bending assumes that the stress components are

(1)   $$\sigma_{zz} = \frac{Ex}{R}, \qquad \sigma_{xx} = \sigma_{yy} = \sigma_{xy} = \sigma_{yz} = \sigma_{zx} = 0,$$

in which R is the radius of curvature of the beam after bending. It is easily
verified that the stress system (1) satisfies all the equations of equilibrium
(7.5:4) and compatibility (7.3:6). The boundary conditions on the lateral
surface of the beam are also satisfied. If the normal stress at the ends of
the beam is linearly distributed as in Eq. (1), and if the bending moment is

(2)   $$M = \iint \sigma_{zz}\,x\,dx\,dy = \iint \frac{Ex^2}{R}\,dx\,dy = \frac{EI}{R},$$

where I is the moment of inertia of the cross section of the beam with
respect to the neutral axis parallel to the y-axis, then every condition is
satisfied, and Eq. (1) gives an exact solution.

Fig. 7.7:1. Pure bending of a prismatic beam.

¹A principal plane is one that contains the principal axes of the moment of
area of the cross sections of the beam.

Consider next the same prismatic beam loaded by a lateral force P at
the end z = L, and clamped at the end z = 0 (Fig. 7.7:2). For a beam
so loaded, the resultant shear P has to be resisted by the shearing stresses
$\sigma_{zx}, \sigma_{zy}$. Hence, the stress system (1) will not suffice.




Fig. 7.7:2. Cantilever beam loaded at one end. 


Let the force P be parallel to one of the principal axes of the cross 
section of the beam. (An arbitrary force can be resolved into components 
parallel to the principal axes, and the action of each component may be 
considered separately.) Let P be parallel to the x-axis and let the x, z-plane
be a principal plane. Saint-Venant, using his semi-inverse method,
assumes that

(3)   $$\sigma_{zz} = -\frac{P(L - z)x}{I}, \qquad \sigma_{xx} = \sigma_{yy} = \sigma_{xy} = 0,$$

leaving $\sigma_{zx}, \sigma_{zy}$ undetermined. The first term is given by the elementary
theory, P(L − z) being the bending moment at section z. Now we must
see how the equilibrium, compatibility, and boundary conditions can be
satisfied. In the absence of body force, the equilibrium equations (7.5:4)
require that

(4)   $$\frac{\partial \sigma_{zx}}{\partial z} = 0, \qquad \frac{\partial \sigma_{yz}}{\partial z} = 0,$$

(5)   $$\frac{\partial \sigma_{xz}}{\partial x} + \frac{\partial \sigma_{yz}}{\partial y} + \frac{Px}{I} = 0.$$

The Beltrami-Michell compatibility conditions (7.3:6) require that

(6)   $$\nabla^2 \sigma_{yz} = 0, \qquad \nabla^2 \sigma_{xz} + \frac{1}{1 + \nu}\frac{P}{I} = 0,$$

where $\nabla^2 = \partial^2/\partial x^2 + \partial^2/\partial y^2$. The stress-free boundary condition on the
lateral surface requires that [Eq. (7.5:5)]

(7)   $$\sigma_{zx}\cos(\nu, x) + \sigma_{zy}\cos(\nu, y) = 0, \quad \text{on } C.$$

The end condition at z = L requires that

(8)   $$\sigma_{zz} = 0, \qquad \iint \sigma_{zy}\,dx\,dy = 0,$$

(9)   $$\iint \sigma_{zx}\,dx\,dy = P.$$

The end condition at z = 0 is concerned with the conditions of clamping,
and it is usually stated in the form

(10)  $$u = \frac{\partial u}{\partial z} = 0 \quad \text{at } x = y = z = 0.$$

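The method of solving Eqs. (4) through (9) is similar to that of Sec. 7.5.
Equations (4) imply that both $\sigma_{xz}$ and $\sigma_{yz}$ are independent of z.
Equation (5) may be written as

(11)  $$\frac{\partial}{\partial x}\left[\sigma_{xz} + \frac{Px^2}{2I} - f(y)\right] + \frac{\partial \sigma_{yz}}{\partial y} = 0,$$

where f(y) is a function of y only. Equation (11) can be satisfied identically
if the stresses $\sigma_{xz}, \sigma_{yz}$ are derived from a stress function $\psi(x, y)$ such that

(12)  $$\sigma_{xz} = \frac{\partial \psi}{\partial y} - \frac{Px^2}{2I} + f(y), \qquad \sigma_{yz} = -\frac{\partial \psi}{\partial x}.$$

Equations (6) imply that

(13)  $$\frac{\partial}{\partial x}(\nabla^2 \psi) = 0, \qquad \frac{\partial}{\partial y}(\nabla^2 \psi) = \frac{\nu}{1 + \nu}\frac{P}{I} - \frac{d^2 f}{dy^2}.$$

Hence,

(14)  $$\frac{\partial^2 \psi}{\partial x^2} + \frac{\partial^2 \psi}{\partial y^2} = \frac{\nu}{1 + \nu}\frac{Py}{I} - \frac{df}{dy} + c.$$

The integration constant c has a very simple physical meaning. Consider
the rotation $\omega$ of an element of area in the plane of a cross section,

(15)  $$\omega = \frac{1}{2}\left(\frac{\partial v}{\partial x} - \frac{\partial u}{\partial y}\right).$$

The rate of change of this rotation in the z-axis direction is

(16)  $$\frac{\partial \omega}{\partial z} = \frac{\partial e_{yz}}{\partial x} - \frac{\partial e_{xz}}{\partial y} = \frac{1}{2G}\left(\frac{\partial \sigma_{yz}}{\partial x} - \frac{\partial \sigma_{xz}}{\partial y}\right) = -\frac{1}{2G}\left(\frac{\partial^2 \psi}{\partial x^2} + \frac{\partial^2 \psi}{\partial y^2} + \frac{df}{dy}\right).$$

In deriving the last line, Hooke's law and Eq. (12) are used. Hence, Eq. (14)
leads to

(17)  $$-2G\frac{\partial \omega}{\partial z} = \frac{\nu}{1 + \nu}\frac{Py}{I} + c.$$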

This shows that c represents a constant rate of rotation, i.e., it corresponds 
to a rigid-body rotation of a cross section (the same kind as in the torsion 
problem). It can be shown that by a proper shifting of the load P parallel 
to itself in the plane z = L, the torsional deformation can be eliminated so 
that c = 0. (This leads to the concept of shear center, the point through 
which P must act so that c = 0.) In the following discussion we shall 
assume that P acts through the shear center. The more general problem 
can be solved obviously by a linear superposition of the solutions of bending 
and of torsion. 

On setting c = 0, Eq. (14) becomes

(18)  $$\frac{\partial^2 \psi}{\partial x^2} + \frac{\partial^2 \psi}{\partial y^2} = \frac{\nu}{1 + \nu}\frac{Py}{I} - \frac{df}{dy}.$$

The boundary condition (7) now requires that [see Fig. 7.5:2 and
Eqs. (7.5:21)]

(19)  $$\frac{\partial \psi}{\partial y}\frac{dy}{ds} + \frac{\partial \psi}{\partial x}\frac{dx}{ds} = \frac{d\psi}{ds} = \left[\frac{Px^2}{2I} - f(y)\right]\frac{dy}{ds}.$$

From these equations, the value of $\psi$ can be determined up to an integration
constant which does not contribute anything to the stress system. The
function f(y) is arbitrary; it was introduced by Timoshenko to simplify the
solution in case the boundary curve of the cross section can be written in
the form

(20)  $$C: \quad \frac{Px^2}{2I} - f(y) = 0.$$

This would be the case, for example, if C is a circle or an ellipse. In such a
case, we choose f(y) according to Eq. (20). Then the boundary condition
may be written as

(21)  $$\psi = 0 \quad \text{on } C.$$

It remains to show that the load at the end z = L is equipollent to a
shear P, i.e., that Eqs. (8) and (9) are satisfied. This is easily done. For
example, using Eqs. (5) and (7), we have

$$\iint \sigma_{xz}\,dx\,dy = \iint \left[\sigma_{xz} + x\left(\frac{\partial \sigma_{xz}}{\partial x} + \frac{\partial \sigma_{yz}}{\partial y}\right) + \frac{Px^2}{I}\right] dx\,dy = \oint_C x\,[\sigma_{xz}\cos(\nu, x) + \sigma_{yz}\cos(\nu, y)]\,ds + P = P,$$

$$\iint \sigma_{yz}\,dx\,dy = \iint \left[\sigma_{yz} + y\left(\frac{\partial \sigma_{xz}}{\partial x} + \frac{\partial \sigma_{yz}}{\partial y}\right) + \frac{Pxy}{I}\right] dx\,dy = \oint_C y\,[\sigma_{xz}\cos(\nu, x) + \sigma_{yz}\cos(\nu, y)]\,ds = 0,$$

since $\iint xy\,dx\,dy$ vanishes because the x-axis is assumed to be a principal
axis.

Thus, all the equations are satisfied if a solution $\psi(x, y)$ is found from
Eqs. (18) and (19) or (21). This reduces the beam problem to a standard
problem in two-dimensional potential theory.

If a solution is obtained and the moment of the shearing stresses is
computed and set equal to Pe,

(22)  $$\iint (x\sigma_{yz} - y\sigma_{xz})\,dx\,dy = Pe,$$

the constant e will give the location of the shear center. The applied load
must have an eccentricity e in order to obtain a bending without twisting
(see discussion about the constant c above).


Example. Beam of Circular Cross Section of Radius r

The boundary curve C is given by the equation

$$x^2 + y^2 = r^2.$$

Hence, we take

$$f(y) = \frac{P}{2I}(r^2 - y^2).$$

Equation (18) becomes

$$\frac{\partial^2 \psi}{\partial x^2} + \frac{\partial^2 \psi}{\partial y^2} = \frac{1 + 2\nu}{1 + \nu}\frac{Py}{I}.$$

It is easily verified that the solution that satisfies the boundary condition
(21) is

$$\psi = \frac{(1 + 2\nu)P}{8(1 + \nu)I}\,(x^2 + y^2 - r^2)\,y.$$

The stress components are

$$\sigma_{xz} = \frac{(3 + 2\nu)}{8(1 + \nu)}\frac{P}{I}\left(r^2 - x^2 - \frac{1 - 2\nu}{3 + 2\nu}\,y^2\right),$$

$$\sigma_{yz} = -\frac{(1 + 2\nu)}{4(1 + \nu)}\frac{Pxy}{I}.$$
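
A short numerical check makes the end conditions of this example concrete.
The sketch below is an illustration, not part of the original solution: it
integrates the two stress components over the circular section in polar
coordinates and evaluates the traction on the lateral surface, which
Eqs. (9), (8), and (7) require to be P, 0, and 0 respectively. The function
names, quadrature sizes, and sample values of P, r, ν are assumptions.

```python
import math


def shear_stresses(x, y, P, r, nu):
    """sigma_xz and sigma_yz for the circular cantilever, as quoted above."""
    I = math.pi * r ** 4 / 4.0  # moment of inertia of a circle about a diameter
    k = P / I
    s_xz = ((3.0 + 2.0 * nu) / (8.0 * (1.0 + nu)) * k
            * (r * r - x * x - (1.0 - 2.0 * nu) / (3.0 + 2.0 * nu) * y * y))
    s_yz = -(1.0 + 2.0 * nu) / (4.0 * (1.0 + nu)) * k * x * y
    return s_xz, s_yz


def shear_resultants(P, r, nu, n_r=200, n_t=64):
    """Polar midpoint-rule integrals of sigma_xz and sigma_yz over the section."""
    dr, dt = r / n_r, 2.0 * math.pi / n_t
    fx = fy = 0.0
    for i in range(n_r):
        rho = (i + 0.5) * dr
        for j in range(n_t):
            th = (j + 0.5) * dt
            s_xz, s_yz = shear_stresses(rho * math.cos(th), rho * math.sin(th),
                                        P, r, nu)
            dA = rho * dr * dt
            fx += s_xz * dA
            fy += s_yz * dA
    return fx, fy


def lateral_traction(theta, P, r, nu):
    """sigma_xz cos(nu,x) + sigma_yz cos(nu,y) on the boundary; Eq. (7) wants 0."""
    s_xz, s_yz = shear_stresses(r * math.cos(theta), r * math.sin(theta), P, r, nu)
    return s_xz * math.cos(theta) + s_yz * math.sin(theta)


if __name__ == "__main__":
    P, r, nu = 1000.0, 0.05, 0.3  # illustrative values only
    print(shear_resultants(P, r, nu))
    print(lateral_traction(0.7, P, r, nu))
    # ratio of the stress at the centre to the average shear stress P / area
    print(shear_stresses(0.0, 0.0, P, r, nu)[0] / (P / (math.pi * r * r)))
```

The last printed ratio equals (3 + 2ν)/[2(1 + ν)], which shows how the
central shear stress depends on Poisson's ratio in this solution.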


A large number of examples can be found in the books of Love, 11 
Sokolnikoff, 1 ' 2 Timoshenko, 1 2 etc. An extensive discussion of the shear 
center can be found in Sechler’s and Sokolnikoff’s books. 1 2 There are sev¬ 
eral possible definitions of shear center and center of twist [Trefftz (1935), 
Goodier (1944) and Weinstein (1947)]; an outline and comparison of vari¬ 
ous definitions can be found in Fung’s Aeroelasticity 10 5 (1957) pp. 471-475. 
See Bibliography 7.2, p. 881. 


7 . 8 . PLANE ELASTIC WAVES 

As a further illustration of the theory of elasticity, let us consider some
simple types of waves in an elastic medium. The displacement components
$u_1, u_2, u_3$ (or in unabridged notations, u, v, w) will be assumed to be
infinitesimal so that all equations are linearized. The basic field equation, in
the absence of body force, is Navier's Eq. (7.1:9)

(1)   $$\rho\frac{\partial^2 u_i}{\partial t^2} = G u_{i,jj} + (\lambda + G)u_{j,ji}.$$

We shall first verify that the motion

(2)   $$u = A\sin\frac{2\pi}{l}(x \pm ct), \qquad v = w = 0,$$

where A, l, c are constants, is possible if c assumes the special value $c_L$,

(3) ▲ $$c_L = \sqrt{\frac{\lambda + 2G}{\rho}} = \sqrt{\frac{E(1 - \nu)}{(1 + \nu)(1 - 2\nu)\rho}}.$$

This can be verified at once by substituting Eqs. (2) into (1). The pattern of
motion expressed by Eqs. (2) is unchanged when $x \pm c_L t$ remains constant.
Hence, if the negative sign were taken, the pattern would move to the right
with a velocity $c_L$ as the time t increases. The constant $c_L$ is called the
phase velocity of the wave motion. In Eqs. (2), l is the wave length, as can
be seen from the sinusoidal pattern of u as a function of x, at any instant of
time. The particle velocity represented by Eqs. (2) is in the same direction
as that of the wave propagation (namely, the x-axis). Such a motion is said
to constitute a train of longitudinal waves. Since at any instant of time
the wave crests lie in parallel planes, the motion represented by Eq. (2) is
called a train of plane waves.

Next, let us consider the motion

(4)   $$u = 0, \qquad v = A\sin\frac{2\pi}{l}(x \pm ct), \qquad w = 0,$$

which represents a train of plane waves of wave length l propagating in the
x-axis direction with a phase velocity c. When Eqs. (4) are substituted into
Eqs. (1), it is seen that c must assume the value $c_T$,

(5) ▲ $$c_T = \sqrt{\frac{G}{\rho}}.$$

The particle velocity (in the y-direction) represented by Eqs. (4) is
perpendicular to the direction of wave propagation (x-direction). Hence, it is said
to be a transverse wave. The speeds $c_L$ and $c_T$ are called the characteristic
longitudinal wave speed and transverse wave speed, respectively. They
depend on the elastic constants and the density of the material. The ratio
$c_T/c_L$ depends on Poisson's ratio only,

(6) ▲ $$c_T = c_L\sqrt{\frac{1 - 2\nu}{2(1 - \nu)}}.$$

If $\nu = 0.25$, then $c_L = \sqrt{3}\,c_T$.

Similar to Eqs. (4), the following example represents a transverse wave
in which the particles move in the z-axis direction.

(7)   $$u = 0, \qquad v = 0, \qquad w = A\sin\frac{2\pi}{l}(x \pm c_T t).$$

The plane parallel to which the particles move [such as the x, y-plane in
Eqs. (4), or the x, z-plane in Eqs. (7)], is called the plane of polarization.

Table 6.2:1, Sec. 6.2, gives a brief list of the longitudinal wave velocities
of some common media. It is very interesting to see that most metals and
alloys have approximately the same wave velocities.

Plane waves as described above may exist only in an unbounded elastic
continuum. In a finite body, a plane wave will be reflected when it
hits a boundary. If there is another elastic medium beyond the boundary,
refracted waves occur in the second medium. The features of reflection
and refraction are similar to those in acoustics and optics; the main
difference is that, in general, an incident longitudinal wave will be reflected
and refracted in a combination of longitudinal and transverse waves, and
an incident transverse wave will also be reflected in a combination of both
types of waves. The details can be worked out by a proper combination of
these waves so that the boundary conditions are satisfied. See Sec. 8.14.
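
Equations (3), (5), and (6) of this section translate directly into a few
lines of code. The sketch below is illustrative only: the function names are
hypothetical, the standard isotropic relations between (E, ν) and (λ, G)
are assumed, and the sample material constants are round, typical values
for a steel rather than data from this text.

```python
import math


def lame_constants(E, nu):
    """Lame constants (lambda, G) from Young's modulus and Poisson's ratio."""
    lam = E * nu / ((1.0 + nu) * (1.0 - 2.0 * nu))
    G = E / (2.0 * (1.0 + nu))
    return lam, G


def wave_speeds(E, nu, rho):
    """Return (c_L, c_T) from Eqs. (3) and (5)."""
    lam, G = lame_constants(E, nu)
    c_L = math.sqrt((lam + 2.0 * G) / rho)
    c_T = math.sqrt(G / rho)
    return c_L, c_T


def speed_ratio(nu):
    """c_T / c_L from Eq. (6); depends on Poisson's ratio only."""
    return math.sqrt((1.0 - 2.0 * nu) / (2.0 * (1.0 - nu)))


if __name__ == "__main__":
    # Assumed illustrative constants (SI units): E in Pa, rho in kg/m^3.
    E, nu, rho = 200.0e9, 0.30, 7850.0
    c_L, c_T = wave_speeds(E, nu, rho)
    print("c_L [m/s]:", c_L)
    print("c_T [m/s]:", c_T)
    print("c_T/c_L  :", c_T / c_L, "vs Eq. (6):", speed_ratio(nu))
```

Since the ratio depends on ν alone, changing E or ρ in the example rescales
both speeds by the same factor.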

7.9. RAYLEIGH SURFACE WAVE 

In an elastic body, it is possible to have another type of wave, which 
is propagated over the surface and which penetrates only a little into the 
interior of the body. These waves are similar to waves produced on a smooth 
surface of water when a stone is thrown into it. They are called surface 
waves. The simplest is the Rayleigh wave that occurs on the free surface 
of a homogeneous, isotropic, semi-infinite solid. It is an important type of 
wave because the largest disturbances caused by an earthquake recorded 
on a distant seismogram are usually those of Rayleigh waves. 

The criterion for surface waves is that the amplitude of the displacement 
in the medium diminishes exponentially with increasing distance from the 
boundary. 

Let us demonstrate the existence of Rayleigh waves in the simple
two-dimensional case. Consider an elastic half-space y > 0. The surface y = 0
is stress-free. Let us consider displacements represented by the real part of
the following expressions:

(1)   $$u = Ae^{-by}\exp[ik(x - ct)], \qquad v = Be^{-by}\exp[ik(x - ct)], \qquad w = 0,$$

where i is the imaginary unit $\sqrt{-1}$, k is the wave number (a constant),
and A and B are complex constants. The coefficient b is supposed to be
real and positive so that the amplitude of the waves decreases exponentially
with increasing y, and tends to zero as $y \to \infty$.

We would first see if the displacements given by Eq. (1) can satisfy
the equations of motion, which, in view of the definitions of $c_L$ and $c_T$ by
Eqs. (7.8:3) and (7.8:5), can be written as

(2)   $$\frac{\partial^2 u_i}{\partial t^2} = c_T^2 u_{i,jj} + (c_L^2 - c_T^2)u_{j,ji}.$$

Substituting Eqs. (1) into (2), cancelling the common exponential factor
and rearranging terms, we obtain the equations

(3)   $$[c_T^2 b^2 + (c^2 - c_L^2)k^2]A - i(c_L^2 - c_T^2)bkB = 0,$$
      $$-i(c_L^2 - c_T^2)bkA + [c_L^2 b^2 + (c^2 - c_T^2)k^2]B = 0.$$

The condition for the existence of a nontrivial solution is the vanishing of
the determinant of the coefficients, which may be written in the form

(4)   $$[c_L^2 b^2 - (c_L^2 - c^2)k^2][c_T^2 b^2 - (c_T^2 - c^2)k^2] = 0.$$

This gives the following roots for b.

(5)   $$b' = k\left(1 - \frac{c^2}{c_L^2}\right)^{1/2}, \qquad b'' = k\left(1 - \frac{c^2}{c_T^2}\right)^{1/2}.$$

The assumption that b is real requires that $c < c_T < c_L$. Corresponding to
b' and b'', respectively, the ratio B/A can be solved from Eq. (3).

(6)   $$\left(\frac{B}{A}\right)' = -\frac{b'}{ik}, \qquad \left(\frac{B}{A}\right)'' = \frac{ik}{b''}.$$

Hence, a general solution of the type (1), satisfying the equations of motion,
may be written as

(7)   $$u = A'e^{-b'y}\exp[ik(x - ct)] + A''e^{-b''y}\exp[ik(x - ct)],$$
      $$v = -\frac{b'}{ik}A'e^{-b'y}\exp[ik(x - ct)] + \frac{ik}{b''}A''e^{-b''y}\exp[ik(x - ct)],$$
      $$w = 0.$$

Now, we would like to show that the constants A', A'', k, and c can be
so chosen that the boundary conditions on the free surface y = 0 can be
satisfied:

(8)   $$\sigma_{yx} = \sigma_{yy} = \sigma_{yz} = 0, \quad \text{on } y = 0.$$

By Hooke's law, and in view of Eq. (7), conditions (8) are equivalent to

(9)   $$\frac{\partial u}{\partial y} + \frac{\partial v}{\partial x} = 0, \quad \text{on } y = 0,$$
      $$\lambda\left(\frac{\partial u}{\partial x} + \frac{\partial v}{\partial y}\right) + 2G\frac{\partial v}{\partial y} = 0, \quad \text{on } y = 0.$$

On substituting Eqs. (7) into Eq. (9), setting y = 0, omitting the common
factor exp[ik(x − ct)], and writing

$$G = \rho c_T^2, \qquad \lambda = \rho(c_L^2 - 2c_T^2),$$

we obtain the results

(10)  $$-2b'A' - \left(b'' + \frac{k^2}{b''}\right)A'' = 0,$$
      $$\left[(c_L^2 - 2c_T^2) - c_L^2\frac{b'^2}{k^2}\right]A' - 2c_T^2 A'' = 0.$$

This can be written more symmetrically, by Eqs. (5), as

(11)  $$2b'A' + \left(2 - \frac{c^2}{c_T^2}\right)k^2\frac{A''}{b''} = 0,$$
      $$\left(2 - \frac{c^2}{c_T^2}\right)A' + 2b''\frac{A''}{b''} = 0.$$

For a nontrivial solution, the determinant of the coefficients of A', A''/b'' must
vanish, yielding the characteristic equation for c

(12)  $$\left(2 - \frac{c^2}{c_T^2}\right)^2 = 4\left(1 - \frac{c^2}{c_L^2}\right)^{1/2}\left(1 - \frac{c^2}{c_T^2}\right)^{1/2}.$$

The quantity $c^2/c_T^2$ can be factored out after rationalization, and
Eq. (12), called the Rayleigh equation, takes the form

(13)  $$\frac{c^2}{c_T^2}\left[\frac{c^6}{c_T^6} - 8\frac{c^4}{c_T^4} + c^2\left(\frac{24}{c_T^2} - \frac{16}{c_L^2}\right) - 16\left(1 - \frac{c_T^2}{c_L^2}\right)\right] = 0.$$

If c = 0, Eqs. (7) are independent of time, and from Eq. (11) we have A'' =
−A' and u = v = 0. Hence, this solution is of no interest. The second factor
in Eq. (13) is negative for c = 0, $c_T < c_L$, and is positive for $c = c_T$. There is
always a root c of Eq. (13) in the range $(0, c_T)$. Hence, surface waves can
exist with a speed less than $c_T$.

For an incompressible solid $c_L \to \infty$, Eq. (13) becomes

(14)  $$\frac{c^6}{c_T^6} - 8\frac{c^4}{c_T^4} + 24\frac{c^2}{c_T^2} - 16 = 0.$$

This cubic equation in $c^2$ has a real root at $c^2 = 0.91275c_T^2$, corresponding
to surface waves with speed $c \approx 0.95538c_T$. The other two roots for this
case are complex and do not represent surface waves.



Fig. 7.9:1. Schematic drawing for Rayleigh surface waves.

If the Poisson ratio is 1/4, so that $\lambda = G$ and $c_L = \sqrt{3}\,c_T$, Eq. (13)
becomes

(15)  $$\frac{c^6}{c_T^6} - 8\frac{c^4}{c_T^4} + \frac{56}{3}\frac{c^2}{c_T^2} - \frac{32}{3} = 0.$$

This equation has three real roots: $c^2/c_T^2 = 4,\ 2 + 2/\sqrt{3},\ 2 - 2/\sqrt{3}$. The last
root alone can satisfy the condition that b' and b'' be real for surface waves.
The other two roots correspond to complex b' and b'' and do not represent
surface waves. In fact, these extraneous roots do not satisfy Eq. (12); they
arise from the rationalization process of squaring. The last root corresponds
to the velocity

(16)  $$c_R = 0.9194\,c_T,$$

which corresponds, in turn, to the displacements

(17)  $$u = A'(e^{-0.8475ky} - 0.5773e^{-0.3933ky})\cos k(x - c_R t),$$
      $$v = A'(-0.8475e^{-0.8475ky} + 1.4679e^{-0.3933ky})\sin k(x - c_R t),$$

where A' is taken to be a real number, which may depend on k. From
Eq. (17), it is seen that the particle motion for Rayleigh waves is elliptical
retrograde in contrast to the elliptical direct orbit for surface waves on water
(see Fig. 7.9:1). The vertical displacement is about 1.5 times the horizontal
displacement at the surface. Horizontal motion vanishes at a depth of 0.192
of a wavelength and reverses sign below this.

Figure 7.9:2 shows Knopoff's calculated results for the ratios $c_R/c_T$,
$c_R/c_L$ for Rayleigh waves as functions of Poisson's ratio $\nu$.

Fig. 7.9:2. Ratios $c_R/c_L$, $c_R/c_T$, and $c_T/c_L$, where $c_R$ = Rayleigh wave speed, $c_L$ =
the longitudinal wave speed, $c_T$ = the transverse wave speed. (Both panels are
plotted against Poisson's ratio $\nu$ from 0 to 0.5.)
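
Curves of the kind shown in Fig. 7.9:2 can be regenerated from the Rayleigh
equation itself. The following is a minimal sketch, not the procedure used
for the figure: it finds the root of the second factor of Eq. (13) in
0 < c < c_T by bisection, using the sign change noted in the text, and then
evaluates the decay rates of Eqs. (5) and the amplitude ratio from the
second of Eqs. (11). All function names are hypothetical.

```python
import math


def kappa(nu):
    """(c_T / c_L)^2 as a function of Poisson's ratio, from Eq. (7.8:6)."""
    return (1.0 - 2.0 * nu) / (2.0 * (1.0 - nu))


def rayleigh_ratio(nu):
    """c_R / c_T: root of the bracketed factor of Eq. (13) with 0 < c < c_T."""
    kap = kappa(nu)

    def g(xi):  # xi stands for c^2 / c_T^2
        return xi ** 3 - 8.0 * xi ** 2 + (24.0 - 16.0 * kap) * xi - 16.0 * (1.0 - kap)

    lo, hi = 0.0, 1.0  # g(0) < 0 and g(1) = 1 > 0, so a root is bracketed
    for _ in range(60):
        mid = 0.5 * (lo + hi)
        if g(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return math.sqrt(0.5 * (lo + hi))


def surface_profile(nu):
    """Return (b'/k, b''/k, A''/A') for the Rayleigh wave at this nu."""
    xi = rayleigh_ratio(nu) ** 2
    b1 = math.sqrt(1.0 - xi * kappa(nu))  # b'/k, first of Eqs. (5)
    b2 = math.sqrt(1.0 - xi)              # b''/k, second of Eqs. (5)
    ratio = -(2.0 - xi) / 2.0             # from (2 - xi) A' + 2 A'' = 0
    return b1, b2, ratio


def depth_of_zero_horizontal_motion(nu):
    """Depth, in wavelengths, at which the horizontal amplitude changes sign."""
    b1, b2, ratio = surface_profile(nu)
    # solve exp(-b1*ky) + ratio*exp(-b2*ky) = 0 for ky
    ky = math.log(-1.0 / ratio) / (b1 - b2)
    return ky / (2.0 * math.pi)


if __name__ == "__main__":
    for nu in (0.0, 0.1, 0.2, 0.25, 0.3, 0.4, 0.5):
        cr_ct = rayleigh_ratio(nu)
        print(nu, cr_ct, cr_ct * math.sqrt(kappa(nu)))  # c_R/c_T and c_R/c_L
    print(surface_profile(0.25), depth_of_zero_horizontal_motion(0.25))
```

Working with the rationalized cubic rather than Eq. (12) avoids square roots
of negative numbers during the search, and restricting the bracket to
(0, c_T) excludes the extraneous roots mentioned in the text.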


7.10. LOVE WAVE 

In the Rayleigh waves examined in the previous section the material
particles move in the plane of propagation. Thus, in Rayleigh waves over
the half-space y > 0 along the surface y = 0, propagating in the
x-direction, the z-component of displacement w vanishes. It may be
shown that a surface wave with displacements perpendicular to the
direction of propagation (the so-called SH wave) is impossible in a
homogeneous half-space. However, SH surface waves are observed as
prominently on the Earth's surface as other surface waves. Love showed
that a theory sufficient to include SH surface waves can be constructed
by having a homogeneous layer of a medium $M_1$ of uniform thickness $H_1$,
overlying a homogeneous half-space of another medium M.

Fig. 7.10:1. A layered half-space.

Using axes as in Fig. 7.10:1, we take u = v = 0, and

(1)   $$w = A\exp\left\{-k\left[1 - \left(\frac{c}{c_T}\right)^2\right]^{1/2} y\right\}\exp[ik(x - ct)]$$

in M, and

(2)   $$w = \left\{A_1\exp\left[-k\left(1 - \left(\frac{c}{c_T^{(1)}}\right)^2\right)^{1/2} y\right] + A_1'\exp\left[k\left(1 - \left(\frac{c}{c_T^{(1)}}\right)^2\right)^{1/2} y\right]\right\}\exp[ik(x - ct)]$$

in $M_1$. It is easily verified that these equations satisfy Navier's
equations. If $c < c_T$, then $w \to 0$ as $y \to \infty$, as desired.

The boundary conditions are that w and $\sigma_{zy}$ must be continuous across
the surface y = 0, and $\sigma_{zy}$ zero at $y = -H_1$. On applying these conditions
to Eqs. (1) and (2), we obtain

(3)   $$A = A_1 + A_1',$$

(4)   $$GA[1 - (c/c_T)^2]^{1/2} = G_1(A_1 - A_1')[1 - (c/c_T^{(1)})^2]^{1/2},$$

(5)   $$A_1\exp\{kH_1[1 - (c/c_T^{(1)})^2]^{1/2}\} = A_1'\exp\{-kH_1[1 - (c/c_T^{(1)})^2]^{1/2}\}.$$

Eliminating A from Eqs. (3) and (4), and then using Eq. (5) to eliminate
$A_1$ and $A_1'$, we have

$$\frac{G[1 - (c/c_T)^2]^{1/2}}{G_1[1 - (c/c_T^{(1)})^2]^{1/2}} = \frac{A_1 - A_1'}{A_1 + A_1'} = -\tanh\{kH_1[1 - (c/c_T^{(1)})^2]^{1/2}\}.$$

Hence, we have

(6)   $$G[1 - (c/c_T)^2]^{1/2} - G_1[(c/c_T^{(1)})^2 - 1]^{1/2}\tan\{kH_1[(c/c_T^{(1)})^2 - 1]^{1/2}\} = 0$$

as the equation to give the SH surface wave velocity c in the present
conditions.

If $c_T^{(1)} < c_T$, Eq. (6) yields a real value of c which lies in the range
$c_T^{(1)} < c < c_T$ and depends on k and $H_1$ (as well as on G, $G_1$, $c_T$, and
$c_T^{(1)}$), because for c in this range the values of the left-hand-side terms in
Eq. (6) are real and opposite in sign. Thus, SH surface waves can occur
under the stated boundary conditions, provided the shear velocity $c_T^{(1)}$ in
the upper layer is less than that in the medium M. These waves are called
Love waves.

Love waves of general shape may be derived by superposing harmonic 
Love waves of the type (2) with different k. The dependence of the wave 
speed c on the wave number k introduces a dispersion phenomenon which 
will be considered later. 
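
The dependence of c on k can be made visible by solving the Love-wave
frequency equation numerically. The sketch below is illustrative and rests
on an assumption: it takes that equation in the standard form
G(1 − c²/c_T²)^{1/2} = G₁(c²/c₁² − 1)^{1/2} tan[kH₁(c²/c₁² − 1)^{1/2}], with c₁
the shear speed in the layer, and finds only the lowest branch by bisection.
The function names and the sample constants are hypothetical.

```python
import math


def love_residual(c, kH, G, G1, cT, cT1):
    """Left side minus right side of the assumed Love-wave frequency equation."""
    s = math.sqrt(max(0.0, 1.0 - (c / cT) ** 2))     # decay factor in half-space
    s1 = math.sqrt(max(0.0, (c / cT1) ** 2 - 1.0))   # oscillation factor in layer
    return G * s - G1 * s1 * math.tan(kH * s1)


def love_wave_speed(kH, G, G1, cT, cT1):
    """Lowest-mode phase speed c, with cT1 < c < cT, for given k*H1."""
    if not cT1 < cT:
        raise ValueError("Love waves need a slower layer: cT1 < cT")
    # Upper end of the search: either c = cT, or the speed at which the
    # tangent argument reaches (just under) pi/2, whichever comes first.
    s1_max = math.sqrt((cT / cT1) ** 2 - 1.0)
    s1_cap = 0.5 * math.pi * (1.0 - 1e-9) / kH
    s1_hi = min(s1_max, s1_cap)
    lo, hi = cT1, cT1 * math.sqrt(1.0 + s1_hi ** 2)
    # The residual is positive at lo and decreases monotonically toward hi.
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if love_residual(mid, kH, G, G1, cT, cT1) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    G, G1, cT, cT1 = 2.0, 1.0, 2.0, 1.0  # illustrative, nondimensional values
    for kH in (0.1, 0.5, 1.0, 2.0, 5.0, 10.0):
        print(kH, love_wave_speed(kH, G, G1, cT, cT1))
```

In this sketch long waves (small kH₁) travel at nearly the half-space speed
c_T and short waves at nearly the layer speed, which is the dispersion
referred to above.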


PROBLEMS 

7.2. Derive Navier’s equation in spherical polar coordinates. 

7.3. From data given in various handbooks, determine the longitudinal and
shear wave speeds in the following materials:

(a) Gases: air at sea level, and at 100,000 ft altitude.

(b) Metals: iron, a carbon steel, a stainless steel, copper, bronze, brass, nickel,
aluminium, an aluminium alloy, titanium, titanium carbide, beryllium,
beryllium oxide.

(c) Rocks and soils: a granite, a sandy loam. 

(d) Wood: spruce, mahogany, balsa. 

(e) Plastics: lucite, a foam rubber. 


7.4. Sketch the instantaneous wave surface, particle velocities, and particle 
paths of a Love wave. 

7.5. Investigate plane wave propagations in an anisotropic elastic material.
Apply the results to a cubic crystal. Note:

$$\rho\frac{\partial^2 u_i}{\partial t^2} = C_{ijkl}\frac{\partial^2 u_l}{\partial x_j\,\partial x_k}, \qquad u_i = A_i e^{-i(\omega t - k_j x_j)},$$

where $\mathbf{k}(k_1, k_2, k_3)$ is the wave number vector normal to the wave front.

7.6. Determine the stress field in a rotating, gravitating sphere of uniform
density.


SOLUTIONS OF PROBLEMS IN LINEARIZED THEORY
OF ELASTICITY BY POTENTIALS


In this chapter we shall consider the use of the method of potentials in treating
static and dynamic problems of an isotropic elastic body subjected to
forces acting on the boundary surface of the body, the forces being independent
of the deformation of the body. The main problem is to determine the
stresses and displacements at every point in the body under appropriate
boundary conditions. The displacements will be assumed to be infinitesimal
and Hooke’s law is assumed to hold, so that the basic equations are
those listed in Sec. 7.1.

In both static and dynamic problems, we may start with Navier’s equation
and try to determine a continuous and twice differentiable solution $u_i$
under appropriate boundary conditions. In the case of static equilibrium,
we may also start with a general solution of the equation of equilibrium
and use the compatibility and boundary conditions to determine the unique
answer.

To simplify the solution, a number of potentials have been introduced.
The most powerful ones are presented below. Potentials related to displacements
are the scalar and vector potentials, the Galerkin vectors, and
the Papkovich-Neuber functions. These will be discussed in this chapter.
Potentials that generate systems of equilibrating stresses are the Airy
stress functions, the Maxwell-Morera stress functions, etc. (see Chapter 9).
Some applications will be illustrated in this and subsequent chapters.

8.1. SCALAR AND VECTOR POTENTIALS FOR 
DISPLACEMENT VECTOR FIELDS 

It is well-known (sometimes referred to as Helmholtz’ theorem) that any
analytic vector field $\mathbf{u}(u_1, u_2, u_3)$ can be expressed in the form

(1) ▲ $u_i = \phi_{,i} + e_{ijk}\psi_{k,j}$,

i.e.,

(2a) ▲ $\mathbf{u} = \operatorname{grad}\phi + \operatorname{curl}\boldsymbol{\psi}$,

involving three equations of the type

(2b) ▲ $u_1 = \dfrac{\partial\phi}{\partial x_1} + \dfrac{\partial\psi_3}{\partial x_2} - \dfrac{\partial\psi_2}{\partial x_3}$,

where $\phi$ is a scalar function and $\boldsymbol{\psi}$ is a vector field with three components
$\psi_1$, $\psi_2$, $\psi_3$. The requirement (1) leaves $\psi_i$ indeterminate to the extent that
its divergence is arbitrary. For definiteness, we may impose the condition

(3) $\psi_{i,i} = 0$.

Then $\phi$ is called the scalar potential and $\boldsymbol{\psi}$ the vector potential of the vector
field $\mathbf{u}$. We shall prove this theorem presently by showing that the functions
$\phi$ and $\psi_1$, $\psi_2$, $\psi_3$ so proposed can be found.

Applying Eq. (1) to the elastic displacement vector $u_i$, we shall see
that the dilatation $e$ can be derived from the scalar potential $\phi$ and that
the rotation can be derived from the vector potential $\boldsymbol{\psi}$. Note first that if
Eq. (1) is effected, then

(4) $u_{i,i} = \phi_{,ii} + e_{ijk}\psi_{k,ji}$.

The last term vanishes when summed in pairs since $\psi_{k,ji} = \psi_{k,ij}$, but
$e_{ijk} = -e_{jik}$. Hence,

(5) $e = u_{i,i} = \phi_{,ii}$.

(6) $e_{ijk}u_{k,j} = e_{ijk}\phi_{,kj} + e_{ijk}e_{klm}\psi_{m,lj}$

 $= e_{kij}e_{klm}\psi_{m,lj}$   [by symmetry of $\phi_{,kj}$]

 $= (\delta_{il}\delta_{jm} - \delta_{im}\delta_{jl})\psi_{m,lj}$   [by the $e$-$\delta$ identity, Eq. (2.1:11)]

 $= -\psi_{i,jj} + \psi_{j,ij}$

 $= -\psi_{i,jj}$   [by virtue of Eq. (3)].


The curl of $u_i$, given by the left-hand side of Eq. (6), is twice the rotation
vector $\omega_i$:

$\omega_i = \tfrac{1}{2}e_{ijk}u_{k,j}$.

Hence, we may write

(7) $\psi_{i,jj} = -2\omega_i$.

This shows that each component of the vector $\psi_i$ is related to a component
of the rotation vector field.

Now, particular solutions of Eqs. (5) and (7) are known in the theory
of Newtonian potentials. They are

(8) $\phi(x_1, x_2, x_3) = -\dfrac{1}{4\pi}\iiint\dfrac{e(x_1', x_2', x_3')}{r}\,dx_1'\,dx_2'\,dx_3'$,

(9) $\psi_i(x_1, x_2, x_3) = \dfrac{1}{2\pi}\iiint\dfrac{\omega_i(x_1', x_2', x_3')}{r}\,dx_1'\,dx_2'\,dx_3'$,

where

$r^2 = (x_1 - x_1')^2 + (x_2 - x_2')^2 + (x_3 - x_3')^2$

and the integration extends over the entire body. But $\psi_i$ given in Eq. (9)
does not always satisfy Eq. (3). To overcome this difficulty, we note that
the curl of the gradient of a scalar function vanishes, and a general vector
potential is

(10) $\bar\psi_i = \psi_i + \theta_{,i}$,

where $\theta$ is a scalar function. The added term $\theta_{,i}$ contributes nothing to $u_i$.
However, $\theta$ can be determined in such a manner that

(11) $\theta_{,ii} = -\psi_{i,i}$

by an integral analogous to Eq. (8). With $\theta$ so determined, the function $\bar\psi_i$
given by Eq. (10) will satisfy the requirement Eq. (3).

The general solution of Eq. (5) is the sum of Eq. (8) and an arbitrary
harmonic function $\phi^*$. The general solution of Eq. (7) is the sum of Eq. (10)
and a set of harmonic functions $\psi_i^*$. Thus, Eq. (1), considered as a set of
partial differential equations for the unknown functions $\phi$ and $\psi_i$, can be
solved. Q.E.D.
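
Relations (5) and (7) are easy to check numerically for a given pair of potentials. The sketch below is an illustration only: the polynomial potentials `phi` and `psi`, the sample point, and the helper names are assumptions chosen so that $\psi_{i,i} = 0$ holds identically. It builds $u_i = \phi_{,i} + e_{ijk}\psi_{k,j}$ by central differences and then evaluates the dilatation and the rotation of the resulting field.

```python
def partial(f, p, j, h=1e-3):
    """Central-difference derivative of f (scalar or 3-list valued) w.r.t. x_j at p."""
    a, b = list(p), list(p)
    a[j] += h
    b[j] -= h
    fa, fb = f(a), f(b)
    if isinstance(fa, (list, tuple)):
        return [(s - t) / (2 * h) for s, t in zip(fa, fb)]
    return (fa - fb) / (2 * h)

def grad(f, p):
    return [partial(f, p, j) for j in range(3)]

def div(v, p):
    return sum(partial(v, p, j)[j] for j in range(3))

def curl(v, p):
    d = [partial(v, p, j) for j in range(3)]      # d[j][i] = dv_i/dx_j
    return [d[1][2] - d[2][1], d[2][0] - d[0][2], d[0][1] - d[1][0]]

def phi(p):                                       # assumed scalar potential
    x, y, z = p
    return x * x * y + z ** 3

def psi(p):                                       # assumed vector potential, psi_{i,i} = 0
    x, y, z = p
    return [y * y * z, x * z * z, x * x * y]

def u(p):                                         # u = grad(phi) + curl(psi), Eq. (1)
    g, c = grad(phi, p), curl(psi, p)
    return [g[i] + c[i] for i in range(3)]

point = [0.3, -0.7, 1.1]
dilatation = div(u, point)                        # e = u_{i,i}
rotation = [0.5 * w for w in curl(u, point)]      # omega_i = (1/2) e_{ijk} u_{k,j}
```

For these potentials $\nabla^2\phi = 2y + 6z$ and $\nabla^2\psi_i = (2z, 2x, 2y)$, so the computed `dilatation` should agree with the former and `rotation` with $-\tfrac12$ times the latter.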

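Love shows (Mathematical Theory of Elasticity,$^{1.1}$ p. 48), through integration
by parts of Eqs. (8) and (9), that the following resolution meets all
the requirements on the potentials:

(12) $\phi = -\dfrac{1}{4\pi}\iiint u_i(x_1', x_2', x_3')\dfrac{\partial}{\partial x_i}\left(\dfrac{1}{r}\right)dx_1'\,dx_2'\,dx_3'$,

(13) $\psi_i = \dfrac{1}{4\pi}\iiint e_{ijk}u_k(x_1', x_2', x_3')\dfrac{\partial}{\partial x_j}\left(\dfrac{1}{r}\right)dx_1'\,dx_2'\,dx_3'$.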

Pendse (Phil. Mag. Ser. 7, 39: 862-867, 1948) pointed out, however, that
the use of scalar and vector potentials is not always free from ambiguity.

8.2. EQUATIONS OF MOTION IN TERMS OF
DISPLACEMENT POTENTIALS

Let the displacement vector field $u_1$, $u_2$, $u_3$ be represented by a scalar
potential $\phi(x_1, x_2, x_3, t)$ and a triple of vector potentials $\psi_i(x_1, x_2, x_3, t)$,
$i = 1, 2, 3$, so that

(1) $u_i = \dfrac{\partial\phi}{\partial x_i} + e_{ijk}\dfrac{\partial\psi_k}{\partial x_j}$,   $\psi_{i,i} = 0$.


We shall assume that the body force is zero. [If this is not the case, the
body force term can be removed by a particular integral such as Kelvin’s,
see Eq. (8.8:9). See also Sec. 8.18.] We can write Navier’s equation of
linear elasticity for a homogeneous, isotropic body,

(2a) $\rho\dfrac{\partial^2 u_i}{\partial t^2} = Gu_{i,jj} + (\lambda + G)u_{j,ji}$,

in the form

(2) $\rho\dfrac{\partial}{\partial x_i}\left(\dfrac{\partial^2\phi}{\partial t^2}\right) + \rho e_{ijk}\dfrac{\partial}{\partial x_j}\left(\dfrac{\partial^2\psi_k}{\partial t^2}\right) = (\lambda + G)\dfrac{\partial}{\partial x_i}\nabla^2\phi + G\dfrac{\partial}{\partial x_i}\nabla^2\phi + Ge_{ijk}\dfrac{\partial}{\partial x_j}\nabla^2\psi_k$,

where $\nabla^2$ is the Laplacian operator

$\nabla^2 = \dfrac{\partial^2}{\partial x_1^2} + \dfrac{\partial^2}{\partial x_2^2} + \dfrac{\partial^2}{\partial x_3^2}$.

The fact that $\psi_{i,i} = 0$ [Eq. (8.1:3)] is used here. Furthermore, since the
displacements are assumed to be infinitesimal, any change in the density $\rho$
is an infinitesimal of the first order, as can be seen easily from the equation
of continuity [Eq. (5.4:4)]. Since in Eq. (2a), $\rho$ is multiplied by small
quantities of the first order, $u_i$, it can be treated as a constant. Now, it
is easy to see that Eq. (2) will be satisfied if the functions $\phi$ and $\psi_i$ are
solutions of the equations


(3) ▲ $\nabla^2\phi = \dfrac{1}{c_L^2}\dfrac{\partial^2\phi}{\partial t^2}$,   $\nabla^2\psi_k = \dfrac{1}{c_T^2}\dfrac{\partial^2\psi_k}{\partial t^2}$,

where

(4) $c_L = \sqrt{\dfrac{\lambda + 2G}{\rho}}$,   $c_T = \sqrt{\dfrac{G}{\rho}}$.

These are wave equations. They indicate that two types of disturbances
with velocities $c_L$ and $c_T$ may be propagated through an elastic
solid. A comparison with Eqs. (7.8:3) and (7.8:5) shows that $c_L$ and $c_T$ are
the speeds of plane longitudinal and transverse waves, respectively. These
waves are also referred to as dilatational and distortional waves. The latter
are also known as shear, equivoluminal, or rotational waves. If we write $e$ for
the dilatation and $\boldsymbol{\omega}(\omega_1, \omega_2, \omega_3)$ for the rotation, then, by Eqs. (8.1:5)
and (8.1:7), we have, from Eq. (3),

(5) $\nabla^2 e = \dfrac{1}{c_L^2}\dfrac{\partial^2 e}{\partial t^2}$,   $\nabla^2\omega_i = \dfrac{1}{c_T^2}\dfrac{\partial^2\omega_i}{\partial t^2}$.

Thus, by introducing the potentials $\phi$ and $\psi_1$, $\psi_2$, $\psi_3$, we have reduced the
problems of linear elasticity to that of solving the wave equations.
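
As a quick numerical illustration of the two speeds $c_L = [(\lambda + 2G)/\rho]^{1/2}$ and $c_T = (G/\rho)^{1/2}$, the sketch below evaluates them from Young's modulus and Poisson's ratio. The material data are assumed, handbook-style values for a structural steel and are not taken from the text; the function names are likewise illustrative.

```python
import math

def lame_from_engineering(E, nu):
    """Lame constants (lambda, G) from Young's modulus E and Poisson's ratio nu."""
    lam = E * nu / ((1.0 + nu) * (1.0 - 2.0 * nu))
    G = E / (2.0 * (1.0 + nu))
    return lam, G

def wave_speeds(lam, G, rho):
    """Return (c_L, c_T): dilatational and shear wave speeds."""
    return math.sqrt((lam + 2.0 * G) / rho), math.sqrt(G / rho)

# Assumed sample data in SI units: E in Pa, rho in kg/m^3.
lam, G = lame_from_engineering(E=200e9, nu=0.3)
c_L, c_T = wave_speeds(lam, G, rho=7850.0)
ratio = c_L / c_T        # depends on Poisson's ratio only: sqrt(2(1-nu)/(1-2nu))
```

The ratio $c_L/c_T$ is independent of the density and of the stiffness scale, so it is a convenient check on any pair of measured speeds.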

If static equilibrium is considered, then all derivatives with respect to
time vanish, and we see that Eq. (2) is satisfied if

(6) $\nabla^2\phi = \text{const.}$,   $\nabla^2\psi_k = \text{const.}$

In the following sections we shall consider some applications to statics based
on Eqs. (6).

Of course, not all solutions of Eq. (2) are given by Eqs. (3) and (6).
We shall now consider a more general solution. Such a generalization often
becomes important when we wish to examine the limiting case of a dynamic
problem as the loading tends to a steady state. For example, if we consider a
load suddenly applied at $t = 0$ on a body and then kept constant afterwards,
we may question whether the steady-state solution is given by Eqs. (3) with
the $\partial\phi/\partial t$, $\partial\psi_k/\partial t$ terms set to zero. A quick comparison with Eq. (6) shows
that this may not be the case if the constants in Eq. (6) do not vanish.

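Let us differentiate Eq. (2) with respect to $x_m$:

(7) $\rho\dfrac{\partial^2}{\partial x_m\partial x_i}\left(\dfrac{\partial^2\phi}{\partial t^2}\right) + \rho e_{ijk}\dfrac{\partial^2}{\partial x_m\partial x_j}\left(\dfrac{\partial^2\psi_k}{\partial t^2}\right)$

 $= (\lambda + G)\dfrac{\partial^2}{\partial x_i\partial x_m}\nabla^2\phi + G\dfrac{\partial^2}{\partial x_i\partial x_m}\nabla^2\phi + Ge_{ijk}\dfrac{\partial^2}{\partial x_m\partial x_j}\nabla^2\psi_k$.

A contraction of $m$ with $i$ yields

(8) $\nabla^2\left(\nabla^2\phi - \dfrac{1}{c_L^2}\dfrac{\partial^2\phi}{\partial t^2}\right) = 0$.

A multiplication of (7) with $e_{iml}$, summing over $m$ and summing over
$i$, and using Eq. (8.1:3), yields, on the other hand,

(9) $\nabla^2\left(\nabla^2\psi_k - \dfrac{1}{c_T^2}\dfrac{\partial^2\psi_k}{\partial t^2}\right) = 0$.

Hence,

(10) ▲ $\nabla^2\phi - \dfrac{1}{c_L^2}\dfrac{\partial^2\phi}{\partial t^2} = \Phi$,   $\nabla^2\Phi = 0$,

(11) ▲ $\nabla^2\psi_k - \dfrac{1}{c_T^2}\dfrac{\partial^2\psi_k}{\partial t^2} = \Psi_k$,   $\nabla^2\Psi_k = 0$,

where $\Phi$, $\Psi_1$, $\Psi_2$, $\Psi_3$ are harmonic functions. The $\Phi$ and $\Psi$’s are not
entirely independent, but are connected through Eq. (2):

(12) ▲ $c_L^2\dfrac{\partial\Phi}{\partial x_i} + c_T^2 e_{ijk}\dfrac{\partial\Psi_k}{\partial x_j} = 0$.

Equations (10)-(12) give the general solution of (2). Equations (3) are obtained
by setting $\Phi = \Psi_k = 0$; Eqs. (6), by taking $\Phi$ and $\Psi_k$ to be constants
and by letting time derivatives vanish.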


8.3. STRAIN POTENTIAL 


In this section we shall consider the static equilibrium of elastic bodies
in the restricted case in which the components of displacements can be
derived from a scalar function $\phi(x_1, x_2, x_3)$ so that

(1) $2Gu_i = \dfrac{\partial\phi}{\partial x_i}$.

The function $\phi$ is called Lamé’s strain potential. With Eq. (1), the dilatation
$e$, the strain tensor $e_{ij}$, and the stress tensor $\sigma_{ij}$ are

(2) $2Ge = 2Gu_{i,i} = \phi_{,ii}$,

(3) $e_{ij} = \tfrac{1}{2}(u_{i,j} + u_{j,i}) = \dfrac{1}{2G}\phi_{,ij}$,

(4) $\sigma_{ij} = \lambda\delta_{ij}e + 2Ge_{ij} = \lambda\delta_{ij}e + \phi_{,ij}$.

In the absence of body force, the function $\phi$ must satisfy Eqs. (8.2:6).
Since our objective is to obtain some solution, not necessarily general, the
constant in (8.2:6) will be chosen as zero. Then $\phi$ satisfies the Laplace
equation

(6) $\nabla^2\phi = 0$.

Hence, $\phi$ is a harmonic function by definition. Under this choice, we have

(7) $\sigma_{ij} = \phi_{,ij}$.

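In unabridged notations with respect to rectangular Cartesian coordinates
$x, y, z$, Eqs. (2)-(7) are

(8) $2Gu = \dfrac{\partial\phi}{\partial x}$,   $2Gv = \dfrac{\partial\phi}{\partial y}$,   $2Gw = \dfrac{\partial\phi}{\partial z}$,

(9) $2Ge = \nabla^2\phi = \left(\dfrac{\partial^2}{\partial x^2} + \dfrac{\partial^2}{\partial y^2} + \dfrac{\partial^2}{\partial z^2}\right)\phi = 0$,

(10) $\sigma_{xx} = \dfrac{\partial^2\phi}{\partial x^2}$,   $\sigma_{xy} = \dfrac{\partial^2\phi}{\partial x\,\partial y}$.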

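Corresponding formulas for curvilinear coordinates can be obtained by
converting Eqs. (1)-(7) into general tensor equations and then specializing.
The results for cylindrical coordinates $r, \theta, z$ are as follows, where $(\xi_r, \xi_\theta, \xi_z)$
and ($\sigma_{rr}$, $\sigma_{r\theta}$, etc.) denote the physical components of displacement and
stress, respectively.

(11) $2G\xi_r = \dfrac{\partial\phi}{\partial r}$,   $2G\xi_\theta = \dfrac{1}{r}\dfrac{\partial\phi}{\partial\theta}$,   $2G\xi_z = \dfrac{\partial\phi}{\partial z}$,

(12) $2Ge = \nabla^2\phi = \left(\dfrac{\partial^2}{\partial r^2} + \dfrac{1}{r}\dfrac{\partial}{\partial r} + \dfrac{1}{r^2}\dfrac{\partial^2}{\partial\theta^2} + \dfrac{\partial^2}{\partial z^2}\right)\phi = 0$,

(13) $\sigma_{rr} = \dfrac{\partial^2\phi}{\partial r^2}$,   $\sigma_{\theta\theta} = \dfrac{1}{r}\dfrac{\partial\phi}{\partial r} + \dfrac{1}{r^2}\dfrac{\partial^2\phi}{\partial\theta^2}$,   $\sigma_{zz} = \dfrac{\partial^2\phi}{\partial z^2}$,

(14) $\sigma_{r\theta} = \dfrac{\partial}{\partial r}\left(\dfrac{1}{r}\dfrac{\partial\phi}{\partial\theta}\right) = \dfrac{\partial^2}{\partial r\,\partial\theta}\left(\dfrac{\phi}{r}\right)$,

(15) $\sigma_{\theta z} = \dfrac{1}{r}\dfrac{\partial^2\phi}{\partial\theta\,\partial z}$,   $\sigma_{zr} = \dfrac{\partial^2\phi}{\partial z\,\partial r}$.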

Harmonic functions are well-known. For example, it can easily be verified that
the functions

(16) $A(x^2 - y^2) + 2Bxy$,

(17) $Cr^n\cos n\theta$,   $r^2 = x^2 + y^2$,

(18) $C\log(r/a)$,   $r^2 = x^2 + y^2$,

(19) $C\theta$,   $\theta = \tan^{-1}(y/x)$,

(20) $\dfrac{C}{R}$,   $R^2 = x^2 + y^2 + z^2$,

(21) $C\log(R + z)$,

(22) $C\log\{(R_1 + z - c)(R_2 - z - c)\}$,   $R_1^2 = r^2 + (z - c)^2$,   $R_2^2 = r^2 + (z + c)^2$,

are harmonic functions, and combinations of them can solve some important
practical problems in elasticity. In Eqs. (16)-(19) the coordinate $z$ does not
appear. Hence, the component of displacement in the $z$-direction, $\partial\phi/\partial z$,
vanishes, and any combination of these functions can only correspond to a
plane strain condition. It is obvious that the solutions (17)-(19) may be
useful for cylindrical bodies, (20) may be useful for spheres, and (21) and
(22) may be useful for bodies of revolution about the $z$-axis.
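
The claim that these functions are harmonic can be spot-checked numerically. The sketch below is only an illustration: it applies a seven-point finite-difference Laplacian to the functions (16)-(21) at one arbitrarily chosen point, with arbitrary (assumed) values for the constants $A$, $B$, $C$, $a$, $n$. For contrast it also evaluates the Laplacian of $C(x^2 + y^2)$ and $C(x^2 + y^2 + z^2)$, which are not harmonic.

```python
import math

def laplacian(f, x, y, z, h=1e-3):
    """Seven-point finite-difference Laplacian of f at (x, y, z)."""
    return (f(x + h, y, z) + f(x - h, y, z) + f(x, y + h, z) + f(x, y - h, z)
            + f(x, y, z + h) + f(x, y, z - h) - 6.0 * f(x, y, z)) / h ** 2

A, B, C, a, n = 1.3, -0.4, 2.0, 1.5, 3      # arbitrary constants for the check

def r(x, y):
    return math.hypot(x, y)

def R(x, y, z):
    return math.sqrt(x * x + y * y + z * z)

candidates = {
    "(16)": lambda x, y, z: A * (x * x - y * y) + 2.0 * B * x * y,
    "(17)": lambda x, y, z: C * r(x, y) ** n * math.cos(n * math.atan2(y, x)),
    "(18)": lambda x, y, z: C * math.log(r(x, y) / a),
    "(19)": lambda x, y, z: C * math.atan2(y, x),
    "(20)": lambda x, y, z: C / R(x, y, z),
    "(21)": lambda x, y, z: C * math.log(R(x, y, z) + z),
}
residuals = {name: laplacian(f, 0.8, 0.5, 0.6) for name, f in candidates.items()}

# Not harmonic: the Laplacians are the constants 4C and 6C, respectively.
lap_r2 = laplacian(lambda x, y, z: C * (x * x + y * y), 0.8, 0.5, 0.6)
lap_R2 = laplacian(lambda x, y, z: C * (x * x + y * y + z * z), 0.8, 0.5, 0.6)
```

A single point is of course not a proof; the check merely guards against transcription slips when such functions are combined.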

To the stress systems derived from these potentials, we may add the
uniform stress distributions given by

(23) $\phi = Cr^2 = C(x^2 + y^2)$,

(24) $\phi = CR^2 = C(x^2 + y^2 + z^2)$,

which are special solutions of Eq. (8.2:6). With Eq. (23) or (24), Eqs. (9)-(10)
and (12)-(15) must be modified because $e \neq 0$.

Example. Hollow Spheres Subjected to Internal and External Pressure

Let a hollow sphere of inner radius $a$ and outer radius $b$ be subjected to
an internal pressure $p$ and an external pressure $q$. Because of the spherical
symmetry of the problem, it will be advantageous to use spherical coordinates
$R, \theta, \varphi$. All shearing stresses $\sigma_{R\theta}$, $\sigma_{\theta\varphi}$, etc., vanish on account of
symmetry.

The solution is furnished by the strain potential (20) and (24),

$\phi = \dfrac{C}{R} + DR^2$,   $R^2 = x^2 + y^2 + z^2$,

from which

$\dfrac{\partial^2\phi}{\partial x^2} = \dfrac{3Cx^2}{R^5} - \dfrac{C}{R^3} + 2D$,   $\dfrac{\partial^2\phi}{\partial y^2} = \dfrac{3Cy^2}{R^5} - \dfrac{C}{R^3} + 2D$.

Hence, at the point $(R, 0, 0)$

$\sigma_{RR} = \sigma_{xx} = \dfrac{2C}{R^3} + 2D$,

$\sigma_{\theta\theta} = \sigma_{yy} = -\dfrac{C}{R^3} + 2D$.

The constants $C$ and $D$ are easily determined from the boundary conditions,
$\sigma_{RR} = -p$ when $R = a$, $\sigma_{RR} = -q$ when $R = b$.

The results are

$\sigma_{RR} = -p\,\dfrac{(b/R)^3 - 1}{(b/a)^3 - 1} - q\,\dfrac{1 - (a/R)^3}{1 - (a/b)^3}$,

$\sigma_{\theta\theta} = \dfrac{p}{2}\,\dfrac{(b/R)^3 + 2}{(b/a)^3 - 1} - \dfrac{q}{2}\,\dfrac{(a/R)^3 + 2}{1 - (a/b)^3}$.
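
The closed-form result for the hollow sphere is simple enough to verify numerically. The sketch below is an illustration with assumed sample values for $a$, $b$, $p$, $q$; it evaluates $\sigma_{RR} = -p\,[(b/R)^3 - 1]/[(b/a)^3 - 1] - q\,[1 - (a/R)^3]/[1 - (a/b)^3]$ and the companion expression for $\sigma_{\theta\theta}$, checks the two boundary conditions, and tests the radial equilibrium equation $d\sigma_{RR}/dR + 2(\sigma_{RR} - \sigma_{\theta\theta})/R = 0$ by a central difference.

```python
def sigma_RR(R, a, b, p, q):
    """Radial stress in a hollow sphere (inner pressure p, outer pressure q)."""
    return (-p * ((b / R) ** 3 - 1.0) / ((b / a) ** 3 - 1.0)
            - q * (1.0 - (a / R) ** 3) / (1.0 - (a / b) ** 3))

def sigma_tt(R, a, b, p, q):
    """Circumferential stress sigma_theta_theta in the same sphere."""
    return (0.5 * p * ((b / R) ** 3 + 2.0) / ((b / a) ** 3 - 1.0)
            - 0.5 * q * ((a / R) ** 3 + 2.0) / (1.0 - (a / b) ** 3))

def equilibrium_residual(R, a, b, p, q, h=1e-5):
    """d(sigma_RR)/dR + 2 (sigma_RR - sigma_tt)/R, which should vanish."""
    d = (sigma_RR(R + h, a, b, p, q) - sigma_RR(R - h, a, b, p, q)) / (2.0 * h)
    return d + 2.0 * (sigma_RR(R, a, b, p, q) - sigma_tt(R, a, b, p, q)) / R

a, b, p, q = 1.0, 2.0, 10.0, 3.0          # assumed sample geometry and pressures
inner = sigma_RR(a, a, b, p, q)           # should equal -p
outer = sigma_RR(b, a, b, p, q)           # should equal -q
residual = equilibrium_residual(1.5, a, b, p, q)
hydrostatic = sigma_tt(1.3, a, b, 5.0, 5.0)   # p = q gives uniform stress -p
```

Setting $p = q$ reduces both stresses to the uniform value $-p$, a useful sanity check on the signs.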


Problem 8.1. Find the solution for a solid sphere subjected to a uniform
external pressure $q$.

Problem 8.2. Let the inner wall of the hollow spherical shell, at $R = a$, be
rigid (a condition realized by filling the hole with some incompressible material).
When the outer surface $R = b$ is subjected to a uniform pressure $q$, what are the
stresses at the inner wall?

Problem 8.3. Show that the function $\phi = C\theta$ solves the problem of a circular
disk subjected to uniformly distributed tangential shear on the circumference.

8.4. GALERKIN VECTOR 

We have shown in Sec. 8.2 that when the displacement field $u_i$ is expressed
in terms of scalar and vector potentials, $\phi$ and $\psi_1$, $\psi_2$, $\psi_3$,
respectively,

(1) $2Gu_i = \phi_{,i} + e_{ijk}\psi_{k,j}$,   $\psi_{i,i} = 0$,

the equations of static equilibrium for a homogeneous isotropic linear elastic
medium are satisfied if

(2) $\nabla^2\phi = \text{const.}$,   $\nabla^2\psi_k = \text{const.}$

Thus, a broad class of problems in elastic equilibrium is solved by determining
the four functions $\phi$, $\psi_1$, $\psi_2$, $\psi_3$. However, this may not be the
general solution. The general solution (8.2:10)-(8.2:12) is more complex.

In searching for other solutions of equal generality, Galerkin introduced,
in papers published in 1930, displacement potential functions which satisfy
biharmonic equations. Papkovich noted in 1932 that Galerkin’s functions
are components of a vector. It is perhaps simpler to introduce the Galerkin
vector by considering the vector potential $\psi_k$ itself as being generated by
another vector field $F_i$,

(3) $\psi_k = -e_{klm}cF_{m,l}$,

where $c$ is a constant. Then Eq. (1) becomes

(4) $2Gu_i = \phi_{,i} - e_{ijk}e_{klm}cF_{m,lj}$.

This can be simplified by means of the $e$-$\delta$ identity [Eq. (2.1:11)] into

(5) $2Gu_i = \phi_{,i} - (\delta_{il}\delta_{jm} - \delta_{im}\delta_{jl})cF_{m,lj}$

 $= \phi_{,i} + cF_{i,jj} - cF_{j,ji}$.

But $cF_{j,j}$ is a scalar function, and can be specified arbitrarily without
disturbing the definition (3). This suggests the representation†

(6) $2Gu_i = cF_{i,jj} - F_{j,ji}$.

The hope now arises that, by a judicious choice of the constant $c$, the Navier
Eq. (7.1:9), namely,

$(\lambda + G)u_{j,jk} + Gu_{k,mm} + X_k = 0$,

where $X_k$ represents the body force per unit volume, can be simplified.
Indeed, on substituting Eq. (6) into the Navier equation, and remembering
that $\lambda + G = G/(1 - 2\nu)$, we obtain

$\dfrac{1}{2(1 - 2\nu)}(cF_{j,mmjk} - F_{m,mjjk}) + \dfrac{1}{2}(cF_{k,jj} - F_{j,jk})_{,mm} + X_k = 0$.

With a proper change of dummy indices, this is

$\left[\dfrac{c - 1}{2(1 - 2\nu)} - \dfrac{1}{2}\right]F_{j,jiik} + \dfrac{c}{2}F_{k,jjii} + X_k = 0$.

The coefficient of the first term vanishes if

$c = 2(1 - \nu)$.

Therefore, we conclude that the basic equation of elasticity is satisfied when

(7) ▲ $2Gu_i = 2(1 - \nu)F_{i,jj} - F_{j,ji}$,

if $F_i$ satisfies the equation

(8) ▲ $F_{i,jjmm} = -\dfrac{X_i}{1 - \nu}$.

The vector $F_i$ so defined is called the Galerkin vector. If $X_i = 0$, Eq. (8)
is said to be biharmonic and its solutions are called biharmonic functions.
Thus, to solve a problem in static equilibrium is to determine the three
functions $F_1$, $F_2$, $F_3$.

† In order that Eqs. (5) and (6) both represent the same displacement field, we may choose
$\bar F_i = F_i + H_i$, where $H_i$ are arbitrary harmonic functions ($H_{i,jj} = 0$), and demand that
$F_{j,j} = (\phi + H_{j,j})/(c - 1)$.

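From Eq. (7), we can derive expressions for the stresses in terms of $F_i$.
In unabridged notation, we have, with

$\nabla^2 = \dfrac{\partial^2}{\partial x_1^2} + \dfrac{\partial^2}{\partial x_2^2} + \dfrac{\partial^2}{\partial x_3^2}$,

the following results:

(9) $\nabla^4 F_1 = -\dfrac{X_1}{1 - \nu}$,

(10) $2Gu_1 = 2(1 - \nu)\nabla^2 F_1 - \dfrac{\partial}{\partial x_1}\left(\dfrac{\partial F_1}{\partial x_1} + \dfrac{\partial F_2}{\partial x_2} + \dfrac{\partial F_3}{\partial x_3}\right)$,

(11) $2G\left(\dfrac{\partial u_1}{\partial x_1} + \dfrac{\partial u_2}{\partial x_2} + \dfrac{\partial u_3}{\partial x_3}\right) = (1 - 2\nu)\nabla^2\left(\dfrac{\partial F_1}{\partial x_1} + \dfrac{\partial F_2}{\partial x_2} + \dfrac{\partial F_3}{\partial x_3}\right)$,

(12) $\sigma_{11} = 2(1 - \nu)\dfrac{\partial}{\partial x_1}\nabla^2 F_1 + \left(\nu\nabla^2 - \dfrac{\partial^2}{\partial x_1^2}\right)\left(\dfrac{\partial F_1}{\partial x_1} + \dfrac{\partial F_2}{\partial x_2} + \dfrac{\partial F_3}{\partial x_3}\right)$,

(13) $\sigma_{12} = (1 - \nu)\left(\dfrac{\partial}{\partial x_2}\nabla^2 F_1 + \dfrac{\partial}{\partial x_1}\nabla^2 F_2\right) - \dfrac{\partial^2}{\partial x_1\partial x_2}\left(\dfrac{\partial F_1}{\partial x_1} + \dfrac{\partial F_2}{\partial x_2} + \dfrac{\partial F_3}{\partial x_3}\right)$,

(14) $\sigma_{11} + \sigma_{22} + \sigma_{33} = (1 + \nu)\nabla^2\left(\dfrac{\partial F_1}{\partial x_1} + \dfrac{\partial F_2}{\partial x_2} + \dfrac{\partial F_3}{\partial x_3}\right)$.

Other components $u_2$, $u_3$, $\sigma_{22}$, $\sigma_{33}$, etc., are obtained by cyclic permutation
of the subscripts 1, 2, 3.

Problem 8.4. Determine the body forces, stresses, and displacements defined
by the following Galerkin vectors in rectangular Cartesian coordinates:

(a) $F_1 = F_2 = 0$, $F_3 = R^2$,

(b) $F_1 = yR^2$, $F_2 = -xR^2$, $F_3 = 0$,

(c) $F_1 = F_2 = 0$, $F_3 = z^4$,

where $R^2 = x^2 + y^2 + z^2$.

Problem 8.5. Show that the Lamé strain potential $\phi$ can be identified with
the divergence $-F_{i,i}$, if the body force vanishes and if $F_i$ is harmonic.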

8.5. EQUIVALENT GALERKIN VECTORS 

If two Galerkin vectors $F_i$ and $\bar F_i$ define the same displacements, they
are equivalent. Let $F_i$ and $\bar F_i$ be two equivalent Galerkin vectors. Then
their difference

(1) $\tilde F_i = F_i - \bar F_i$

is a biharmonic vector and corresponds to zero displacement, $u_i = 0$. A
general form of $\tilde F_i$ can be obtained if we set, as in Sec. 8.4,

(2) $\tilde F_i = cf_{i,jj} + f_{j,ji}$

and determine $c$ such that $u_i = 0$, where

$2Gu_i = 2(1 - \nu)\tilde F_{i,jj} - \tilde F_{j,ji}$.

The result is

(3) $\tilde F_i = (1 - 2\nu)f_{i,jj} + f_{j,ji}$,

provided that

(4) $f_{i,jjkk} = 0$.

As an application, let us consider a Galerkin vector the third component
of which does not vanish,

(5) $F_1, F_2, F_3 \neq 0$.

It will be shown that an equivalent $\bar F_i$ with $\bar F_3 = 0$ can be formed if the
body force $X_i = 0$. A particular solution is obtained by taking a special
vector $f_i$

(6) $f_1 = f$,   $f_2 = f_3 = 0$.

Then, from Eqs. (3) and (5),

(7) $\tilde F_1 = F_1 - \bar F_1 = \left[(1 - 2\nu)\nabla^2 + \dfrac{\partial^2}{\partial x^2}\right]f$,

(8) $\tilde F_2 = F_2 - \bar F_2 = \dfrac{\partial^2 f}{\partial x\,\partial y}$,

(9) $\tilde F_3 = F_3 - \bar F_3 = F_3 = \dfrac{\partial^2 f}{\partial x\,\partial z}$,   $\bar F_3 = 0$.

Now, since $X_i = 0$ implies that $F_3$ is biharmonic, it will be possible
to determine a biharmonic function $f$ that satisfies Eq. (9). When $f$ is so
determined, we could compute $\bar F_1$ and $\bar F_2$ according to Eqs. (7) and (8).
Thus, when the body forces are zero, any Galerkin vector has an equivalent
Galerkin vector with a zero component in the direction of $z$.


8.6. EXAMPLE-VERTICAL LOAD ON THE HORIZONTAL 
SURFACE OF A SEMI-INFINITE SOLID 

Consider a semi-infinite homogeneous, isotropic, linear elastic solid
which occupies the space $z > 0$ as shown in Fig. 8.6:1, and is subjected
to a sinusoidally distributed vertical load on the boundary surface $z = 0$,
so that the boundary conditions are

(1) (a) $\sigma_{zz} = -\mathcal{A}\cos\dfrac{\pi x}{l}\cos\dfrac{\pi y}{L}$,   (b) $\sigma_{zx} = \sigma_{zy} = 0$,   at $z = 0$.

The boundary condition at $z = \infty$ is the vanishing of all stress components.

Fig. 8.6:1. Sinusoidally distributed load acting on a semi-infinite solid.

The problem can be solved by taking a Galerkin vector with one nonvanishing
component,

(2) $F_1 = F_2 = 0$,   $F_3 = Z$.

The function $Z$ must be biharmonic. It can be easily verified that, if $\psi$ is a
harmonic function, then the following function is biharmonic (cf. Sec. 8.11):

(3) $Z = (A + Bcz)\psi$.

The form of the boundary condition (1a) suggests that we should try

(4) $\psi = \cos\dfrac{\pi x}{l}\cos\dfrac{\pi y}{L}f(z)$.

On substituting (4) into the Laplace equation $\nabla^2\psi = 0$, one finds that

$f(z) = e^{-cz}$,

where

(5) $c = \pi\left(\dfrac{1}{l^2} + \dfrac{1}{L^2}\right)^{1/2}$.

The other solution $e^{cz}$ must be rejected by the boundary condition at
infinity.

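The stresses corresponding to the Galerkin vector above can be easily
derived. Note that

(6) $\nabla^2 Z = 2Bc\dfrac{\partial\psi}{\partial z} = -2Bc^2\psi$,

 $\dfrac{\partial Z}{\partial z} = -cA\psi + Bc(1 - cz)\psi$,

 $\left(\dfrac{\partial^2}{\partial x^2} + \dfrac{\partial^2}{\partial y^2}\right)Z = -(A + Bcz)c^2\psi$.

Hence, by Eqs. (8.4:13),

(7) $\sigma_{zx} = \dfrac{\partial}{\partial x}[2\nu Bc^2 - (A + Bcz)c^2]\psi$.

The boundary conditions (1b), that $\sigma_{zx}$ and $\sigma_{zy}$ vanish at the surface $z = 0$,
are satisfied if

(8) $A = 2\nu B$.

Then, Eq. (8.4:12) yields

(9) $\sigma_{zz} = Bc^3(1 + cz)\psi = Bc^3(1 + cz)\cos\dfrac{\pi x}{l}\cos\dfrac{\pi y}{L}e^{-cz}$.

A comparison with the boundary condition (1a), in which $\mathcal{A}$ is the amplitude
of the applied load, shows that all boundary conditions are satisfied by taking

(10) $B = -\dfrac{\mathcal{A}}{c^3}$,   $A = -\dfrac{2\nu\mathcal{A}}{c^3}$.

The resulting displacements and stresses are as follows, where we write, for
convenience,

(11) $\alpha = \dfrac{\pi}{l}$,   $\beta = \dfrac{\pi}{L}$.

(12) $u_x = \dfrac{\mathcal{A}\alpha}{2Gc^2}(-1 + 2\nu + cz)\sin\alpha x\cos\beta y\,e^{-cz}$,

 $u_y = \dfrac{\mathcal{A}\beta}{2Gc^2}(-1 + 2\nu + cz)\cos\alpha x\sin\beta y\,e^{-cz}$,

 $u_z = \dfrac{\mathcal{A}}{2Gc}[2(1 - \nu) + cz]\cos\alpha x\cos\beta y\,e^{-cz}$,

 $\sigma_{xx} = \dfrac{\mathcal{A}}{c^2}(-\alpha^2 - 2\nu\beta^2 + \alpha^2 cz)\psi$,

 $\sigma_{yy} = \dfrac{\mathcal{A}}{c^2}(-\beta^2 - 2\nu\alpha^2 + \beta^2 cz)\psi$,

 $\sigma_{zz} = -\mathcal{A}(1 + cz)\psi$,

 $\sigma_{xy} = \dfrac{\mathcal{A}\alpha\beta}{c^2}(1 - 2\nu - cz)\sin\alpha x\sin\beta y\,e^{-cz}$,

 $\sigma_{yz} = -\mathcal{A}\beta z\cos\alpha x\sin\beta y\,e^{-cz}$,

 $\sigma_{zx} = -\mathcal{A}\alpha z\sin\alpha x\cos\beta y\,e^{-cz}$.

Thus, stresses are attenuated exponentially as the depth $z$ is increased; the
rate of attenuation $c$ depends on the wave lengths $l$ and $L$.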

Solutions of this form may be superposed together to produce further 
solutions. The method of Fourier series may be used to obtain periodic 
loadings on the surface z = 0, and the method of Fourier integral may be 
used to obtain more general loadings. 
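
The stress field of this example can be evaluated and checked with a few lines of code. The sketch below is an illustration under stated assumptions: the surface load is taken as a normal pressure of amplitude `load`, so that $\sigma_{zz} = -\texttt{load}\cos\alpha x\cos\beta y$ at $z = 0$ with $\alpha = \pi/l$, $\beta = \pi/L$, $c^2 = \alpha^2 + \beta^2$; the function names and the sample values are hypothetical. It checks the boundary conditions, the decay with depth, and the equilibrium equation in the $z$-direction by central differences.

```python
import math

def half_space_stresses(x, y, z, load, l, L, nu):
    """Return (sxx, syy, szz, sxy, syz, szx) under the assumed surface pressure."""
    alpha, beta = math.pi / l, math.pi / L
    c = math.hypot(alpha, beta)              # attenuation rate, c^2 = alpha^2 + beta^2
    decay = math.exp(-c * z)
    cx, sx = math.cos(alpha * x), math.sin(alpha * x)
    cy, sy = math.cos(beta * y), math.sin(beta * y)
    psi = cx * cy * decay
    sxx = load / c ** 2 * (-alpha ** 2 - 2 * nu * beta ** 2 + alpha ** 2 * c * z) * psi
    syy = load / c ** 2 * (-beta ** 2 - 2 * nu * alpha ** 2 + beta ** 2 * c * z) * psi
    szz = -load * (1 + c * z) * psi
    sxy = load * alpha * beta / c ** 2 * (1 - 2 * nu - c * z) * sx * sy * decay
    syz = -load * beta * z * cx * sy * decay
    szx = -load * alpha * z * sx * cy * decay
    return sxx, syy, szz, sxy, syz, szx

def equilibrium_residual_z(x, y, z, h=1e-5, **kw):
    """d(szx)/dx + d(syz)/dy + d(szz)/dz by central differences (should vanish)."""
    f = lambda X, Y, Z: half_space_stresses(X, Y, Z, **kw)
    d_szx = (f(x + h, y, z)[5] - f(x - h, y, z)[5]) / (2 * h)
    d_syz = (f(x, y + h, z)[4] - f(x, y - h, z)[4]) / (2 * h)
    d_szz = (f(x, y, z + h)[2] - f(x, y, z - h)[2]) / (2 * h)
    return d_szx + d_syz + d_szz

params = dict(load=1.0, l=2.0, L=3.0, nu=0.3)      # assumed sample values
surface = half_space_stresses(0.4, 0.7, 0.0, **params)
deep = half_space_stresses(0.4, 0.7, 10.0, **params)
residual = equilibrium_residual_z(0.4, 0.7, 0.5, **params)
```

Superposing several calls with different `l`, `L`, and `load` gives the Fourier-series loadings mentioned above, since the problem is linear.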


8.7. LOVE’S STRAIN FUNCTION 

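A Galerkin vector that has only one nonvanishing component $F_3$ is called
Love’s strain function and shall be denoted by

(1) $F_3 = Z$.

In many applications it may be desired to express $Z$ both in rectangular
coordinates $x, y, z$ and in cylindrical coordinates $r, \theta, z$. In both cases,

(2) $\nabla^4 Z = -\dfrac{X_z}{1 - \nu}$,

where

$\nabla^2 = \dfrac{\partial^2}{\partial x^2} + \dfrac{\partial^2}{\partial y^2} + \dfrac{\partial^2}{\partial z^2} = \dfrac{\partial^2}{\partial r^2} + \dfrac{1}{r}\dfrac{\partial}{\partial r} + \dfrac{1}{r^2}\dfrac{\partial^2}{\partial\theta^2} + \dfrac{\partial^2}{\partial z^2}$

and $X_z$ is the body force per unit volume in the $z$-direction, the only
nonvanishing component that can be treated by such a strain function.

On putting $F_1 = F_2 = 0$ and $F_3 = Z$ in Eqs. (8.4:10)-(8.4:14), we obtain
the following expressions for the physical components of displacements and
stresses:

In rectangular coordinates:

(3) $2Gu_x = -\dfrac{\partial^2 Z}{\partial x\,\partial z}$,   $2Gu_y = -\dfrac{\partial^2 Z}{\partial y\,\partial z}$,   $2Gu_z = \left[2(1 - \nu)\nabla^2 - \dfrac{\partial^2}{\partial z^2}\right]Z$,

 $\sigma_{xx} + \sigma_{yy} + \sigma_{zz} = (1 + \nu)\dfrac{\partial\nabla^2 Z}{\partial z}$,

 $\sigma_{xx} = \dfrac{\partial}{\partial z}\left(\nu\nabla^2 - \dfrac{\partial^2}{\partial x^2}\right)Z$,

 $\sigma_{yy} = \dfrac{\partial}{\partial z}\left(\nu\nabla^2 - \dfrac{\partial^2}{\partial y^2}\right)Z$,

 $\sigma_{zz} = \dfrac{\partial}{\partial z}\left[(2 - \nu)\nabla^2 - \dfrac{\partial^2}{\partial z^2}\right]Z$,

 $\sigma_{xy} = -\dfrac{\partial^3 Z}{\partial x\,\partial y\,\partial z}$,

 $\sigma_{yz} = \dfrac{\partial}{\partial y}\left[(1 - \nu)\nabla^2 - \dfrac{\partial^2}{\partial z^2}\right]Z$,

 $\sigma_{zx} = \dfrac{\partial}{\partial x}\left[(1 - \nu)\nabla^2 - \dfrac{\partial^2}{\partial z^2}\right]Z$.

In cylindrical coordinates:

(4) $2G\xi_r = -\dfrac{\partial^2 Z}{\partial r\,\partial z}$,   $2G\xi_\theta = -\dfrac{1}{r}\dfrac{\partial^2 Z}{\partial\theta\,\partial z}$,   $2G\xi_z = \left[2(1 - \nu)\nabla^2 - \dfrac{\partial^2}{\partial z^2}\right]Z$,

 $\Theta = \sigma_{rr} + \sigma_{\theta\theta} + \sigma_{zz} = (1 + \nu)\dfrac{\partial\nabla^2 Z}{\partial z}$,

 $\sigma_{rr} = \dfrac{\partial}{\partial z}\left(\nu\nabla^2 - \dfrac{\partial^2}{\partial r^2}\right)Z$,

 $\sigma_{\theta\theta} = \dfrac{\partial}{\partial z}\left(\nu\nabla^2 - \dfrac{1}{r}\dfrac{\partial}{\partial r} - \dfrac{1}{r^2}\dfrac{\partial^2}{\partial\theta^2}\right)Z$,

 $\sigma_{zz} = \dfrac{\partial}{\partial z}\left[(2 - \nu)\nabla^2 - \dfrac{\partial^2}{\partial z^2}\right]Z$,

 $\sigma_{r\theta} = -\dfrac{\partial^3}{\partial r\,\partial\theta\,\partial z}\left(\dfrac{Z}{r}\right)$,

 $\sigma_{\theta z} = \dfrac{1}{r}\dfrac{\partial}{\partial\theta}\left[(1 - \nu)\nabla^2 - \dfrac{\partial^2}{\partial z^2}\right]Z$,

 $\sigma_{zr} = \dfrac{\partial}{\partial r}\left[(1 - \nu)\nabla^2 - \dfrac{\partial^2}{\partial z^2}\right]Z$.

Love introduced, in 1906, the strain function $Z$ as a function of $r$ and $z$
only in treating solids of revolution under axisymmetric loading.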

8.8. KELVIN’S PROBLEM — A SINGLE FORCE ACTING 
IN THE INTERIOR OF AN INFINITE SOLID 

Let a force $2P$ be applied at the origin in the direction of $z$ (Fig. 8.8:1).
This concentrated force may be regarded as the limit of a system of loads
applied on the surface of a small cavity at the origin. The boundary conditions
of the problem are: (1) At infinity, all stresses vanish. (2) At the origin, the
stress singularity is equivalent to a concentrated force of magnitude $2P$ in the
$z$-direction.

Fig. 8.8:1. Kelvin’s problem.

The symmetry of the problem suggests the use of cylindrical coordinates,
and Love’s strain function $Z(r, z)$ suggests itself. Since body force is absent,
$Z$ must be a biharmonic function whose third partial derivatives should
define stresses that vanish at infinity, but have a singularity at the origin.
One such function is

(1) $Z = BR = B(z^2 + r^2)^{1/2}$

(the function $1/R$ is harmonic, hence $R = R^2(1/R)$ is biharmonic, see Theorem 2,
Sec. 8.11), for which

(2) $\dfrac{\partial Z}{\partial r} = \dfrac{Br}{R}$,   $\dfrac{\partial^2 Z}{\partial r^2} = B\left(\dfrac{1}{R} - \dfrac{r^2}{R^3}\right) = \dfrac{Bz^2}{R^3}$,

 $\dfrac{\partial Z}{\partial z} = \dfrac{Bz}{R}$,   $\dfrac{\partial^2 Z}{\partial z^2} = B\left(\dfrac{1}{R} - \dfrac{z^2}{R^3}\right) = \dfrac{Br^2}{R^3}$,

 $\nabla^2 Z = \dfrac{2B}{R}$.

Therefore,

(3) $2G\xi_r = \dfrac{Brz}{R^3}$,   $2G\xi_\theta = 0$,

 $2G\xi_z = B\left[\dfrac{2(1 - 2\nu)}{R} + \dfrac{1}{R} + \dfrac{z^2}{R^3}\right]$,

 $\sigma_{rr} = B\left[\dfrac{(1 - 2\nu)z}{R^3} - \dfrac{3r^2 z}{R^5}\right]$,

 $\sigma_{\theta\theta} = \dfrac{(1 - 2\nu)Bz}{R^3}$,

 $\sigma_{zz} = -B\left[\dfrac{(1 - 2\nu)z}{R^3} + \dfrac{3z^3}{R^5}\right]$,

 $\sigma_{rz} = -B\left[\dfrac{(1 - 2\nu)r}{R^3} + \dfrac{3rz^2}{R^5}\right]$,

 $\sigma_{z\theta} = \sigma_{r\theta} = 0$.

These stresses are singular at the origin and vanish at infinity, and they
have the correct symmetry. Therefore, the stress singularity can only be
equivalent to a vertical force. Thus, Eq. (1) is indeed the desired solution if
this force can be made equal to $2P$ by a proper choice of the constant $B$. To
determine $B$, we consider a cylinder with a cavity at its center at the origin
and with bases at $z = \pm a$ (Fig. 8.8:2). Since this cylinder is in equilibrium,
the resultant of surface tractions on the outer surface must balance the load $2P$
acting on the surface of the cavity at the origin. Therefore, we must have,
when the radius of this cylinder tends to infinity,

$2P = \int_0^\infty 2\pi r\,dr\,(-\sigma_{zz})_{z=a} + \int_0^\infty 2\pi r\,dr\,(\sigma_{zz})_{z=-a} + \lim_{r\to\infty}\int_{-a}^{a} 2\pi r\,dz\,(-\sigma_{rz})$.

Fig. 8.8:2. Boundary used in evaluating the integration constant.

The values of the first and the second integrals are seen to be the same,
and the third integral vanishes in the limit. Noting that $r\,dr = R\,dR$,
we have

$2P = 2\int_a^\infty 2\pi R\,dR\,(-\sigma_{zz})_{z=a} = 4\pi B\left[(1 - 2\nu)a\int_a^\infty\dfrac{dR}{R^2} + 3a^3\int_a^\infty\dfrac{dR}{R^4}\right] = 8\pi(1 - \nu)B$.

Hence,

(4) $B = \dfrac{P}{4\pi(1 - \nu)}$,

and the solution is completed.
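
The value of $B$ can be confirmed by integrating the normal stress numerically. The sketch below is an illustration only: it takes $\sigma_{zz} = -B[(1 - 2\nu)z/R^3 + 3z^3/R^5]$ from Eq. (3), integrates $2\pi r(-\sigma_{zz})$ over the whole plane $z = a$ with a midpoint rule after the substitution $r = a\tan t$, and doubles the result to account for the plane $z = -a$. The names and the sample values of $P$, $\nu$, $a$ are assumptions.

```python
import math

def sigma_zz(r, z, B, nu):
    """Axial stress of Kelvin's solution in cylindrical coordinates."""
    R = math.hypot(r, z)
    return -B * ((1.0 - 2.0 * nu) * z / R ** 3 + 3.0 * z ** 3 / R ** 5)

def force_through_plane(a, B, nu, n=20000):
    """Integral of 2*pi*r*(-sigma_zz) dr over the whole plane z = a (midpoint rule)."""
    total, dt = 0.0, (math.pi / 2.0) / n
    for i in range(n):
        t = (i + 0.5) * dt
        r = a * math.tan(t)                  # maps t in (0, pi/2) onto r in (0, inf)
        dr_dt = a / math.cos(t) ** 2
        total += 2.0 * math.pi * r * (-sigma_zz(r, a, B, nu)) * dr_dt * dt
    return total

P, nu, a = 1.0, 0.3, 0.75                    # assumed sample values
B = P / (4.0 * math.pi * (1.0 - nu))         # Eq. (4)
two_P = 2.0 * force_through_plane(a, B, nu)  # both planes carry the same resultant
```

The result is independent of the chosen height $a$, as it must be for a self-equilibrated slab of material.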

An especially simple situation results when Poisson’s ratio is $\tfrac{1}{2}$. Then
the factor $(1 - 2\nu)$ vanishes, and the stresses become

(5) $\sigma_{rr} = -\dfrac{3Pr^2 z}{2\pi R^5}$,   $\sigma_{\theta\theta} = 0$,   $\sigma_{r\theta} = \sigma_{z\theta} = 0$,

 $\sigma_{zz} = -\dfrac{3Pz^3}{2\pi R^5}$,   $\sigma_{rz} = -\dfrac{3Prz^2}{2\pi R^5}$.

These results appear even simpler if the stresses are resolved in directions
of spherical coordinates. In any meridional plane (see Fig. 8.8:3), at a
point whose coordinates are $(r, z)$ in cylindrical and $(R, \varphi)$ in spherical
coordinates, we find

Fig. 8.8:3. Simple interpretation of the solution to Kelvin’s problem when Poisson’s
ratio is 1/2.

(6) $\sigma_{RR} = \sigma_{rr}\sin^2\varphi + \sigma_{zz}\cos^2\varphi + 2\sigma_{rz}\sin\varphi\cos\varphi$

 $= \sigma_{rr}\dfrac{r^2}{R^2} + \sigma_{zz}\dfrac{z^2}{R^2} + 2\sigma_{rz}\dfrac{rz}{R^2} = -\dfrac{3Pz}{2\pi R^3}$.

Similarly, we have

(7) $\sigma_{\varphi\varphi} = \sigma_{R\varphi} = \sigma_{\theta\theta} = 0$,

and, by symmetry,

(8) $\sigma_{R\theta} = \sigma_{\varphi\theta} = 0$.

Hence, $\sigma_{RR}$, $\sigma_{\varphi\varphi}$, $\sigma_{\theta\theta}$ are principal stresses, and the only nonvanishing
component is $\sigma_{RR}$. If the solid were divided into many cones extending
from a common vertex at the origin, each of these cones would transmit its
own radial force without reaction from the adjacent cones.
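
The resolution into spherical components can be checked directly. The sketch below is illustrative, with assumed function names and sample values: it evaluates the cylindrical stresses for Poisson's ratio $\tfrac12$, rotates them in the meridional plane through the polar angle $\varphi$ measured from the $z$-axis ($\sin\varphi = r/R$, $\cos\varphi = z/R$), and returns $\sigma_{RR}$, $\sigma_{\varphi\varphi}$, $\sigma_{R\varphi}$.

```python
import math

def kelvin_stresses_incompressible(r, z, P):
    """(sigma_rr, sigma_zz, sigma_rz) of Kelvin's problem for Poisson's ratio 1/2."""
    R = math.hypot(r, z)
    k = -3.0 * P / (2.0 * math.pi * R ** 5)
    return k * r * r * z, k * z ** 3, k * r * z * z

def to_spherical(s_rr, s_zz, s_rz, r, z):
    """Rotate meridional-plane stresses to (sigma_RR, sigma_phiphi, sigma_Rphi)."""
    R = math.hypot(r, z)
    s, c = r / R, z / R          # sine and cosine of the polar angle from the z-axis
    s_RR = s_rr * s * s + s_zz * c * c + 2.0 * s_rz * s * c
    s_pp = s_rr * c * c + s_zz * s * s - 2.0 * s_rz * s * c
    s_Rp = (s_rr - s_zz) * s * c + s_rz * (c * c - s * s)
    return s_RR, s_pp, s_Rp

r0, z0, P = 0.7, 1.2, 2.0        # assumed sample point and load
s_RR, s_pp, s_Rp = to_spherical(*kelvin_stresses_incompressible(r0, z0, P), r0, z0)
expected_RR = -3.0 * P * z0 / (2.0 * math.pi * math.hypot(r0, z0) ** 3)
```

Only the radial component survives, which is the numerical counterpart of the picture of independent cones carrying purely radial force.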

In the special case of Poisson’s ratio $\nu = \tfrac{1}{2}$, it is evident that the solution
of the Kelvin problem also furnishes the solution to Boussinesq’s problem
of a normal force and Cerruti’s problem of a tangential force acting on the
boundary of a semi-infinite solid (see Fig. 8.8:4). When $\nu = \tfrac{1}{2}$, Eqs. (5)-(8)
hold for Boussinesq’s problem when the solid occupies the space $z > 0$, and
for Cerruti’s problem when the solid occupies the space $x > 0$.

Fig. 8.8:4. Boussinesq and Cerruti’s problems.

The observation that the solutions of Boussinesq’s and Cerruti’s problems
take on such simple form when Poisson’s ratio is $\tfrac{1}{2}$ led Westergaard
to consider a method of solution for these problems by perturbation of
Poisson’s ratio, a method to be described in the next section.

Originally, Lord Kelvin, Boussinesq, and Cerruti obtained solutions to
the problems which now are adorned by their names (in 1848, 1878, 1882,
respectively) by an extension of the method of singularities in the theory
of Newtonian potentials. Lord Kelvin (1848) discovered that a particular
solution of Navier’s Eq. (7.1:9) is

(9) $u_i(x) = A\iiint\left[B\dfrac{X_i(\xi)}{r} - (x_j - \xi_j)X_j(\xi)\dfrac{\partial}{\partial x_i}\left(\dfrac{1}{r}\right)\right]d\xi_1\,d\xi_2\,d\xi_3$,

where

$r = [(x_1 - \xi_1)^2 + (x_2 - \xi_2)^2 + (x_3 - \xi_3)^2]^{1/2}$,

$A = \dfrac{\lambda + G}{8\pi G(\lambda + 2G)}$,   $B = \dfrac{\lambda + 3G}{\lambda + G}$.

In this formula $x$ stands for $(x_1, x_2, x_3)$, and $\xi$ for $(\xi_1, \xi_2, \xi_3)$. The quantity
$r$ is the distance from the field point $x$ to the variable point $\xi$. The
functions $X_i(\xi)$ are the components of the body force $X_i$ expressed in terms
of the variables of integration $\xi_i$. If an idealized case is considered in which
a small sphere is isolated in the body, in which the body force concentrates,
the limiting case will provide a solution of the Kelvin problem named above.
A combination of such a solution with some other singular solutions of
Navier’s equation yields solutions to the Boussinesq and Cerruti problems.
A general theory for this adaptation was given by Betti (1872), who showed
how to express the dilatation and rotation at every point in a solid body
by surface integrals containing explicitly the surface tractions and surface
displacements. A lucid exposition of this approach can be found in Love,
Elasticity,$^{1.1}$ Chapter X.

The corresponding problems of specific normal and tangential displacements
at the origin on a semi-infinite solid were also solved by Boussinesq
(1888). There are numerous ways of arriving at these results. For references,
see Love,$^{1.1}$ p. 243.

Problem 8.6. Investigate the system of stresses corresponding to

(a) Strain potential $\phi = \log(R + z)$, $R^2 = x^2 + y^2 + z^2$.

(b) Love’s strain function $Z = \log(R + z)$,

(c) $Z = x\log(R + z)$.

Find a strain potential that is equivalent to (b).


8.9. PERTURBATION OF ELASTICITY SOLUTIONS BY A 
CHANGE OF POISSON’S RATIO 

In many problems of elasticity, the effect of Poisson’s ratio is relatively
unimportant. In two-dimensional problems without body force, Poisson’s
ratio has no effect on the stress distribution at all. In fact, this is the
basic justification of most photoelasticity studies, in which the selection of
materials for the models can be made regardless of elastic constants.

In three-dimensional problems, the effect of Poisson’s ratio generally
cannot be disregarded, but the examples of Boussinesq’s and Cerruti’s
problems in Sec. 8.8 show that particularly simple solutions can be obtained
when Poisson’s ratio assumes a certain particular value. This leads
one to consider the effect of Poisson’s ratio. Westergaard proposes to obtain
a solution by first solving a problem for a specific value of Poisson’s
ratio $m$, and then determining the necessary corrections when the actual
value of $\nu$ is used. The value of the shear modulus $G$ is considered fixed.

Consider a body bounded by a surface $S$. Let $u_i$ be the solution of a certain
problem, i.e., it satisfies the Navier equation (in rectangular Cartesian
coordinates),

(1) $\dfrac{1}{1 - 2\nu}e_{,i} + u_{i,\alpha\alpha} + \dfrac{X_i}{G} = 0$,   $e = u_{\alpha,\alpha}$,

and assumes on the boundary $S$ surface traction $T_i$ or surface displacement
$u_i$.

Let $u_i^*$ be a solution of another problem, satisfying, for a different value
of Poisson’s ratio $m$, but the same value of $G$, the equation

(2) $\dfrac{1}{1 - 2m}e^*_{,i} + u^*_{i,\alpha\alpha} + \dfrac{X_i^*}{G} = 0$,

with $T_i^*$ or $u_i^*$ on $S$.

This problem shall be assumed to have been solved, and is the starting
point for further investigation on the effect of variation of Poisson’s ratio.
Subtracting Eq. (2) from Eq. (1), and writing

(3) $u_i^{**} = u_i - u_i^*$,   $e^{**} = e - e^*$,   $X_i^{**} = X_i - X_i^*$,

etc., we obtain

(4) $\left(\dfrac{1}{1 - 2\nu} - \dfrac{1}{1 - 2m}\right)e^*_{,i} + \dfrac{1}{1 - 2\nu}e^{**}_{,i} + u^{**}_{i,\alpha\alpha} + \dfrac{X_i^{**}}{G} = 0$

and

(5) $T_i^{**} = T_i - T_i^*$,   or   $u_i^{**} = u_i - u_i^*$   on $S$.

Equations (4) and (5) define another problem in the linear theory of elastic
equilibrium. Any solution of Eqs. (4) and (5), added to a solution of Eq. (2),
which corresponds to Poisson’s ratio $m$, will yield a solution of Eq. (1) with
the value of Poisson’s ratio equal to $\nu$.

To make this concept useful, let us consider a class of problems such as 
Boussinesq’s, in which the boundary conditions take the form of specified 
surface tractions; e.g., 

(6)  $\sigma_{33},\ \sigma_{31},\ \sigma_{32}$ are specified on $S$: $x_3 = \text{const.}$ 

Then it is expedient to consider a problem to solve Eq. (4) with 

(7)  $\sigma^{**}_{33} = \sigma^{**}_{31} = \sigma^{**}_{32} = 0$ on $S$: $x_3 = \text{const.}$ 

When such a particular solution is found, it can be added to the solution 
of Eq. (2) to form a solution of Eq. (1) with the same specified boundary 
values of $\sigma_{33}$, $\sigma_{31}$, $\sigma_{32}$ as listed in Eq. (6). 

Westergaard$^{1.2}$ shows how to construct a general solution in which $\sigma^{**}_{33}$, 
$\sigma^{**}_{31}$, and $\sigma^{**}_{32}$ vanish identically throughout the elastic body. The method is 
based on the assumption that the components of displacement are derived 
from a scalar function $\varphi$ according to the rule 

(8)  $2Gu_1^{**} = \varphi_{,1}, \qquad 2Gu_2^{**} = \varphi_{,2}, \qquad 2Gu_3^{**} = -\varphi_{,3}.$ 

Westergaard calls this a “twinned gradient,” the third component $-\varphi_{,3}$ 
being regarded as the “twin” of the ordinary gradient $\varphi_{,3}$. Now 

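(9)  $\sigma_{ij} = \frac{2\nu}{1-2\nu}\,Ge_{\alpha\alpha}\delta_{ij} + 2Ge_{ij}, \qquad \sigma^*_{ij} = \frac{2m}{1-2m}\,Ge^*_{\alpha\alpha}\delta_{ij} + 2Ge^*_{ij}.$ 

Hence, 

(10)  $\sigma^{**}_{ij} = \sigma_{ij} - \sigma^*_{ij} = \frac{2G(\nu - m)}{(1-2\nu)(1-2m)}\,e^*_{\alpha\alpha}\delta_{ij} + \frac{2G\nu}{1-2\nu}\,e^{**}_{\alpha\alpha}\delta_{ij} + 2Ge^{**}_{ij}.$ 

From Eq. (8), we have 

(11)  $e^{**}_{32} = 0, \qquad e^{**}_{31} = 0.$ 

Hence, 

(12)  $\sigma^{**}_{32} = 0, \qquad \sigma^{**}_{31} = 0.$ 

Furthermore, 

(13)  $2Ge^{**}_{\alpha\alpha} = \varphi_{,11} + \varphi_{,22} - \varphi_{,33} = \nabla^2\varphi - 2\varphi_{,33},$ 

and, from Eq. (10), 

(14)  $\sigma^{**}_{33} = \frac{2G(\nu - m)}{(1-2\nu)(1-2m)}\,e^*_{\alpha\alpha} + \frac{\nu}{1-2\nu}\left(\nabla^2\varphi - 2\varphi_{,33}\right) - \varphi_{,33}.$ 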

Hence, if we choose to have 

(15)  $\sigma^{**}_{33} = 0,$ 

then $\varphi$ must satisfy the equation 

(16)  $\nu\nabla^2\varphi - \varphi_{,33} = -\frac{2G(\nu - m)}{(1-2m)}\,e^*_{\alpha\alpha}.$ 

With $\varphi$ so chosen, Eq. (4) will have to yield the required value of the body 
force $X_i^{**}$ in order to keep the body in equilibrium. However, it is simpler 
to proceed as follows. Compute first the stress $\sigma^{**}_{11}$, 

(17)  $\sigma^{**}_{11} = \sigma^{**}_{11} - \sigma^{**}_{33} = 2G(e^{**}_{11} - e^{**}_{33}) = \varphi_{,11} + \varphi_{,33}.$ 

Then the only nonvanishing components of stress are 

(18)  $\sigma^{**}_{11} = \nabla^2\varphi - \varphi_{,22}, \qquad \sigma^{**}_{22} = \nabla^2\varphi - \varphi_{,11}, \qquad \sigma^{**}_{12} = \varphi_{,12}.$ 

The equations of equilibrium expressed in stresses show at once that the 
components of the required body force are 

(19)  $-X_1^{**} = (\nabla^2\varphi)_{,1}, \qquad -X_2^{**} = (\nabla^2\varphi)_{,2}.$ 

The solution is established if a function $\varphi$ can be found that satisfies 
Eqs. (16) and (19) simultaneously. 

In a majority of significant problems, both the original and the final 
body forces are zero; i.e., 

(20)  $X_i = X_i^* = X_i^{**} = 0.$ 

Then (16) and (19) can be satisfied if 

(21)  $\nabla^2\varphi = 0 \quad \text{and} \quad \varphi_{,33} = \frac{2G(\nu - m)}{1-2m}\,e^*_{\alpha\alpha} = \frac{\nu - m}{1+m}\,\sigma^*_{\alpha\alpha}.$ 

But $e^*_{\alpha\alpha}$ is a harmonic function. Hence, these two equations can be satisfied 
simultaneously, and the method is established. 

The formulas given above are valid as long as $\nu$ and $m$ are not exactly 
equal to 1/2. In case $m \to 1/2$, the dilation $e^*$ tends to zero, but the bulk 
modulus tends to infinity. The sum of normal stresses 

(22)  $\sigma^*_{\alpha\alpha} = \frac{2G(1+m)}{(1-2m)}\,e^*_{\alpha\alpha}$ 

remains finite as $m \to 1/2$. It is easy to show that all the formulas in this 
section remain valid when $m = 1/2$, provided that the $e^*_{\alpha\alpha}$ term is replaced by 
the $\sigma^*_{\alpha\alpha}$ term according to Eq. (22), as in the last equation of Eq. (21). 


8.10. BOUSSINESQ’S PROBLEM 

A load $P$ acts at the origin of coordinates and perpendicular to the 
plane surface of a semi-infinite solid occupying the space $z > 0$ (Fig. 8.8:4). 
When Poisson’s ratio is 1/2, the problem has the simple solution as stated in 
Eqs. (8.8:5) which gives 

(1)  $\Theta^* = \sigma_{11} + \sigma_{22} + \sigma_{33} = -\frac{3P}{2\pi}\frac{z}{R^3}, \qquad R^2 = z^2 + r^2 = x^2 + y^2 + z^2.$ 

If Poisson’s ratio is $\nu$, and $m = 1/2$, Westergaard’s Eq. (8.9:21) becomes, 
in this case, 

(2)  $\nabla^2\varphi = 0, \qquad \frac{\partial^2\varphi}{\partial z^2} = \frac{(1-2\nu)P}{2\pi}\frac{z}{R^3}.$ 

Integrating, we have 

(3)  $\frac{\partial\varphi}{\partial z} = -\frac{(1-2\nu)P}{2\pi R}, \qquad \varphi = -\frac{(1-2\nu)P}{2\pi}\log(R + z).$ 

It can be shown that $\varphi$ is harmonic; therefore the problem is solved. 
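The harmonicity claimed for $\varphi$ can be spot-checked numerically. The following Python sketch is not part of Westergaard's derivation; it applies central finite differences to the function of Eq. (3) and compares $\partial^2\varphi/\partial z^2$ with the right-hand side of Eq. (2). The values of `P`, `NU`, the sample point, and the step `h` are arbitrary assumptions chosen for illustration.

```python
import math

P, NU = 1.0, 0.3                      # assumed load and Poisson's ratio
C = (1.0 - 2.0 * NU) * P / (2.0 * math.pi)

def phi(x, y, z):
    """Function of Eq. (3): phi = -C log(R + z)."""
    R = math.sqrt(x * x + y * y + z * z)
    return -C * math.log(R + z)

def second_diff(f, p, axis, h=1e-3):
    """Central second difference of f at point p along one coordinate axis."""
    lo, hi = list(p), list(p)
    lo[axis] -= h
    hi[axis] += h
    return (f(*hi) - 2.0 * f(*p) + f(*lo)) / (h * h)

def laplacian(f, p, h=1e-3):
    """Finite-difference Laplacian: sum of the three second differences."""
    return sum(second_diff(f, p, axis, h) for axis in range(3))

point = (0.7, -0.4, 1.2)              # any point with z > 0
R = math.sqrt(sum(c * c for c in point))
phi_zz_exact = C * point[2] / R**3    # right-hand side of Eq. (2)

lap = laplacian(phi, point)
phi_zz = second_diff(phi, point, 2)
print("Laplacian of phi  :", lap)     # should be close to zero
print("phi_zz (numerical):", phi_zz)
print("phi_zz (Eq. (2))  :", phi_zz_exact)
```

The finite-difference Laplacian vanishes only up to truncation and round-off error, so this is a plausibility check rather than a proof.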
On substituting Eq. (3) into Eqs. (8.9:8)-(8.9:14), and adding to the 
solution for $m = 1/2$, we obtain the final results, in which the only nonvanishing 
components are 

(4)  $u_r = \frac{P}{4\pi GR}\left[\frac{rz}{R^2} - \frac{(1-2\nu)r}{R+z}\right],$ 

(5)  $u_z = \frac{P}{4\pi GR}\left[2(1-\nu) + \frac{z^2}{R^2}\right],$ 

(6)  $\Theta = \sigma_{rr} + \sigma_{\theta\theta} + \sigma_{zz} = -\frac{(1+\nu)P}{\pi}\frac{z}{R^3},$ 

(7)  $\sigma_{rr} = \frac{P}{2\pi R^2}\left[-\frac{3r^2z}{R^3} + \frac{(1-2\nu)R}{R+z}\right],$ 

(8)  $\sigma_{\theta\theta} = \frac{(1-2\nu)P}{2\pi R^2}\left[\frac{z}{R} - \frac{R}{R+z}\right],$ 

(9)  $\sigma_{zz} = -\frac{3Pz^3}{2\pi R^5},$ 

(10)  $\sigma_{rz} = -\frac{3Prz^2}{2\pi R^5},$ 

where $r^2 = x^2 + y^2$, $R^2 = r^2 + z^2 = x^2 + y^2 + z^2$. 
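For readers who wish to evaluate these results, the formulas of Eqs. (4)-(10) can be sketched in Python as follows. The function names `boussinesq` and `resultant_on_plane`, the dictionary keys, and the sample values are illustrative assumptions, not notation from the text. Two consistency checks are included: the trace of the stresses against Eq. (6), and the resultant of $\sigma_{zz}$ over a horizontal plane, which should balance the load $P$.

```python
import math

def boussinesq(P, nu, G, r, z):
    """Evaluate Eqs. (4)-(10) at a point (r, z) with z >= 0 and R > 0."""
    R = math.hypot(r, z)
    pref_u = P / (4.0 * math.pi * G * R)
    pref_s = P / (2.0 * math.pi * R**2)
    return {
        "u_r": pref_u * (r * z / R**2 - (1.0 - 2.0 * nu) * r / (R + z)),
        "u_z": pref_u * (2.0 * (1.0 - nu) + z**2 / R**2),
        "s_rr": pref_s * (-3.0 * r**2 * z / R**3 + (1.0 - 2.0 * nu) * R / (R + z)),
        "s_tt": (1.0 - 2.0 * nu) * pref_s * (z / R - R / (R + z)),
        "s_zz": -3.0 * P * z**3 / (2.0 * math.pi * R**5),
        "s_rz": -3.0 * P * r * z**2 / (2.0 * math.pi * R**5),
    }

def resultant_on_plane(P, nu, G, z, r_max=100.0, n=100_000):
    """Midpoint-rule integral of sigma_zz over the plane z = const, truncated at r_max."""
    dr = r_max / n
    total = 0.0
    for i in range(n):
        r = (i + 0.5) * dr
        total += boussinesq(P, nu, G, r, z)["s_zz"] * 2.0 * math.pi * r * dr
    return total

fields = boussinesq(P=1.0, nu=0.3, G=1.0, r=0.5, z=0.8)
trace = fields["s_rr"] + fields["s_tt"] + fields["s_zz"]
R = math.hypot(0.5, 0.8)
theta_eq6 = -(1.0 + 0.3) * 1.0 / math.pi * 0.8 / R**3
print(trace, theta_eq6)                        # the two values should agree
print(resultant_on_plane(1.0, 0.3, 1.0, z=1.0))  # should be close to -P
```

The truncation radius `r_max` must be large compared with `z` for the resultant check to be meaningful.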


Problem 8.7. Solve the Boussinesq problem by a combination of a 
Galerkin vector 

$F_1 = F_2 = 0, \qquad F_3 = BR, \qquad R = (r^2 + z^2)^{1/2}$ 

and a Lamé strain potential $\Phi = c\log(R + z)$. Show that $c = -(1-2\nu)B$, 
$B = P/2\pi$. 


Problem 8.8. Solve Cerruti’s problem by the method of “twinned gradient” 
(Reference, Westergaard,$^{1.2}$ p. 142). 


8.11. ON BIHARMONIC FUNCTIONS 

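We have seen that many problems in elasticity are reduced to the 
solution of biharmonic equations with appropriate boundary conditions. It will 
be useful to consider the mathematical problem in some detail. 

We shall consider the equation 

(1)  $\nabla^2\nabla^2 u = 0,$ 

where the operator $\nabla^2$ is, in rectangular Cartesian coordinates $(x, y, z)$, 

(2)  $\nabla^2 = \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2};$ 

in cylindrical polar coordinates $(r, \theta, z)$, 

$x = r\cos\theta, \qquad y = r\sin\theta, \qquad z = z,$ 

(3)  $\nabla^2 = \frac{\partial^2}{\partial r^2} + \frac{1}{r}\frac{\partial}{\partial r} + \frac{1}{r^2}\frac{\partial^2}{\partial\theta^2} + \frac{\partial^2}{\partial z^2};$ 

and, in spherical polar coordinates $(R, \varphi, \theta)$, 

$x = R\sin\varphi\cos\theta, \qquad y = R\sin\varphi\sin\theta, \qquad z = R\cos\varphi,$ 

(4)  $\nabla^2 = \frac{1}{R^2}\frac{\partial}{\partial R}\left(R^2\frac{\partial}{\partial R}\right) + \frac{1}{R^2\sin\varphi}\frac{\partial}{\partial\varphi}\left(\sin\varphi\frac{\partial}{\partial\varphi}\right) + \frac{1}{R^2\sin^2\varphi}\frac{\partial^2}{\partial\theta^2}$ 
     $= \frac{\partial^2}{\partial R^2} + \frac{2}{R}\frac{\partial}{\partial R} + \frac{1}{R^2}\frac{\partial^2}{\partial\varphi^2} + \frac{\cot\varphi}{R^2}\frac{\partial}{\partial\varphi} + \frac{1}{R^2\sin^2\varphi}\frac{\partial^2}{\partial\theta^2}.$ 

The $\nabla^4$ operator is obtained by repeated operation of the above. Thus, 

(5)  $\nabla^4 = \left(\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}\right)\left(\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}\right)$ 
     $= \frac{\partial^4}{\partial x^4} + \frac{\partial^4}{\partial y^4} + \frac{\partial^4}{\partial z^4} + 2\frac{\partial^4}{\partial x^2\partial y^2} + 2\frac{\partial^4}{\partial x^2\partial z^2} + 2\frac{\partial^4}{\partial y^2\partial z^2}.$ 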


A regular solution of Eq. (1) in a region $\mathcal{R}$ is one that is four times 
continuously differentiable in $\mathcal{R}$. A regular solution of Eq. (1) is called a 
biharmonic function. Since Eq. (1) is obtained by repeated operation of 
the Laplace operator (2), and the regular solution of the equation $\nabla^2 u = 0$ 
is called a harmonic function, it is expected that biharmonic functions are 
closely connected with harmonic functions. In fact, we have the following 
theorems due to Almansi.$^{8.1}$ 


Theorem 1. If $u_1$, $u_2$ are two functions, harmonic in a region $\mathcal{R}(x, y, z)$, 
then 

(6)  $u = xu_1 + u_2$ 

is biharmonic in $\mathcal{R}$. Conversely, if $u$ is a given biharmonic function in a 
region $\mathcal{R}$, and if every line parallel to the x-axis intersects the boundary of 
$\mathcal{R}$ in at most two points, then there exist two harmonic functions $u_1$ and 
$u_2$ in $\mathcal{R}$, so that $u$ can be represented in the form of Eq. (6). 

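Proof. The first part of the theorem can be verified directly according to 
the identity 

(7)  $\nabla^2(\varphi\psi) = \varphi\nabla^2\psi + \psi\nabla^2\varphi + 2\left(\frac{\partial\varphi}{\partial x}\frac{\partial\psi}{\partial x} + \frac{\partial\varphi}{\partial y}\frac{\partial\psi}{\partial y} + \frac{\partial\varphi}{\partial z}\frac{\partial\psi}{\partial z}\right).$ 

To prove the converse theorem, we note that the theorem is established if 
we can show that there exists a function $u_1$ such that 

(8)  (a) $\nabla^2 u_1 = 0$,  (b) $\nabla^2(xu_1 - u) = 0.$ 

By virtue of Eq. (8a), the second equation can be written as 

(9)  $\nabla^2 u = \nabla^2(xu_1) = 2\frac{\partial u_1}{\partial x},$ 

which has a particular solution 

(10)  $\bar{u}_1(x, y, z) = \int_{x_0}^{x}\frac{1}{2}\nabla^2 u(\xi, y, z)\,d\xi,$ 

where $x_0$ is an arbitrary point in the region $\mathcal{R}$. This particular solution 
does not necessarily satisfy Eq. (8a). However, since $u$ is biharmonic, we 
have 

(11)  $\frac{\partial}{\partial x}\nabla^2\bar{u}_1 = \nabla^2\frac{\partial\bar{u}_1}{\partial x} = \frac{1}{2}\nabla^2\nabla^2 u = 0.$ 

Hence $\nabla^2\bar{u}_1$ is a function $v(y, z)$ of the variables $y$, $z$ only. Now let us 
determine a function $\bar{\bar{u}}_1(y, z)$ so that 

(12)  $\left(\frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}\right)\bar{\bar{u}}_1 = -v(y, z);$ 

for example, by 

(13)  $\bar{\bar{u}}_1(y, z) = -\frac{1}{2\pi}\iint(\log r)\,v(\eta, \zeta)\,d\eta\,d\zeta,$ 

where $r^2 = (y - \eta)^2 + (z - \zeta)^2$ and the integral extends through the region 
$\mathcal{R}$. Then the function $u_1 = \bar{u}_1 + \bar{\bar{u}}_1$ satisfies both conditions (9) and (8a) 
and the theorem is proved. 

By a slight change in the proof it can be shown that Theorem 1 holds 
as well in the two-dimensional case. 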
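The first part of Theorem 1 can be illustrated exactly on polynomials. The Python sketch below is an illustration under assumed choices: the dictionary representation of polynomials, the helper names, and the sample functions $u_1 = x^2 - y^2$ and $u_2 = xyz$ are not taken from the text. It forms $u = xu_1 + u_2$ as in Eq. (6) and applies the Laplacian twice with integer arithmetic.

```python
from collections import defaultdict

def laplacian(poly):
    """Laplacian of a polynomial stored as {(i, j, k): coeff} for x^i y^j z^k."""
    out = defaultdict(int)
    for powers, c in poly.items():
        for axis in range(3):
            n = powers[axis]
            if n >= 2:
                new = list(powers)
                new[axis] = n - 2
                out[tuple(new)] += c * n * (n - 1)
    return {p: c for p, c in out.items() if c != 0}

def times_x(poly):
    """Multiply a polynomial by x."""
    return {(i + 1, j, k): c for (i, j, k), c in poly.items()}

def add(p, q):
    """Sum of two polynomials, dropping zero coefficients."""
    out = defaultdict(int)
    for src in (p, q):
        for powers, c in src.items():
            out[powers] += c
    return {powers: c for powers, c in out.items() if c != 0}

u1 = {(2, 0, 0): 1, (0, 2, 0): -1}   # x^2 - y^2, harmonic (assumed example)
u2 = {(1, 1, 1): 1}                  # x*y*z, harmonic (assumed example)
u = add(times_x(u1), u2)             # u = x*u1 + u2, Eq. (6)

lap_u = laplacian(u)
bilap_u = laplacian(lap_u)
print(lap_u)     # not empty: u itself is not harmonic
print(bilap_u)   # empty dict: u is biharmonic
```

Here the Laplacian of $u$ comes out as $4x$, which equals $2\,\partial u_1/\partial x$, in agreement with Eq. (9).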
Similarly, we have the following 


Theorem 2. If $u_1$, $u_2$ are two harmonic functions in a three-dimensional 
region $\mathcal{R}$, then 

(14)  $u = (R^2 - R_0^2)u_1 + u_2$ 

is biharmonic in $\mathcal{R}$, where $R^2 = x^2 + y^2 + z^2$ and $R_0$ is an arbitrary 
constant. Conversely, if $u$ is a given biharmonic function in a region $\mathcal{R}$, and 
if $\mathcal{R}$ is such that, with an origin inside $\mathcal{R}$, each radius vector intersects the 
boundary of $\mathcal{R}$ in at most one point, then two harmonic functions $u_1$, $u_2$ 
can be determined so that Eq. (14) holds. 

Proof. The proof of the first part again follows by direct calculation. Since 
$u_1$, $u_2$ are harmonic, an application of the identity (7) yields 

(15)  $\nabla^2 u = u_1\nabla^2 R^2 + 4\left(x\frac{\partial u_1}{\partial x} + y\frac{\partial u_1}{\partial y} + z\frac{\partial u_1}{\partial z}\right) = 6u_1 + 4R\frac{\partial u_1}{\partial R},$ 

      $\nabla^2\nabla^2 u = 6\nabla^2 u_1 + 8\left(\frac{\partial^2 u_1}{\partial x^2} + \frac{\partial^2 u_1}{\partial y^2} + \frac{\partial^2 u_1}{\partial z^2}\right) = 0.$ 

To prove the converse theorem, we note that the theorem is established 
if we can determine a function $u_1$ with the properties 

(16)  (a) $\nabla^2 u_1 = 0$,  (b) $\nabla^2[u - (R^2 - R_0^2)u_1] = 0.$ 

Equation (16b) can be simplified, by virtue of Eq. (16a), into 

(17)  $\nabla^2 u = 6u_1 + 4R\frac{\partial u_1}{\partial R}.$ 

An integral of this differential equation is 

(18)  $u_1 = R^{-3/2}\int_0^R\frac{1}{4}\rho^{1/2}\,\nabla^2 u\,d\rho.$ 

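It will now be shown that this integral indeed satisfies the condition (16a) 
and, hence, is the desired function. The demonstration will be simpler if 
the spherical coordinates are used. From Eq. (18), 

(19)  $\nabla^2 u_1 = \frac{1}{R^2}\frac{\partial}{\partial R}\left(R^2\frac{\partial u_1}{\partial R}\right) + \frac{1}{R^2}\left[\frac{1}{\sin\varphi}\frac{\partial}{\partial\varphi}\left(\sin\varphi\frac{\partial}{\partial\varphi}\right) + \frac{1}{\sin^2\varphi}\frac{\partial^2}{\partial\theta^2}\right]R^{-3/2}\int_0^R\frac{1}{4}\rho^{1/2}\,\nabla^2 u\,d\rho.$ 

The operator in the square bracket can be taken under the sign of 
integration. But, since $u$ is biharmonic, $\nabla^2\nabla^2 u = 0$, we have 

$\left[\frac{1}{\sin\varphi}\frac{\partial}{\partial\varphi}\left(\sin\varphi\frac{\partial}{\partial\varphi}\right) + \frac{1}{\sin^2\varphi}\frac{\partial^2}{\partial\theta^2}\right]\nabla^2 u = -\frac{\partial}{\partial\rho}\left(\rho^2\frac{\partial}{\partial\rho}\nabla^2 u\right).$ 

Therefore, (19) may be written as 

$\nabla^2 u_1 = \frac{1}{R^2}\frac{\partial}{\partial R}\left[R^2\frac{\partial}{\partial R}\left(R^{-3/2}\int_0^R\frac{1}{4}\rho^{1/2}\,\nabla^2 u\,d\rho\right)\right] - R^{-7/2}\int_0^R\frac{1}{4}\rho^{1/2}\frac{\partial}{\partial\rho}\left(\rho^2\frac{\partial}{\partial\rho}\nabla^2 u\right)d\rho.$ 

On carrying out the indicated differentiation in the first term and 
integrating the second term twice by parts, we obtain finally 

$\nabla^2 u_1 = \left[\frac{3}{4}R^{-7/2}\int_0^R\frac{1}{4}\rho^{1/2}\,\nabla^2 u\,d\rho - \frac{1}{8R^2}\nabla^2 u + \frac{1}{4R}\frac{\partial\nabla^2 u}{\partial R}\right] - \left[\frac{3}{4}R^{-7/2}\int_0^R\frac{1}{4}\rho^{1/2}\,\nabla^2 u\,d\rho - \frac{1}{8R^2}\nabla^2 u + \frac{1}{4R}\frac{\partial\nabla^2 u}{\partial R}\right] = 0.$ 

Hence, $u_1$ given by Eq. (18) satisfies all the requirements, and the theorem 
is proved. 

Theorem 2 holds also in the two-dimensional case when $R^2$ is replaced 
by $r^2 = x^2 + y^2$. The proof is analogous to the above. It is also evident 
that the choice of $x$ in Theorem 1 is incidental. The theorem holds when $x$ 
is replaced by $y$ or $z$. 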

Special representation of biharmonic functions in two-dimensions by 
means of analytic functions of a complex variable will be discussed in 
Sec. 9.5. 


Problem 8.9. Yih’s solution of multiple Bessel equations. Harmonic 
functions in cylindrical polar coordinates may assume the form $Z(r)\,e^{i\alpha x}e^{i\beta\theta}$, where 
$Z(r)$ is a Bessel function. Now consider the hyper-Bessel equation: 

$\left(\frac{d^2}{dr^2} + \frac{1}{r}\frac{d}{dr} - \frac{p^2}{r^2} + k^2\right)^n f = 0,$  $n$, positive integer. 

With the help of Almansi’s theorems discussed above, show that if $p$ (taken to be 
positive for convenience) is not an integer, the solutions are $r^m J_{\pm(p+m)}(kr)$, in 
which $m = 0, 1, 2, \ldots, n - 1$; otherwise they are $r^m J_{p+m}(kr)$ and $r^m N_{p+m}(kr)$, 
with $m$ ranging over the same integers. The symbols $J$ and $N$ stand for the Bessel 
function and the Neumann function, respectively. (Chia Shun Yih, Quart. Appl. 
Math., 13, 4, 462-463, 1956.) 

Problem 8.10. Generation of biharmonic functions in cylindrical 
coordinates. If $u_1(z, x)$ is a harmonic function in a space $\mathcal{R}(x, y, z)$, then 

$v_1 = \frac{1}{2\pi}\int_0^{2\pi}u_1(z, r\cos\theta)\,d\theta,$ 

obtained by turning the space around the z-axis, is also harmonic. By selecting 
$u(z, x)$ as the real part of $(z + ix)^n$, in which $n$ is a positive integer, show that 
the following functions are harmonic in cylindrical coordinates $(r, \theta, z)$, where 
$r^2 = x^2 + y^2$: 

$\psi_2 = z^2 - \frac{1}{2}r^2,$ 

$\psi_3 = z^3 - \frac{3}{2}zr^2,$ 

$\psi_4 = z^4 - 3z^2r^2 + \frac{3}{8}r^4,$ 

$\psi_5 = z^5 - 5z^3r^2 + \frac{15}{8}zr^4,$ 

$\psi_6 = z^6 - \frac{15}{2}z^4r^2 + \frac{45}{8}z^2r^4 - \frac{5}{16}r^6.$ 

Show that $z\psi_n$ and $(z^2 + r^2)\psi_n$ are biharmonic ($n = 2, 3, \ldots$). Note: If $u(z, x)$ is 
a biharmonic function, then the process indicated above generates a biharmonic 
function $v(z, r)$. 
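The averaging process of Problem 8.10 can be carried out exactly in a few lines. The following Python sketch is offered as an aid to checking the listed polynomials, not as the intended solution; the function names and the coefficient-dictionary layout are assumptions. It uses the binomial expansion of $(z + ir\cos\theta)^n$, the fact that only even powers of $\cos\theta$ contribute to the real part, and the mean value of $\cos^k\theta$ over a full turn, which is $\binom{k}{k/2}/2^k$ for even $k$.

```python
from fractions import Fraction
from math import comb

def psi_coefficients(n):
    """Coefficients {k: c} of psi_n = sum_k c * z**(n - k) * r**k (k even),
    where psi_n is the mean over theta of Re (z + i r cos(theta))**n."""
    coeffs = {}
    for k in range(0, n + 1, 2):
        # Re(i**k) = (-1)**(k/2); mean of cos**k over [0, 2*pi] = C(k, k/2) / 2**k
        mean_cos_k = Fraction(comb(k, k // 2), 2**k)
        coeffs[k] = (-1) ** (k // 2) * comb(n, k) * mean_cos_k
    return coeffs

def axisymmetric_laplacian(coeffs, n):
    """Apply d2/dr2 + (1/r) d/dr + d2/dz2 to sum_k c_k z**(n - k) r**k.
    Returns {(a, b): c} for the surviving terms c * z**a * r**b."""
    out = {}
    for k, c in coeffs.items():
        a, b = n - k, k
        if b >= 2:   # radial part: (b(b-1) + b) r**(b-2) = b**2 r**(b-2)
            out[(a, b - 2)] = out.get((a, b - 2), 0) + c * b * b
        if a >= 2:   # axial part: a(a-1) z**(a-2)
            out[(a - 2, b)] = out.get((a - 2, b), 0) + c * a * (a - 1)
    return {key: val for key, val in out.items() if val != 0}

for n in range(2, 7):
    c = psi_coefficients(n)
    print(n, c, axisymmetric_laplacian(c, n))   # last entry is {} when psi_n is harmonic
```

The same coefficient dictionaries can be multiplied by $z$ or by $z^2 + r^2$ and passed twice through the Laplacian to test the biharmonic statements.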


8.12. NEUBER-PAPKOVICH REPRESENTATION 


In Sec. 8.4, the displacement field $u_i$ is represented by the Galerkin 
vector $(F_1, F_2, F_3)$, in the form 

(1)  $2Gu_i = 2(1-\nu)\nabla^2 F_i - F_{j,ji}, \qquad i = 1, 2, 3.$ 

If we set 

(2)  $\nabla^2 F_i = \frac{1}{2(1-\nu)}\Phi_i, \qquad F_{j,j} = \Psi,$ 

Eq. (1) becomes 

(3)  $2Gu_i = \Phi_i - \Psi_{,i}.$ 

Consider first the case in which the body force is absent. Since in the 
absence of body force the Galerkin vector must satisfy the biharmonic 
equation, we see that $\Phi_i$ and $\Psi$ satisfy the equations 

(4)  $\nabla^2\Phi_i = 0, \qquad \nabla^4\Psi = 0.$ 

Hence the $\Phi$’s are harmonic and $\Psi$ is biharmonic. They are related 
through Eq. (2) by 

(5)  $\nabla^2\Psi = \frac{1}{2(1-\nu)}\Phi_{j,j}.$ 

On noting that $\nabla^2(x_j\Phi_j) = 2\Phi_{j,j}$, we see that a particular solution of this 
equation is $\frac{1}{4(1-\nu)}x_j\Phi_j$. Hence, the general solution can be written as 

(6)  $\Psi = \frac{1}{4(1-\nu)}x_j\Phi_j + \phi_0,$ 

where $\phi_0$ is an arbitrary harmonic function. On substituting into Eq. (3), 
we obtain 

(7)  $2Gu_i = \frac{3-4\nu}{4(1-\nu)}\Phi_i - \frac{1}{4(1-\nu)}x_j\Phi_{j,i} - \phi_{0,i}.$ 

If we define 

(8)  $\phi_i = \frac{\Phi_i}{4(1-\nu)},$ 

(9)  $\kappa = 3 - 4\nu,$ 

we get 

(10) ▲  $2Gu_i = \kappa\phi_i - x_j\phi_{j,i} - \phi_{0,i}.$ 


This formula expresses $u_i$ in terms of four harmonic functions $\phi_0$, $\phi_1$, $\phi_2$, 
$\phi_3$. It was given independently by P. F. Papkovich (1932) and H. Neuber 
(1934) by different methods. The connection with the Galerkin vector was 
pointed out by Mindlin (1936). The special importance of the 
Neuber-Papkovich solution lies in its strict similarity to a general solution in two 
dimensions (Sec. 9.6), in which case a well-known procedure exists for the 
determination of the harmonic functions involved from specified boundary 
conditions. 

The question of whether all four of the harmonic functions are 
independent, or whether one of them may be eliminated so that the general solution 
of the three-dimensional Navier’s equation involves only three independent 
harmonic functions, has been a subject of much discussion. See Sokolnikoff, 
Elasticity,$^{1.2}$ 2nd ed. (1956), p. 331, and Naghdi$^{8.1}$ (1960). 

If the body force does not vanish, a general solution of Navier’s equation 
can be obtained by adding a particular integral to the Neuber-Papkovich 
solution [see Eq. (8.8:9)]. 


8.13. OTHER METHODS OF SOLUTION OF 
ELASTOSTATIC PROBLEMS 

Two other classical methods of solving Navier’s equations for 
elastostatic problems must be mentioned. The first is Betti’s method, which was 
referred to in Sec. 8.8. The second is the method of integral transformation 
(Fourier, Laplace, Hankel, Mellin, Stieltjes, etc.). For the former, see Love, 
Mathematical Theory,$^{1.2}$ Chap. 10, and for the latter, Sneddon, Fourier 
Transforms$^{9.2}$ (1951), Chap. 10, and Flügge, Encyclopedia of Physics,$^{1.4}$ 
Vol. 6 (1962). 


8.14. REFLECTION AND REFRACTION OF PLANE P 
AND S WAVES 

So far we have considered only static problems. We shall now consider 
some dynamic problems in order to illustrate the use of displacement 
potentials in dynamics. 

According to Sec. 8.2, when the displacements are represented by a scalar 
potential $\phi$ and vector potentials $\psi_1$, $\psi_2$, $\psi_3$ through the expression 

$u_i = \frac{\partial\phi}{\partial x_i} + e_{ijk}\frac{\partial\psi_k}{\partial x_j},$ 

a broad class of solutions is obtained if $\phi$ and $\psi_k$ satisfy the wave equations 

$\frac{\partial^2\phi}{\partial x^2} + \frac{\partial^2\phi}{\partial y^2} + \frac{\partial^2\phi}{\partial z^2} = \frac{1}{c_L^2}\frac{\partial^2\phi}{\partial t^2},$ 

$\frac{\partial^2\psi_k}{\partial x^2} + \frac{\partial^2\psi_k}{\partial y^2} + \frac{\partial^2\psi_k}{\partial z^2} = \frac{1}{c_T^2}\frac{\partial^2\psi_k}{\partial t^2}.$ 

The functions $\phi$ and $\psi_1$, $\psi_2$, $\psi_3$ define dilatational and distortional waves, 
which are called, in seismology, the primary, or P (or “push”), waves, and 
the secondary, or S (or “shake”), waves. If we consider plane waves, as 
in Sec. 7.8, we see that the S waves are polarized. If an S-wave train 
propagates along the x-axis in the x, z-plane (z “vertical”, x “horizontal”), 
and the material particles move in the z-direction (vertical), then we speak 
of SV waves. If the S waves propagate along the x, z-plane but the particles 
move in the y-direction (“horizontal”), then we speak of SH waves. 

Consider a homogeneous isotropic elastic medium occupying a 
half-space $z \ge 0$. Since an elastic medium has two characteristic wave speeds, 
plane P waves hitting the free boundary $z = 0$ are reflected into plane P 
waves and plane S waves. Similarly, incident SV waves are reflected as 
both P and SV waves. If two elastic media are in contact with a “welded” 
interface, then incident P waves will be reflected in the first medium into 
P and S waves, and also refracted in the second medium in P and S waves. 
A similar statement holds for incident SV waves. The SH waves behave 
more simply. A train of incident SH waves will not generate P waves at the 
interface, so it is reflected and refracted in SH waves. 



Fig. 8.14:1. Reflection of a SV ray incident against a plane boundary. 


We shall show that the laws of reflection and refraction are the same 
Snell’s law as in optics. Thus, if we have two homogeneous isotropic elastic 
media $M$ and $M_1$, of infinite extent and in “welded” contact at the plane 
$z = 0$, as shown in Fig. 8.14:1, and if the directions of advance of the waves 
are all parallel to the x, z-plane as illustrated by rays in the figure, then 
for incident SV waves, 

(1)  $\frac{c_T}{\cos f_0} = \frac{c_T}{\cos f} = \frac{c_L}{\cos e} = \frac{c_T^{(1)}}{\cos f_1} = \frac{c_L^{(1)}}{\cos e_1}, \qquad (\therefore f = f_0).$ 

In this equation, $c_L$, $c_T$ are, respectively, the longitudinal and transverse 
wave speeds of the medium $M$, and $c_L^{(1)}$, $c_T^{(1)}$ are the corresponding speeds 
of the medium $M_1$. The angle $f_0$ between the ray of the incident waves 
and the plane boundary is called the angle of emergence of the waves. Its 
complement is called the angle of incidence. Similarly, for incident SH 
waves, we have 

(2)  $\frac{c_T}{\cos f_0} = \frac{c_T}{\cos f} = \frac{c_T^{(1)}}{\cos f_1},$ 

and for incident P waves, 

(3)  $\frac{c_L}{\cos e_0} = \frac{c_L}{\cos e} = \frac{c_T}{\cos f} = \frac{c_L^{(1)}}{\cos e_1} = \frac{c_T^{(1)}}{\cos f_1}$ 

(so that $e = e_0$). If we consider a half-space $M$ with a free surface $z = 0$, 
we have these same equations, with the $c_L^{(1)}$, $c_T^{(1)}$ terms, which are now 
irrelevant, omitted, of course. 

These results are easily proved. Let us work out the case of SV waves 
emerging against a free boundary. Other cases are similar. 

In SV waves emerging at an angle $f_0$ from a free plane boundary, the 
wave front has a normal in the direction of a unit vector whose 
direction cosines are $(\cos f_0, 0, \sin f_0)$; but in the incident SV waves the 
normal to the wave front has a different direction, with direction cosines 
$(\cos f_0, 0, -\sin f_0)$. This change of direction excites a reflected P wave. 
We assume, therefore, that 

(4)  $\phi = \Phi(x\cos e + z\sin e - c_L t), \qquad \psi_1 = \psi_3 = 0,$ 
     $\psi_2 = \psi = \Psi_0(x\cos f_0 - z\sin f_0 - c_T t) + \Psi(x\cos f + z\sin f - c_T t).$ 

The displacements are 

(5)  $u = \frac{\partial\phi}{\partial x} - \frac{\partial\psi}{\partial z}, \qquad w = \frac{\partial\phi}{\partial z} + \frac{\partial\psi}{\partial x},$ 

and the stresses are given by 

(6)  $\sigma_{zz} = \lambda\nabla^2\phi + 2G\left(\frac{\partial^2\phi}{\partial z^2} + \frac{\partial^2\psi}{\partial x\,\partial z}\right), \qquad \sigma_{zx} = G\left(2\frac{\partial^2\phi}{\partial x\,\partial z} + \frac{\partial^2\psi}{\partial x^2} - \frac{\partial^2\psi}{\partial z^2}\right).$ 

The boundary conditions are 

(7)  at $z = 0$, $\qquad \sigma_{zz} = \sigma_{zx} = 0.$ 

On substituting Eqs. (4) and (6) into Eq. (7), we have 

(8)  $(\lambda + 2G\sin^2 e)\,\Phi''(x\cos e - c_L t) - 2G[\cos f_0\sin f_0\,\Psi_0''(x\cos f_0 - c_T t) - \cos f\sin f\,\Psi''(x\cos f - c_T t)] = 0,$ 

(9)  $2\cos e\sin e\,\Phi''(x\cos e - c_L t) + (\cos^2 f_0 - \sin^2 f_0)\,\Psi_0''(x\cos f_0 - c_T t) + (\cos^2 f - \sin^2 f)\,\Psi''(x\cos f - c_T t) = 0.$ 

These equations can be satisfied for all values of $x$ and $t$ only if the 
arguments of the various $\Phi$ and $\Psi$ functions are in a constant ratio. Hence, 

(10)  $\frac{c_T}{\cos f_0} = \frac{c_L}{\cos e} = \frac{c_T}{\cos f}.$  Q.E.D. 

When Eqs. (10) are satisfied, the functional relationships between $\Psi_0$, $\Psi$, 
and $\Phi$ are given by Eqs. (8) and (9). A detailed study of such functional 
relationships yields information about the partitioning of energy among 
the various components of reflected and refracted waves, an important 
subject whose details can be found in Ewing, Jardetzky, and Press.$^{7.4}$ 
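The relation (10) can be turned into a small calculation. The Python sketch below is illustrative only: the function name `sv_reflection_angles` and the convention of returning `None` are assumptions. It takes the angle of emergence $f_0$ of an incident SV ray at a free boundary and returns the angles $f$ and $e$ of the reflected SV and P rays, all measured from the boundary as in the text. When $(c_L/c_T)\cos f_0$ exceeds 1, Eq. (10) gives no real angle $e$, and the sketch simply reports that.

```python
import math

def sv_reflection_angles(f0_deg, c_L, c_T):
    """Angles (degrees, measured from the boundary) of the reflected SV and P rays
    for an incident SV ray with angle of emergence f0_deg, using Eq. (10).

    Returns (f, e); e is None when cos e = (c_L / c_T) cos f0 would exceed 1.
    """
    f0 = math.radians(f0_deg)
    cos_e = (c_L / c_T) * math.cos(f0)
    e = math.degrees(math.acos(cos_e)) if cos_e <= 1.0 else None
    return f0_deg, e          # f = f0 by the first equality of Eq. (10)

# Assumed example: a solid with c_L / c_T = sqrt(3)
c_T, c_L = 1.0, math.sqrt(3.0)
print(sv_reflection_angles(60.0, c_L, c_T))   # reflected P ray is closer to the boundary than the SV ray
print(sv_reflection_angles(30.0, c_L, c_T))   # no real angle e for this emergence angle
```

Since $c_L > c_T$, the reflected P ray always makes a smaller angle with the boundary than the SV rays do.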


8.15. LAMB’S PROBLEM—LINE LOAD SUDDENLY 
APPLIED ON ELASTIC HALF-SPACE 

Lamb,$^{7.4}$ in a classic paper published in 1904, considered the 
disturbance generated in a semi-infinite medium by an impulsive force applied 
along a line or at a point on the surface or inside the medium. Lamb’s 
solution, as well as the extensions thereof, has been studied by Nakano, 
Lapwood, Pekeris, Cagniard, Garvin, Chao, and others. In this section 
we shall consider only the problem of a line load suddenly applied on the 
surface (Fig. 8.15:1). 

Consider a semi-infinite body of homogeneous isotropic linear elastic 
material occupying the space $z \ge 0$. For time $t < 0$, the medium is 
stationary. At $t = 0$, a concentrated load is suddenly applied normal to the 
free surface $z = 0$, along a line coincident with the y-axis. The boundary 
conditions are therefore two-dimensional. We may assume the 
deformation state to be plane strain. The displacement $v$ vanishes, and $u$ and $w$ 
are independent of $y$. Under this assumption, only one component of the 
vector potential is required. According to Sec. 8.2, the displacements are 
represented by 

(1)  $u = \frac{\partial\phi}{\partial x} - \frac{\partial\psi}{\partial z}, \qquad w = \frac{\partial\phi}{\partial z} + \frac{\partial\psi}{\partial x}.$ 

Fig. 8.15:1. A suddenly applied line load on an elastic half-space. 


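Navier’s equations of motion are satisfied if $\phi$ and $\psi$ satisfy the wave 
equations 

(2)  $\frac{\partial^2\phi}{\partial x^2} + \frac{\partial^2\phi}{\partial z^2} = \frac{1}{c_L^2}\frac{\partial^2\phi}{\partial t^2}, \qquad \frac{\partial^2\psi}{\partial x^2} + \frac{\partial^2\psi}{\partial z^2} = \frac{1}{c_T^2}\frac{\partial^2\psi}{\partial t^2}.$ 

The stress components are 

(3)  $\sigma_{yz} = \sigma_{xy} = 0, \qquad \sigma_{yy} = \nu(\sigma_{xx} + \sigma_{zz}),$ 

(4)  $\sigma_{zx} = G\left(\frac{\partial w}{\partial x} + \frac{\partial u}{\partial z}\right) = G\left(2\frac{\partial^2\phi}{\partial x\,\partial z} + \frac{\partial^2\psi}{\partial x^2} - \frac{\partial^2\psi}{\partial z^2}\right),$ 

     $\sigma_{zz} = \lambda e + 2G\frac{\partial w}{\partial z} = \lambda\left(\frac{\partial^2\phi}{\partial x^2} + \frac{\partial^2\phi}{\partial z^2}\right) + 2G\left(\frac{\partial^2\phi}{\partial z^2} + \frac{\partial^2\psi}{\partial x\,\partial z}\right),$ 

     $\sigma_{xx} = \lambda e + 2G\frac{\partial u}{\partial x} = \lambda\left(\frac{\partial^2\phi}{\partial x^2} + \frac{\partial^2\phi}{\partial z^2}\right) + 2G\left(\frac{\partial^2\phi}{\partial x^2} - \frac{\partial^2\psi}{\partial x\,\partial z}\right).$ 

The boundary conditions on the surface $z = 0$ are 

(5)  $(\sigma_{zx})_{z=0} = (\sigma_{zy})_{z=0} = 0, \qquad (\sigma_{zz})_{z=0} = -Q\,\delta(x)\,1(t),$ 

where $1(t)$ is the unit-step function 

(6)  $1(t) = 0$ for $t < 0$, $\qquad 1(t) = 1$ for $t > 0$, 

while $\delta(x)$ is the Dirac delta function, i.e., one whose value is zero 
everywhere except in the neighborhood of $x = 0$ where it becomes infinitely large 
in such a way that $\int_{-\infty}^{\infty}\delta(x)\,dx = 1$. It is seen that the surface stress given 
by Eq. (5) is equivalent to a concentrated load $Q$ per unit length suddenly 
applied on the line $x = 0$, $z = 0$ and maintained constant afterwards. 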

For the conditions at infinity, it is reasonable to demand that (a) all 
displacements and stresses remain finite at infinity, and (b) at large 
distances from the point of application of the load the disturbance consists of 
outgoing waves. These are called the finiteness and radiation conditions, 
respectively. 

In the present problem, the disturbance is propagated outward at a finite 
velocity, so conditions (a) and (b) are equivalent to the statement that there 
exists an outgoing wave front, beyond which the medium is undisturbed. 
The question arises whether the boundary conditions (5) and the finiteness 
and radiation conditions, in the absence of any other disturbances in the 
medium, will determine a unique solution of our problem. For a point load, 
the answer is obviously affirmative, because for a suddenly applied point 
load, the wave front will be at a finite distance from the point of 
application of the load at any finite time. Hence, if we take a volume sufficiently 
large so that it includes the wave front in its interior, we have a finite body 
over whose entire surface the surface tractions are specified. Neumann’s 
uniqueness theorem (Problem 7.1, Sec. 7.4) then guarantees a unique 
solution. For a line load, we do not have such a simple and general proof. 
In fact, difficulty may arise in two-dimensional problems. (For example, in 
the corresponding static problem, that of a static line load acting on the surface, 
the displacement at infinity is logarithmically divergent. See Sec. 9.4, 
Example 2.) However, for the present problem, a unique solution can be 
determined if we assume the deformation to be truly two-dimensional. 

The significance of the last assumption is made clear by the following 
remarks. Note that a cylindrical body subjected to surface forces uniform 
along the generators may have an internal stress state that is not uniform 
along the axis. For example, transient axial waves may be superposed 
without disturbing the lateral boundary conditions. In other words, a seemingly 
two-dimensional problem may actually be three-dimensional. Such an 
occasion also occurs in fluid mechanics. A nontrivial example in hydrodynamics 
is the flow around a circular cylinder, with a velocity field uniform at 
infinity and normal to the cylinder axis. At supercritical Reynolds numbers, 
the three-dimensionality of the flow in the wake, i.e., variation along the 
cylinder axis, is very pronounced and becomes a predominant feature. 

The boundary conditions (5) can be written in a different form. The 
unit-step function $1(t)$ has no Fourier transform. But if we consider it to be 
the limiting case of the function $e^{-\beta t}1(t)$, which has a Fourier transform, 
then the Fourier integral theorem 

(7)  $f(x) = \frac{1}{2\pi}\int_{-\infty}^{\infty}e^{ikx}\,dk\int_{-\infty}^{\infty}f(\xi)\,e^{-ik\xi}\,d\xi,$ 

which is valid for an arbitrary function $f(x)$ that is square-integrable in the 
Lebesgue sense, yields the representation 

(8)  $1(t) = \frac{1}{2\pi}\lim_{\beta\to 0}\int_{-\infty}^{\infty}\frac{e^{i\omega t}}{\beta + i\omega}\,d\omega.$ 

Equation (8) is obtained on substituting $e^{-\beta\xi}1(\xi)$ for $f(\xi)$ in Eq. (7) and 
changing $k$ to $\omega$. Similarly, $\delta(x)$ has no Fourier transform. But, considering 
the delta function as the limit of a square wave 

$f(x) = \frac{1}{2\varepsilon}$ for $|x| < \varepsilon$, $\qquad f(x) = 0$ for $|x| > \varepsilon$, 

as $\varepsilon \to 0$, we can use Eq. (7) to obtain the representation 

(9)  $\delta(x) = \lim_{\varepsilon\to 0}\frac{1}{2\pi\varepsilon}\int_{-\infty}^{\infty}\frac{\sin k\varepsilon}{k}\,e^{ikx}\,dk.$ 

Therefore, the second condition in Eq. (5) may be written 

(10)  $(\sigma_{zz})_{z=0} = \frac{-Q}{4\pi^2}\lim_{\beta\to 0,\ \varepsilon\to 0}\int_{-\infty}^{\infty}\int_{-\infty}^{\infty}\frac{\sin k\varepsilon}{k\varepsilon}\,\frac{1}{\beta + i\omega}\,e^{i(\omega t + kx)}\,dk\,d\omega.$ 

From this, we see that if we can obtain an elementary solution of Eqs. (2) 
satisfying the boundary conditions 

(11)  $(\sigma_{xz})_{z=0} = 0, \qquad (\sigma_{zz})_{z=0} = Z\,e^{i(\omega t + kx)},$ 

then by the principle of superposition the solution to the original problem 
with boundary conditions (5) can be obtained by setting 

(12)  $Z(k, \omega) = \frac{-Q}{4\pi^2}\,\frac{\sin k\varepsilon}{k\varepsilon}\,\frac{1}{\beta + i\omega},$ 

integrating with respect to $k$ and $\omega$ both from $-\infty$ to $\infty$, and then passing 
to the limit $\beta \to 0$, $\varepsilon \to 0$. 

The solution of the elementary problem is obtained by assuming 

(13)  $\phi = A\,e^{-\nu z + ikx + i\omega t}, \qquad \psi = B\,e^{-\nu' z + ikx + i\omega t},$ 

which satisfy the wave Eqs. (2) if 

(14)  $\nu^2 = k^2 - k_\alpha^2, \qquad \nu'^2 = k^2 - k_\beta^2, \qquad k_\alpha = \frac{\omega}{c_L}, \qquad k_\beta = \frac{\omega}{c_T}.$ 

On substituting Eq. (13) into Eqs. (4) and (11), we obtain 

(15)  $-2Ai\nu k - (2k^2 - k_\beta^2)B = 0,$ 

      $(2k^2 - k_\beta^2)A - 2Bik\nu' = \frac{1}{G}Z(k, \omega).$ 

Solving these equations for $A$ and $B$, we obtain 

(16)  $A = \frac{2k^2 - k_\beta^2}{F(k)}\,\frac{Z(k, \omega)}{G}, \qquad B = \frac{-2ik\nu}{F(k)}\,\frac{Z(k, \omega)}{G},$ 

where 

(17)  $F(k) = (2k^2 - k_\beta^2)^2 - 4k^2\nu\nu'$ 

is Rayleigh’s function. 
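Rayleigh's function has a real zero with $k > k_\beta$, and it can be located numerically. The Python sketch below is a minimal illustration: the function names, the bisection strategy, and the sample wave speeds are assumptions, and the decay exponents are named `nu_decay` and `nu_prime` to avoid confusion with Poisson's ratio. It evaluates $F(k)$ of Eq. (17) for real $k \ge k_\beta$, where both $\nu$ and $\nu'$ of Eq. (14) are real, and brackets the zero between $k_\beta$, where $F = k_\beta^4 > 0$, and a larger value of $k$ at which $F < 0$.

```python
import math

def rayleigh_function(k, k_alpha, k_beta):
    """F(k) of Eq. (17) for real k >= k_beta (both decay exponents real)."""
    nu_decay = math.sqrt(k * k - k_alpha * k_alpha)
    nu_prime = math.sqrt(k * k - k_beta * k_beta)
    return (2.0 * k * k - k_beta * k_beta) ** 2 - 4.0 * k * k * nu_decay * nu_prime

def rayleigh_root(k_alpha, k_beta, tol=1e-12):
    """Bisection for the real zero of F(k) with k > k_beta."""
    lo, hi = k_beta, 2.0 * k_beta
    while rayleigh_function(hi, k_alpha, k_beta) > 0.0:
        hi *= 2.0                       # expand until F changes sign
    while hi - lo > tol * k_beta:
        mid = 0.5 * (lo + hi)
        if rayleigh_function(mid, k_alpha, k_beta) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Assumed example: c_L / c_T = sqrt(3), unit frequency
omega, c_T, c_L = 1.0, 1.0, math.sqrt(3.0)
k_alpha, k_beta = omega / c_L, omega / c_T
k_R = rayleigh_root(k_alpha, k_beta)
ratio = k_beta / k_R                    # phase speed omega / k_R divided by c_T
print(k_R, ratio)
```

The ratio $k_\beta/k_R$ is the phase speed $\omega/k_R$ associated with this zero, expressed as a fraction of $c_T$; it is less than 1 because $k_R > k_\beta$.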

A formal solution of our problem is obtained by substituting Eqs. (16) 
and (12) into Eq. (13) and integrating with respect to $\omega$ and $k$ from $-\infty$ 
to $\infty$ as indicated before. In so doing, an appropriate branch of the 
multivalued functions $\nu$ and $\nu'$ must be chosen so that the conditions at infinity 
are satisfied. 

The evaluation of these integrals is a formidable task. Direct integration 
and numerical integration are both exceedingly difficult. Lamb uses the method of 
contour integration. The variables of integration $k$ and $\omega$ are replaced by 
complex variables, and the contours of integration are deformed in such a 
way that some explicit results are obtained. Details can be found in Lamb’s 
paper. For alternative approaches, some very elegant, see the book by Ewing, 
Jardetzky and Press, and the papers by Cagniard, Pekeris and Lifson listed 
in Bibliography 7.4 at the end of this book. See also the papers listed in 
Bibliography 9.2. 


PROBLEMS 


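8.11. Derive potentials to solve the following equations: 

Example: 

$\frac{\partial u}{\partial y} - \frac{\partial v}{\partial x} = 0$ is solved by taking $u = \frac{\partial\phi}{\partial x}$, $v = \frac{\partial\phi}{\partial y}$. 

(b) Plane stress: 

$\frac{\partial\sigma_x}{\partial x} + \frac{\partial\tau_{xy}}{\partial y} = 0, \qquad \frac{\partial\tau_{xy}}{\partial x} + \frac{\partial\sigma_y}{\partial y} = 0.$ 

(c) In the theory of membrane stresses in a flat plate: 

$\frac{\partial N_x}{\partial x} + \frac{\partial N_{xy}}{\partial y} = 0, \qquad \frac{\partial N_{xy}}{\partial x} + \frac{\partial N_y}{\partial y} = 0.$ 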

(d) In theory of bending of plates: 

dQ* , dQy = 0 Mk + = o. 

dx dy ’ dx dy 


9M X y dMy __ 

dx dy y 


8.12. A train of plane waves of wave length $L$ and phase velocity $c$ can be 
represented as 

$\phi = A\exp\left[i\frac{2\pi}{L}(\nu_1 x + \nu_2 y + \nu_3 z \pm ct)\right],$ 

where $A$ is a constant and $\nu_1$, $\nu_2$, $\nu_3$ are the direction cosines of a vector normal to 
the wave front. Considering suitable superpositions of these waves, derive expressions 
for trains of: 

(a) cylindrical waves, and 

(b) spherical waves, such as those generated by a point source. 

Ans. (a) $BJ_0(kr)\,e^{-\nu|z|}\,e^{i\omega t}$, $\nu^2 = k^2 - k_\alpha^2$, $r^2 = x^2 + y^2$. 

(b) $C\frac{1}{R}\exp[\pm i(k_\alpha R - \omega t)]$, $R^2 = x^2 + y^2 + z^2$. 

Here $B$, $C$ are constants, $k_\alpha = \omega/c$, and $k$ is a parameter. The analysis 
may be simplified by means of contour integrations, regarding some 
variables of integration as complex numbers. 

8.13. High pressure vessels of steel will be designed on the basis of von 
Mises’ yield criterion. Consider spherical and cylindrical tanks of outer radius b 
and inner radius a. Compare the maximum internal pressure p at which yielding 
occurs. 

Ans. J 2 — k 2 = 0, k = yield stress in simple shear. 


Sphere: p yie id = 




-1/2 

k. 


Cylinder: p yie id = 



(far away from the ends). 


If $b/a = 1.1$, $p_{\text{yield}} = 0.523k$ for sphere, $= 0.222k$ for cylinder. 

8.14. Consider a spherical fluid gyroscope which consists of a hollow metallic 
sphere filled with a dense fluid. This sphere is rotated at an angular velocity 
$\omega$ about its polar axis. In a steady-state rotation ($\omega$ = constant), what are the 
stresses and displacements in the shell due to the fluid pressure and the centrifugal 
forces acting on the shell? 

8.15. An infinitely long circular cylindrical hole of radius a is drilled in an 
infinite elastic medium. A pressure load is suddenly applied in the hole and starts 
to travel at a constant speed, so that the boundary conditions on the surface of 
the hole are, at r = a, 



The medium is initially quiescent. Determine the response of the medium. 

8.16. Shock tubes are common tools used in aerodynamic research. A shock 
tube consists of a long cylindrical shell, closed at both ends. Near one end a thin 
diaphragm is inserted normal to the tube axis. On one side of the diaphragm the 
air is evacuated; on the other side, gas at high pressure is stored. In operation, 
the diaphragm is suddenly split, and the onrushing gas from the high pressure side 
creates a shock front that travels down the evacuated tube. 

Elastic waves are generated in the tube wall by the bursting of the diaphragm 
and by the moving shock wave. These elastic waves have some effect on the 
instrumentation and measurements. Discuss the transient elastic response of the 
shock tube wall. 

8.17. An infinite elastic medium contains a spherical cavity of radius a. A 
sinusoidally fluctuating pressure acts on the surface of the cavity. Determine the 
displacement field in the medium. 
TWO-DIMENSIONAL PROBLEMS IN LINEARIZED 
THEORY OF ELASTICITY 


The application of the Airy stress function reduces elastostatic problems 
in plane stress and plane strain for isotropic materials to boundary-value 
problems of a biharmonic equation. A general method of solution using the 
theory of functions of a complex variable is available. We shall discuss this 
method briefly and illustrate its utility in solving a few important problems. 

Throughout this chapter $x, y, z$ represent a set of rectangular Cartesian 
coordinates, with respect to which the displacement components are 
written as $u, v, w$, the strain components are $e_{xx}, e_{xy}$, etc., and the stress 
components are $\sigma_{xx}, \sigma_{xy}$, etc. We recall the factor $\frac{1}{2}$ in the definition of the 
strain components: 

$e_{ij} = \frac{1}{2}\left(\frac{\partial u_i}{\partial x_j} + \frac{\partial u_j}{\partial x_i}\right).$ 

When curvilinear coordinates are used, we retain the notations of 
Chapter 4, in which $u_i$ and $e_{ij}$ denote the tensor components of the 
displacement and the strain, respectively; whereas $\xi_i$, $\epsilon_{ij}$ denote the physical 
components of these tensors. See Secs. 4.10-4.12. 

The methods presented in this chapter, like those discussed in the two 
preceding chapters, are for the linearized theory of elasticity. We assume 
that the deformation gradients and velocities are so small that the nonlinear 
terms in Eqs. (1)-(3) of Sec. 7.1 are negligible. In other words, the convective 
acceleration, the convective velocity, and the products of deformation 
gradients are negligibly small compared with the retained linear terms in 
the acceleration, velocity, and strain tensor, respectively. There are beautiful 
methods and results in the linearized theory. There are spectacular 
panoramas in the nonlinear theory. We believe, however, that the best way 
to learn the nonlinear theory is to master the linearized theory first. This 
is what this book tries to do. 

9.1. PLANE STATE OF STRESS OR STRAIN 

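If the stress components $\sigma_{zz}, \sigma_{zx}, \sigma_{zy}$ vanish everywhere, 

(1) $\sigma_{zz} = \sigma_{zx} = \sigma_{zy} = 0,$ 

the state of stress is said to be plane stress parallel to the $x, y$-plane. In 
this case, for isotropic materials, 

(2) $e_{xx} = \frac{1}{E}(\sigma_{xx} - \nu\sigma_{yy}), \quad e_{yy} = \frac{1}{E}(\sigma_{yy} - \nu\sigma_{xx}),$ 

    $e_{zz} = -\frac{\nu}{E}(\sigma_{xx} + \sigma_{yy}), \quad e_{xy} = \frac{1}{2G}\sigma_{xy}, \quad e_{xz} = e_{yz} = 0,$ 

(3) $\sigma_{xx} = \frac{E}{1-\nu^2}(e_{xx} + \nu e_{yy}), \quad \sigma_{yy} = \frac{E}{1-\nu^2}(e_{yy} + \nu e_{xx}), \quad \sigma_{xy} = \frac{E}{1+\nu}e_{xy},$ 

(4) $\sigma_{xx} + \sigma_{yy} = \frac{E}{1-\nu}(e_{xx} + e_{yy}),$ 

(5) $e_{xx} + e_{yy} = \frac{\partial u}{\partial x} + \frac{\partial v}{\partial y}.$ 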


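Substituting Eq. (3) into the equation of equilibrium (7.1:5), we obtain the 
basic equations for plane stress, 

(6) ▲ 
$G\left(\frac{\partial^2 u}{\partial x^2} + \frac{\partial^2 u}{\partial y^2}\right) + G\,\frac{1+\nu}{1-\nu}\,\frac{\partial}{\partial x}\left(\frac{\partial u}{\partial x} + \frac{\partial v}{\partial y}\right) + X = \rho\,\frac{\partial^2 u}{\partial t^2},$ 

$G\left(\frac{\partial^2 v}{\partial x^2} + \frac{\partial^2 v}{\partial y^2}\right) + G\,\frac{1+\nu}{1-\nu}\,\frac{\partial}{\partial y}\left(\frac{\partial u}{\partial x} + \frac{\partial v}{\partial y}\right) + Y = \rho\,\frac{\partial^2 v}{\partial t^2}.$ 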


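If the $z$-component of displacement $w$ vanishes everywhere, and if the 
displacements $u, v$ are functions of $x, y$ only, and not of $z$, the body is said 
to be in plane strain state parallel to the $x, y$-plane. In plane strain we 
must have 

(7) $\frac{\partial u}{\partial z} = \frac{\partial v}{\partial z} = w = 0,$ and $\sigma_{zz} = \nu(\sigma_{xx} + \sigma_{yy})$ (since $e_{zz} = 0$). 

The basic equation (7.1:9) becomes, in plane strain, 

(8) 
$G\left(\frac{\partial^2 u}{\partial x^2} + \frac{\partial^2 u}{\partial y^2}\right) + \frac{G}{1-2\nu}\,\frac{\partial}{\partial x}\left(\frac{\partial u}{\partial x} + \frac{\partial v}{\partial y}\right) + X = \rho\,\frac{\partial^2 u}{\partial t^2},$ 

$G\left(\frac{\partial^2 v}{\partial x^2} + \frac{\partial^2 v}{\partial y^2}\right) + \frac{G}{1-2\nu}\,\frac{\partial}{\partial y}\left(\frac{\partial u}{\partial x} + \frac{\partial v}{\partial y}\right) + Y = \rho\,\frac{\partial^2 v}{\partial t^2}.$ 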


If $\nu$ is replaced by $\nu/(1+\nu)$ in Eq. (8), then it assumes the form (6). 
Hence any problem of a plane state of strain may be solved as a problem of 
a plane state of stress after replacing the true value of $\nu$ by the “apparent 
value” $\nu/(1+\nu)$.† Conversely, any plane stress problem may be solved as 
a problem of plane strain by replacing the true value of $\nu$ by an apparent 
value $\nu/(1-\nu)$. 

† This substitution refers only to the field Eqs. (6) and (8). The boundary conditions, 
the stress-strain relationship, and the shear modulus $G$ are not to be changed. 
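The algebra behind the two apparent values can be checked numerically. The following is a minimal sketch; the function names and the sample value of Poisson's ratio are illustrative assumptions, not part of the text. It compares the coefficient of the dilatation term in the plane stress equations (6) with that in the plane strain equations (8).

```python
def plane_stress_factor(nu):
    """Coefficient (1 + nu)/(1 - nu) of the dilatation term in Eq. (6)."""
    return (1.0 + nu) / (1.0 - nu)


def plane_strain_factor(nu):
    """Coefficient 1/(1 - 2 nu) of the dilatation term in Eq. (8)."""
    return 1.0 / (1.0 - 2.0 * nu)


if __name__ == "__main__":
    nu = 0.3  # hypothetical sample value of Poisson's ratio
    # Replacing nu by nu/(1 + nu) in Eq. (8) reproduces the coefficient of Eq. (6).
    print(plane_strain_factor(nu / (1.0 + nu)), plane_stress_factor(nu))
    # Replacing nu by nu/(1 - nu) in Eq. (6) reproduces the coefficient of Eq. (8).
    print(plane_stress_factor(nu / (1.0 - nu)), plane_strain_factor(nu))
```

Each printed pair should agree to rounding error, which is all that the substitution rule asserts about the field equations.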

The strain state in a long cylindrical body acted on by loads that are 
normal to the axis of the cylinder and uniform in the axial direction often 
can be approximated by a plane strain state. A constant axial strain $e_{zz}$ 
may be imposed on a plane strain state without any change in stresses in 
the $x, y$-plane. Hence a minor extension of the definition of plane strain 
can be formulated by requiring that $e_{zz}$ be a constant, that $u$ and $v$ be 
functions of $x, y$ only, and that $w$ be a linear function of $z$ only. 

The state of stress in a thin flat plate acted on by forces parallel to 
the midplane of the plate is approximately plane stress. However, since 
in general $e_{zz}$ does not vanish, the displacements $u, v, w$ are functions of 
$z$, and the problem is not truly two-dimensional. In fact, it can be shown 
that the general state of plane stress, satisfying Eq. (1), the equations 
of equilibrium, and the Beltrami-Michell compatibility conditions (7.3:6), 
is one in which the stresses $\sigma_{xx}, \sigma_{yy}, \sigma_{xy}$ are parabolically distributed 
throughout the thickness of the plate, i.e., of the form $f(x, y) + g(x, y)z^2$. 
(See Timoshenko and Goodier,$^{1.2}$ p. 241.) However, the part proportional 
to $z^2$ can be made as small as we please compared with the first term, 
by restricting ourselves to plates which are sufficiently thin (with the ratio 
$h/L \to 0$, where $h$ is the plate thickness and $L$ is a characteristic dimension 
of the plate). 

9.2. AIRY STRESS FUNCTIONS FOR TWO-DIMENSIONAL 
PROBLEMS 

For plane stress or plane strain problems, we may try to find general 
stress systems that satisfy the equations of equilibrium and compatibility 
and then determine the solution to a particular problem by the boundary 
conditions. 

Let $x, y$ be a set of rectangular Cartesian coordinates. For plane stress 
and plane strain problems in the $x, y$-plane, the equations of equilibrium 
(3.4:2) are specialized into 

(1) $\frac{\partial \sigma_{xx}}{\partial x} + \frac{\partial \sigma_{xy}}{\partial y} + X = 0,$ 

(2) $\frac{\partial \sigma_{xy}}{\partial x} + \frac{\partial \sigma_{yy}}{\partial y} + Y = 0,$ 

with the boundary conditions 

(3) $l\sigma_{xx} + m\sigma_{xy} = p, \quad l\sigma_{xy} + m\sigma_{yy} = q,$ 

where $l, m$ are the direction cosines of the outer normal to the boundary 
curve and where $p, q$ are surface tractions acting on the boundary surface. 

Fig. 9.2:1. Notations. 

The strain components are, 


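(a) In the plane stress case: 

(4) $e_{xx} = \frac{1}{E}(\sigma_{xx} - \nu\sigma_{yy}), \quad e_{yy} = \frac{1}{E}(\sigma_{yy} - \nu\sigma_{xx}),$ 

    $e_{xy} = \frac{1}{2G}\sigma_{xy} = \frac{1+\nu}{E}\sigma_{xy}.$ 

(b) In the plane strain case: 

(5) $e_{xx} = \frac{1}{E}[(1-\nu^2)\sigma_{xx} - \nu(1+\nu)\sigma_{yy}],$ 

    $e_{yy} = \frac{1}{E}[(1-\nu^2)\sigma_{yy} - \nu(1+\nu)\sigma_{xx}],$ 

    $e_{xy} = \frac{1}{E}(1+\nu)\sigma_{xy}.$ 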

In view of what was discussed in the preceding section, for very thin 
plates we may assume $\sigma_{xx}, \sigma_{yy}, \sigma_{xy}$ to be independent of $z$. Then the 
plane stress problem becomes truly two-dimensional, as well as the plane 
strain problem. 

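The compatibility conditions are as follows (see Sec. 4.6): 

(6) 
$\frac{\partial^2 e_{xx}}{\partial y^2} + \frac{\partial^2 e_{yy}}{\partial x^2} = 2\frac{\partial^2 e_{xy}}{\partial x\,\partial y}, \quad \frac{\partial^2 e_{xx}}{\partial y\,\partial z} = \frac{\partial}{\partial x}\left(-\frac{\partial e_{yz}}{\partial x} + \frac{\partial e_{xz}}{\partial y} + \frac{\partial e_{xy}}{\partial z}\right),$ 

$\frac{\partial^2 e_{yy}}{\partial z^2} + \frac{\partial^2 e_{zz}}{\partial y^2} = 2\frac{\partial^2 e_{yz}}{\partial y\,\partial z}, \quad \frac{\partial^2 e_{yy}}{\partial x\,\partial z} = \frac{\partial}{\partial y}\left(\frac{\partial e_{yz}}{\partial x} - \frac{\partial e_{xz}}{\partial y} + \frac{\partial e_{xy}}{\partial z}\right),$ 

$\frac{\partial^2 e_{zz}}{\partial x^2} + \frac{\partial^2 e_{xx}}{\partial z^2} = 2\frac{\partial^2 e_{xz}}{\partial z\,\partial x}, \quad \frac{\partial^2 e_{zz}}{\partial x\,\partial y} = \frac{\partial}{\partial z}\left(\frac{\partial e_{yz}}{\partial x} + \frac{\partial e_{xz}}{\partial y} - \frac{\partial e_{xy}}{\partial z}\right).$ 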


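On substituting Eq. (4) into the first equation of Eqs. (6), we obtain, in 
the plane stress case, 

(7) $\frac{\partial^2}{\partial y^2}(\sigma_{xx} - \nu\sigma_{yy}) + \frac{\partial^2}{\partial x^2}(\sigma_{yy} - \nu\sigma_{xx}) = 2(1+\nu)\frac{\partial^2 \sigma_{xy}}{\partial x\,\partial y}.$ 

Differentiating Eq. (1) with respect to $x$ and Eq. (2) with respect to $y$ and 
adding, we obtain 

(8) $\frac{\partial^2 \sigma_{xx}}{\partial x^2} + \frac{\partial^2 \sigma_{yy}}{\partial y^2} + \frac{\partial X}{\partial x} + \frac{\partial Y}{\partial y} = -2\frac{\partial^2 \sigma_{xy}}{\partial x\,\partial y}.$ 

Eliminating $\sigma_{xy}$ between Eqs. (7) and (8), we obtain 

(9) $\left(\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2}\right)(\sigma_{xx} + \sigma_{yy}) = -(1+\nu)\left(\frac{\partial X}{\partial x} + \frac{\partial Y}{\partial y}\right).$ 

Similarly, in the plane strain case, we have 

(10) $\left(\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2}\right)(\sigma_{xx} + \sigma_{yy}) = -\frac{1}{1-\nu}\left(\frac{\partial X}{\partial x} + \frac{\partial Y}{\partial y}\right).$ 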


Equations (1), (2), (3), and (9) or (10) define the plane problems in 
terms of the stress components $\sigma_{xx}, \sigma_{yy}, \sigma_{xy}$. If the boundary conditions 
of a problem are such that surface tractions are all known, then the problem 
can be solved in terms of stresses, with no need to mention displacements 
unless they are desired. Even in a mixed boundary-value problem in which 
part of the boundary has prescribed displacements, it still may be 
advantageous to solve for the stress state first. These practical considerations lead 
to the method of Airy stress function.† 

Airy’s method is based on the observation that the left hand side of 
Eqs. (1) and (2) appears as the divergence of a vector. In hydrodynamics 
we are familiar with the fact that the conservation of mass, expressed in 
the equation of continuity 

(11) $\frac{\partial u}{\partial x} + \frac{\partial v}{\partial y} = 0,$ 

where $u, v$ are components of the velocity vector, can be derived from an 
arbitrary stream function $\psi(x, y)$: 

(12) $u = \frac{\partial \psi}{\partial y}, \quad v = -\frac{\partial \psi}{\partial x}.$ 

In other words, if $u, v$ are derived from an arbitrary $\psi(x, y)$ according to 
Eq. (12), then Eq. (11) is satisfied identically. 

Let us use the same technique for Eqs. (1) and (2). These equations can 
be put into the form of Eq. (11) if we assume that the body forces can be 
derived from a potential $V$, so that 

(13) $X = -\frac{\partial V}{\partial x}, \quad Y = -\frac{\partial V}{\partial y}.$ 

A substitution of Eq. (13) into Eqs. (1) and (2) results in 

(14) $\frac{\partial}{\partial x}(\sigma_{xx} - V) + \frac{\partial \sigma_{xy}}{\partial y} = 0, \quad \frac{\partial \sigma_{xy}}{\partial x} + \frac{\partial}{\partial y}(\sigma_{yy} - V) = 0.$ 

† For problems in which displacements are prescribed over the entire boundary, the 
displacement potential or other devices of the preceding chapter should be tried first. 
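Now, as in Eq. (11), these equations are identically satisfied if we introduce 
two stream functions $\Psi$ and $\chi$ in such a way that 

(15) $\sigma_{xx} - V = \frac{\partial \Psi}{\partial y}, \quad \sigma_{xy} = -\frac{\partial \Psi}{\partial x}, \quad \sigma_{xy} = -\frac{\partial \chi}{\partial y}, \quad \sigma_{yy} - V = \frac{\partial \chi}{\partial x}.$ 

In other words, a substitution of Eq. (15) into Eq. (14) reduces Eq. (14) 
into an identity in $\Psi$ and $\chi$. Now, Eqs. (15) can be combined if we let 

(16) $\chi = \frac{\partial \Phi}{\partial x}, \quad \Psi = \frac{\partial \Phi}{\partial y},$ 

i.e., 

(17) ▲ $\sigma_{xx} - V = \frac{\partial^2 \Phi}{\partial y^2}, \quad \sigma_{xy} = -\frac{\partial^2 \Phi}{\partial x\,\partial y}, \quad \sigma_{yy} - V = \frac{\partial^2 \Phi}{\partial x^2}.$ 

It is readily verified that if $\sigma_{xx}, \sigma_{xy}, \sigma_{yy}$ are derived from an arbitrary 
function $\Phi(x, y)$ according to Eq. (17), then Eqs. (14) are identically 
satisfied. The function $\Phi(x, y)$ is called the Airy stress function, in deference to 
its inventor, the famous astronomer. 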
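The statement that Eq. (17) satisfies equilibrium identically can be tried out numerically. The sketch below is only an illustration under stated assumptions: the particular functions `phi` and `V`, the step size, and all function names are hypothetical choices, and the derivatives are approximated by central differences, so the residuals are small rather than exactly zero.

```python
import math


def airy_stresses(phi, V, x, y, h=1e-2):
    """Stresses (sxx, sxy, syy) of Eq. (17), by central differences."""
    phi_yy = (phi(x, y + h) - 2 * phi(x, y) + phi(x, y - h)) / h**2
    phi_xx = (phi(x + h, y) - 2 * phi(x, y) + phi(x - h, y)) / h**2
    phi_xy = (phi(x + h, y + h) - phi(x + h, y - h)
              - phi(x - h, y + h) + phi(x - h, y - h)) / (4 * h**2)
    return phi_yy + V(x, y), -phi_xy, phi_xx + V(x, y)


def equilibrium_residuals(phi, V, x, y, h=1e-2):
    """Left-hand sides of Eqs. (1) and (2) with X = -dV/dx, Y = -dV/dy."""
    plus_x, minus_x = airy_stresses(phi, V, x + h, y, h), airy_stresses(phi, V, x - h, y, h)
    plus_y, minus_y = airy_stresses(phi, V, x, y + h, h), airy_stresses(phi, V, x, y - h, h)
    d_dx = [(p - m) / (2 * h) for p, m in zip(plus_x, minus_x)]
    d_dy = [(p - m) / (2 * h) for p, m in zip(plus_y, minus_y)]
    X = -(V(x + h, y) - V(x - h, y)) / (2 * h)
    Y = -(V(x, y + h) - V(x, y - h)) / (2 * h)
    return d_dx[0] + d_dy[1] + X, d_dx[1] + d_dy[2] + Y


def phi(x, y):
    # Hypothetical smooth function; it need not be biharmonic for equilibrium.
    return math.sin(x) * math.exp(0.5 * y) + x**3 * y


def V(x, y):
    # Hypothetical body-force potential.
    return x * y**2 + math.cos(x)


if __name__ == "__main__":
    print(equilibrium_residuals(phi, V, 0.3, 0.2))
```

Both residuals are of the order of the discretization error for any smooth `phi`, which reflects the fact that equilibrium alone places no restriction on the stress function.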

An arbitrary function $\Phi(x, y)$ generates stresses that satisfy the 
equations of equilibrium, but $\Phi$ is not entirely arbitrary: it is required to 
generate only those stress fields that satisfy the condition of compatibility. Since 
the compatibility condition is given by Eqs. (9) or (10), a substitution gives 
the requirement that, in the plane stress case, 

(18) ▲ $\frac{\partial^4 \Phi}{\partial x^4} + 2\frac{\partial^4 \Phi}{\partial x^2\,\partial y^2} + \frac{\partial^4 \Phi}{\partial y^4} = -(1-\nu)\left(\frac{\partial^2 V}{\partial x^2} + \frac{\partial^2 V}{\partial y^2}\right),$ 

and that, in the plane strain case, 

(19) $\frac{\partial^4 \Phi}{\partial x^4} + 2\frac{\partial^4 \Phi}{\partial x^2\,\partial y^2} + \frac{\partial^4 \Phi}{\partial y^4} = -\frac{1-2\nu}{1-\nu}\left(\frac{\partial^2 V}{\partial x^2} + \frac{\partial^2 V}{\partial y^2}\right).$ 

If the body forces vanish, then, in both plane stress and plane strain, $\Phi$ is 
governed by the equation 

(20) ▲ $\frac{\partial^4 \Phi}{\partial x^4} + 2\frac{\partial^4 \Phi}{\partial x^2\,\partial y^2} + \frac{\partial^4 \Phi}{\partial y^4} = 0.$ 

A regular solution of Eq. (20) is called a biharmonic function. Solution 
of plane elasticity problems by biharmonic functions will be discussed in 
the following sections. 
How about the other five compatibility conditions in Eqs. (6) left alone 
so far? In the case of plane strain it is clear that they are identically 
satisfied. In the case of plane stress, however, they cannot be satisfied in 
general if we assume $\sigma_{xx}, \sigma_{yy}, \sigma_{xy}$ to be independent of $z$. For, under such 
an assumption these compatibility conditions imply that 

(21) $\frac{\partial^2 e_{zz}}{\partial x^2} = \frac{\partial^2 e_{zz}}{\partial y^2} = \frac{\partial^2 e_{zz}}{\partial x\,\partial y} = 0.$ 

Hence, $e_{zz}$, and hence $\sigma_{xx} + \sigma_{yy}$, must be a linear function of $x$ and $y$ 
[$e_{zz} = -\nu(\sigma_{xx} + \sigma_{yy})/E$], which is the exception rather than the rule in the 
solution of plane stress problems. Hence, in general, the assumption that 
the plane stress state is two-dimensional, so that $\sigma_{xx}, \sigma_{yy}, \sigma_{xy}$ are functions of 
$x, y$ only, cannot be true; and the solutions obtained under this assumption 
cannot be exact. However, as we have discussed previously (Sec. 9.1), they 
are close approximations for thin plates. 

The stress-function method can be extended to three dimensions. The 
crucial observation is simply that the equation of equilibrium represents 
a vector divergence of the stress tensor. We are familiar with the stream 
function in hydrodynamics. In three dimensions we need a triple of stream 
functions. Similarly, a generalization of Airy’s procedure to equations 
of equilibrium in three dimensions requires a tensor of stress functions. 
Finzi$^{8.1}$ (1934) showed that a general solution to the equations 

(22) $\sigma_{ij,j} = 0, \quad \sigma_{ij} = \sigma_{ji},$ 

is 

(23) $\sigma_{ij} = e_{imr}\,e_{jns}\,\phi_{rs,mn},$ 

where $\phi_{rs}$ stands for the components of a symmetric second-order tensor 
of stress functions, while $e_{imr}$ is the usual permutation symbol (Sec. 2.1). 
Specialization by taking $\phi_{rs} = 0$ ($r \neq s$) yields Maxwell’s stress 
functions, while taking $\phi_{rr} = 0$ (no sum) yields Morera’s stress functions; see 
Sec. 10.9, Eq. (10.9:19), et seq. If all elements $\phi_{rs}$, except $\phi_{33}$, are assumed 
to vanish, then (23) degenerates into Airy’s solution of the two-dimensional 
equilibrium equations. An elegant proof of Finzi’s result that is applicable 
to $n$-dimensional Euclidean space was given by Dorn and Schild.$^{8.1}$ 

Finzi$^{8.1}$ (1934) obtained further a beautiful extension to the equations 
of motion of a continuum, with an arbitrary density field, by introducing a 
fourth dimension. Finzi’s arbitrary tensor yields by differentiation a motion 
and a stress field that satisfy the equation of motion. 

For a curved space (non-Euclidean), Truesdell$^{16.1}$ obtained related 
results by methods of calculus of variations. The curved space problem 
arises naturally in the intrinsic theory of thin shells or membranes. The 
two-dimensional surface is, to a two-dimensional observer who is not allowed 
to leave the surface, a non-Euclidean space (imbedded, of course, in a 
three-dimensional Euclidean space). 


Example 1. The following polynomials of second and third degree are 
obviously biharmonic. 

$\Phi_2 = a_2 x^2 + b_2 xy + c_2 y^2,$ 

$\Phi_3 = a_3 x^3 + b_3 x^2 y + c_3 x y^2 + d_3 y^3.$ 

By adjusting the constants $a_2, a_3$, etc., many problems in which the stresses 
are linearly distributed on rectangular boundaries can be solved. Examples 
of such problems are shown in Fig. 9.2:2. 

Fig. 9.2:2. Examples of problems solvable by simple polynomials. 
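As a small illustration of Example 1, the sketch below applies the standard 13-point central-difference stencil for the biharmonic operator to the cubic polynomial. The stencil, the sample coefficients, and the function names are assumptions introduced here for illustration; they are not part of the text. For polynomials of degree three the stencil returns zero up to rounding error, while a quartic such as $x^4$ gives the nonzero value 24.

```python
def biharmonic(phi, x, y, h=0.1):
    """13-point central-difference approximation of the biharmonic operator."""
    near = phi(x + h, y) + phi(x - h, y) + phi(x, y + h) + phi(x, y - h)
    diag = (phi(x + h, y + h) + phi(x + h, y - h)
            + phi(x - h, y + h) + phi(x - h, y - h))
    far = (phi(x + 2 * h, y) + phi(x - 2 * h, y)
           + phi(x, y + 2 * h) + phi(x, y - 2 * h))
    return (20 * phi(x, y) - 8 * near + 2 * diag + far) / h**4


def phi3(x, y, a3=1.0, b3=-2.0, c3=0.5, d3=3.0):
    """Cubic polynomial of Example 1 with hypothetical sample coefficients."""
    return a3 * x**3 + b3 * x**2 * y + c3 * x * y**2 + d3 * y**3


def sigma_xx_pure_bending(y, d3):
    """With Phi = d3*y**3 and no body force, sigma_xx = d2(Phi)/dy2 = 6*d3*y."""
    return 6.0 * d3 * y


if __name__ == "__main__":
    print(biharmonic(phi3, 0.7, -0.4))
    print(biharmonic(lambda x, y: x**4, 0.7, -0.4))
```

The single term $d_3 y^3$ already gives a stress $\sigma_{xx}$ varying linearly with $y$, the familiar distribution of pure bending.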

Example 2. Consider a rectangular beam (Fig. 9.2:3) supported at the 
ends and subjected to the surface tractions 

on $y = c$: $\sigma_{xy} = 0, \quad \sigma_{yy} = -B \sin \alpha x,$ 

on $y = -c$: $\sigma_{xy} = 0, \quad \sigma_{yy} = -A \sin \alpha x.$ 

Other edge conditions are unspecified at the beginning, and the body 
forces are absent. 

Fig. 9.2:3. A beam subjected to a sinusoidally distributed loading. 

Solution. Let 

$\Phi = \sin \alpha x\, f(y),$ 

where $f(y)$ is a function of $y$ only. A substitution into Eq. (20) yields 

$\alpha^4 f(y) - 2\alpha^2 f''(y) + f^{IV}(y) = 0.$ 

The general solution is 

$f(y) = C_1 \cosh \alpha y + C_2 \sinh \alpha y + C_3 y \cosh \alpha y + C_4 y \sinh \alpha y.$ 

Hence, 

$\Phi = \sin \alpha x\,(C_1 \cosh \alpha y + C_2 \sinh \alpha y + C_3 y \cosh \alpha y + C_4 y \sinh \alpha y),$ 

$\sigma_{xx} = \sin \alpha x\,[C_1 \alpha^2 \cosh \alpha y + C_2 \alpha^2 \sinh \alpha y + C_3 (y\alpha^2 \cosh \alpha y + 2\alpha \sinh \alpha y) + C_4 (y\alpha^2 \sinh \alpha y + 2\alpha \cosh \alpha y)],$ 

$\sigma_{yy} = -\alpha^2 \sin \alpha x\,[C_1 \cosh \alpha y + C_2 \sinh \alpha y + \cdots],$ 

$\sigma_{xy} = -\alpha \cos \alpha x\,[C_1 \alpha \sinh \alpha y + C_2 \alpha \cosh \alpha y + C_3 (\cosh \alpha y + y\alpha \sinh \alpha y) + C_4 (\sinh \alpha y + y\alpha \cosh \alpha y)].$ 

On application of the boundary conditions $\sigma_{xy} = 0$ on $y = \pm c$, we can 
express $C_3$ and $C_4$ in terms of $C_1$ and $C_2$. The other boundary conditions 
then yield the constants 

$C_1 = \frac{A + B}{\alpha^2}\,\frac{\sinh \alpha c + \alpha c \cosh \alpha c}{\sinh 2\alpha c + 2\alpha c},$ 

$C_2 = -\frac{A - B}{\alpha^2}\,\frac{\cosh \alpha c + \alpha c \sinh \alpha c}{\sinh 2\alpha c - 2\alpha c},$ 

$C_3 = \frac{A - B}{\alpha^2}\,\frac{\alpha \cosh \alpha c}{\sinh 2\alpha c - 2\alpha c},$ 

$C_4 = -\frac{A + B}{\alpha^2}\,\frac{\alpha \sinh \alpha c}{\sinh 2\alpha c + 2\alpha c}.$ 

The details can be found in Timoshenko and Goodier,$^{1.2}$ p. 48. 
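The constants of Example 2 can be checked against the boundary conditions. The sketch below is a numerical illustration only; the function names and sample values are hypothetical. Since $\sigma_{yy} = -\alpha^2 \sin \alpha x\, f(y)$ and $\sigma_{xy} = -\alpha \cos \alpha x\, f'(y)$, the conditions on $y = \pm c$ amount to $\alpha^2 f(c) = B$, $\alpha^2 f(-c) = A$, and $f'(\pm c) = 0$.

```python
import math


def constants(A, B, alpha, c):
    """Constants C1..C4 of Example 2 for load amplitudes A (y = -c) and B (y = +c)."""
    ac = alpha * c
    s_plus = math.sinh(2 * ac) + 2 * ac
    s_minus = math.sinh(2 * ac) - 2 * ac
    C1 = (A + B) / alpha**2 * (math.sinh(ac) + ac * math.cosh(ac)) / s_plus
    C2 = -(A - B) / alpha**2 * (math.cosh(ac) + ac * math.sinh(ac)) / s_minus
    C3 = (A - B) / alpha**2 * alpha * math.cosh(ac) / s_minus
    C4 = -(A + B) / alpha**2 * alpha * math.sinh(ac) / s_plus
    return C1, C2, C3, C4


def f(y, alpha, C):
    """The function f(y) in Phi = sin(alpha x) f(y)."""
    C1, C2, C3, C4 = C
    ch, sh = math.cosh(alpha * y), math.sinh(alpha * y)
    return C1 * ch + C2 * sh + C3 * y * ch + C4 * y * sh


def f_prime(y, alpha, C):
    """Derivative f'(y); sigma_xy is proportional to it."""
    C1, C2, C3, C4 = C
    ch, sh = math.cosh(alpha * y), math.sinh(alpha * y)
    return (C1 * alpha * sh + C2 * alpha * ch
            + C3 * (ch + alpha * y * sh) + C4 * (sh + alpha * y * ch))


if __name__ == "__main__":
    A, B, alpha, c = 2.0, 5.0, 1.3, 0.8  # hypothetical sample values
    C = constants(A, B, alpha, c)
    print(alpha**2 * f(c, alpha, C), alpha**2 * f(-c, alpha, C))
    print(f_prime(c, alpha, C), f_prime(-c, alpha, C))
```

The first line of output should reproduce the amplitudes B and A, and the second should vanish to rounding error.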

9.3. AIRY STRESS FUNCTION IN POLAR COORDINATES 

For two-dimensional problems with circular boundaries, polar 
coordinates can be used to advantage. Let $\xi_r, \xi_\theta, \xi_z$ denote physical components 
of the displacement, and let $\epsilon_{rr}, \epsilon_{r\theta}, \ldots, \sigma_{rr}, \sigma_{r\theta}, \ldots$, etc., be the physical 
components of the strain and stress, respectively. The general equations 
in cylindrical polar coordinates are given in Sec. 4.12. For plane stress 
problems, we assume that 

(1) $\sigma_{zz} = \sigma_{zr} = \sigma_{z\theta} = 0.$ 
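For plane strain problems, we assume that 

(2) $\xi_z = 0,$ 

and that all derivatives with respect to $z$ vanish. In both cases, the strain 
components are defined as follows: 

(3) $\epsilon_{rr} = \frac{\partial \xi_r}{\partial r}, \quad \epsilon_{\theta\theta} = \frac{\xi_r}{r} + \frac{1}{r}\frac{\partial \xi_\theta}{\partial \theta}, \quad \epsilon_{r\theta} = \frac{1}{2}\left(\frac{1}{r}\frac{\partial \xi_r}{\partial \theta} + \frac{\partial \xi_\theta}{\partial r} - \frac{\xi_\theta}{r}\right).$ 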

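Furthermore, in plane stress, 

(4) $\epsilon_{rr} = \frac{1}{E}(\sigma_{rr} - \nu\sigma_{\theta\theta}), \quad \epsilon_{\theta\theta} = \frac{1}{E}(\sigma_{\theta\theta} - \nu\sigma_{rr}), \quad \epsilon_{r\theta} = \frac{1+\nu}{E}\sigma_{r\theta},$ 

and in plane strain, 

(5) $\epsilon_{rr} = \frac{1}{E}[(1-\nu^2)\sigma_{rr} - \nu(1+\nu)\sigma_{\theta\theta}],$ 

    $\epsilon_{\theta\theta} = \frac{1}{E}[(1-\nu^2)\sigma_{\theta\theta} - \nu(1+\nu)\sigma_{rr}],$ 

    $\epsilon_{r\theta} = \frac{1+\nu}{E}\sigma_{r\theta},$ 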


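and the equations of equilibrium become 

(6) $\frac{1}{r}\frac{\partial (r\sigma_{rr})}{\partial r} + \frac{1}{r}\frac{\partial \sigma_{r\theta}}{\partial \theta} - \frac{\sigma_{\theta\theta}}{r} + F_r = 0,$ 

    $\frac{1}{r^2}\frac{\partial (r^2 \sigma_{\theta r})}{\partial r} + \frac{1}{r}\frac{\partial \sigma_{\theta\theta}}{\partial \theta} + F_\theta = 0.$ 

If the body forces $F_r, F_\theta$ are zero, Eqs. (6) are satisfied identically if the 
stresses are derived from a function $\Phi(r, \theta)$: 

(7) ▲ $\sigma_{rr} = \frac{1}{r}\frac{\partial \Phi}{\partial r} + \frac{1}{r^2}\frac{\partial^2 \Phi}{\partial \theta^2}, \quad \sigma_{\theta\theta} = \frac{\partial^2 \Phi}{\partial r^2}, \quad \sigma_{r\theta} = -\frac{\partial}{\partial r}\left(\frac{1}{r}\frac{\partial \Phi}{\partial \theta}\right),$ 

as can be verified by direct substitution. The function $\Phi(r, \theta)$ is the Airy 
stress function. 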

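The compatibility conditions (9.2:9) and (9.2:10) are, if body forces 
are zero, 

(8) $\left(\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2}\right)(\sigma_{xx} + \sigma_{yy}) = 0.$ 

The sum $\sigma_{xx} + \sigma_{yy}$ is an invariant with respect to rotation of 
coordinates. Hence, 

(9) $\sigma_{xx} + \sigma_{yy} = \sigma_{rr} + \sigma_{\theta\theta}.$ 

The Laplace operator is transformed as 

(10) $\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} = \frac{\partial^2}{\partial r^2} + \frac{1}{r}\frac{\partial}{\partial r} + \frac{1}{r^2}\frac{\partial^2}{\partial \theta^2}.$ 

Hence, on substituting Eq. (7) into Eq. (9) and using Eq. (10), Eq. (8) 
is transformed into 

(11) ▲ $\left(\frac{\partial^2}{\partial r^2} + \frac{1}{r}\frac{\partial}{\partial r} + \frac{1}{r^2}\frac{\partial^2}{\partial \theta^2}\right)\left(\frac{\partial^2 \Phi}{\partial r^2} + \frac{1}{r}\frac{\partial \Phi}{\partial r} + \frac{1}{r^2}\frac{\partial^2 \Phi}{\partial \theta^2}\right) = 0.$ 

This is the compatibility equation to be satisfied by the Airy stress function 
$\Phi(r, \theta)$. If one can find a solution of Eq. (11) that also satisfies the boundary 
conditions, then the problem is solved, since by Kirchhoff’s uniqueness 
theorem, such a solution is unique. 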


Axially Symmetric Problems. If $\Phi$ is a function of $r$ alone and is 
independent of $\theta$, all derivatives with respect to $\theta$ vanish and Eq. (11) 
becomes 

(12) $\frac{d^4 \Phi}{dr^4} + \frac{2}{r}\frac{d^3 \Phi}{dr^3} - \frac{1}{r^2}\frac{d^2 \Phi}{dr^2} + \frac{1}{r^3}\frac{d\Phi}{dr} = 0.$ 

This is a homogeneous differential equation which can be reduced to a 
linear differential equation with constant coefficients by introducing a new 
variable $t$ such that $r = e^t$. The general solution is 

(13) $\Phi = A \log r + B r^2 \log r + C r^2 + D,$ 

which corresponds to 

(14) $\sigma_{rr} = \frac{A}{r^2} + B(1 + 2\log r) + 2C,$ 

     $\sigma_{\theta\theta} = -\frac{A}{r^2} + B(3 + 2\log r) + 2C,$ 

     $\sigma_{r\theta} = 0.$ 

The solutions of all problems of symmetrical stress distribution and no body 
forces can be obtained from this. 

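The displacement components corresponding to stresses given by 
Eq. (14) may be obtained as follows. Consider the plane stress case. 
Substituting Eq. (14) into the first of the Eqs. (4) and (3), we obtain 

$\frac{\partial \xi_r}{\partial r} = \frac{1}{E}\left[\frac{(1+\nu)A}{r^2} + 2(1-\nu)B \log r + (1-3\nu)B + 2(1-\nu)C\right],$ 

from which, by integration, 

(15) $\xi_r = \frac{1}{E}\left[-\frac{(1+\nu)A}{r} + 2(1-\nu)B r \log r - B(1+\nu)r + 2C(1-\nu)r\right] + f(\theta),$ 

where $f(\theta)$ is an arbitrary function of $\theta$ only. From Eq. (14) and the second 
of the Eqs. (4) and (3), we obtain 

$\frac{\partial \xi_\theta}{\partial \theta} = \frac{4Br}{E} - f(\theta).$ 

Hence, by integration, 

(16) $\xi_\theta = \frac{4Br\theta}{E} - \int f(\theta)\,d\theta + f_1(r),$ 

where $f_1(r)$ is a function of $r$ only. Finally, from the last of Eqs. (14), 
Eqs. (4) and (3), we find, since $\sigma_{r\theta} = \epsilon_{r\theta} = 0$, 

$\frac{1}{r}\frac{df(\theta)}{d\theta} + \frac{df_1(r)}{dr} + \frac{1}{r}\int f(\theta)\,d\theta - \frac{1}{r}f_1(r) = 0.$ 

Multiplying throughout by $r$, we find that the first and the third term are 
functions of $\theta$ only and the other two terms are functions of $r$ only. Hence, 
the only possibility for the last equation to be satisfied is 

$\frac{df(\theta)}{d\theta} + \int f(\theta)\,d\theta = a, \quad r\frac{df_1(r)}{dr} - f_1(r) = -a,$ 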
where $a$ is an arbitrary constant. The solutions are 

$f(\theta) = \alpha \sin \theta + \gamma \cos \theta, \quad f_1(r) = \beta r + a,$ 

where $\alpha, \beta, \gamma$ are arbitrary constants. Substituting back into Eqs. (15) and 
(16), we obtain, for the plane stress case, 

(17) $\xi_r = \frac{1}{E}\left[-\frac{(1+\nu)A}{r} + 2(1-\nu)B r \log r - B(1+\nu)r + 2C(1-\nu)r\right] + \alpha \sin \theta + \gamma \cos \theta,$ 

     $\xi_\theta = \frac{4Br\theta}{E} + \alpha \cos \theta - \gamma \sin \theta + \beta r.$ 

The arbitrary constants $A, B, C, \alpha, \beta, \gamma$ are to be determined from the 
boundary conditions of each special problem. The corresponding 
expressions for the plane strain case are 

(18) $\xi_r = \frac{1}{E}\left[-\frac{(1+\nu)A}{r} + 2B(1-\nu-2\nu^2)r \log r - B(1+\nu)r + 2C(1-\nu-2\nu^2)r\right] + \alpha \sin \theta + \gamma \cos \theta,$ 

     $\xi_\theta = \frac{4Br\theta}{E}(1-\nu^2) + \alpha \cos \theta - \gamma \sin \theta + \beta r.$ 

Example 1. Uniform Pressure Acting on a Solid Cylinder 

The boundary conditions are, at $r = a$, 

$\sigma_{rr} = -p_0, \quad \sigma_{r\theta} = 0.$ 

There should be no singularity in the solid. A glance at Eq. (14) shows 
that the problem can be solved by taking 

$A = B = 0, \quad C = -\frac{p_0}{2}.$ 

Example 2. Circular Cylindrical Tube Subjected to Uniform Internal and 
External Pressure 

Fig. 9.3:1. Circular tube. 

For this problem (Fig. 9.3:1), the boundary conditions are 

(19) $\sigma_{rr} = -p$ at $r = a, \quad \sigma_{rr} = -q$ at $r = b.$ 

The form of $\sigma_{rr}$ in Eq. (14) suggests that the boundary conditions can be 
satisfied by fixing two of the constants, say $A$ and $C$. From Eq. (19) alone, 
however, there is no reason to reject the term involving $B$; but inspection 
of the displacements given by Eq. (18) shows that if $B \neq 0$ the 
circumferential displacement $\xi_\theta$ will have a nonvanishing term 

(20) $\frac{4Br\theta}{E}(1-\nu^2),$ 

which is zero when $\theta = 0$ but becomes $(1-\nu^2)\,8\pi Br/E$ when one 
traces a circuit around the axis of symmetry and returns to the same point 
after turning around an angle $2\pi$. Thus the displacement given by (20) 
is not single-valued. Such a multi-valued expression for a displacement is 
physically impossible in a full cylindrical tube. Hence, $B = 0$. 

It is simple to derive the constants $A$ and $C$ in the expression 

$\sigma_{rr} = \frac{A}{r^2} + 2C$ 

such that Eq. (19) is satisfied. Thus we obtain Lamé’s formulas for the 
stresses, 

(21) $\sigma_{rr} = -p\,\frac{(b^2/r^2) - 1}{(b^2/a^2) - 1} - q\,\frac{1 - (a^2/r^2)}{1 - (a^2/b^2)},$ 

     $\sigma_{\theta\theta} = p\,\frac{(b^2/r^2) + 1}{(b^2/a^2) - 1} - q\,\frac{1 + (a^2/r^2)}{1 - (a^2/b^2)}.$ 

It is interesting to note that $\sigma_{rr} + \sigma_{\theta\theta}$ is constant throughout the cylinder. 
If $q = 0$ and $p > 0$, $\sigma_{rr}$ is always a compressive stress and $\sigma_{\theta\theta}$ is always a 
tensile stress, the maximum value of which occurs at the inner radius and 
is always numerically greater than the internal pressure $p$. 
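The remarks following Lamé's formulas are easy to confirm numerically. The following sketch evaluates Eq. (21) directly; the function name and the sample radii and pressures are hypothetical choices made for illustration.

```python
def lame_stresses(r, a, b, p, q=0.0):
    """Radial and hoop stresses of Eq. (21) for inner radius a, outer radius b."""
    srr = (-p * ((b / r)**2 - 1.0) / ((b / a)**2 - 1.0)
           - q * (1.0 - (a / r)**2) / (1.0 - (a / b)**2))
    stt = (p * ((b / r)**2 + 1.0) / ((b / a)**2 - 1.0)
           - q * (1.0 + (a / r)**2) / (1.0 - (a / b)**2))
    return srr, stt


if __name__ == "__main__":
    a, b, p, q = 1.0, 2.0, 10.0, 3.0  # hypothetical sample values
    for r in (a, 1.25, 1.5, 1.75, b):
        srr, stt = lame_stresses(r, a, b, p, q)
        # The last column, srr + stt, should not change with r.
        print(r, srr, stt, srr + stt)
```

With $q = 0$ the hoop stress at $r = a$ equals $p\,(b^2 + a^2)/(b^2 - a^2)$, which exceeds $p$ for every finite wall thickness.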
Example 3. Pure Bending of a Curved Bar 

The coefficients $A, B, C$ may be chosen to satisfy the conditions $\sigma_{rr} =$ 



at the boundary $\theta$ = const. (see Fig. 9.3:2). See Timoshenko and 
Goodier,$^{1.2}$ p. 61; Sechler,$^{1.2}$ p. 143. 

Fig. 9.3:2. Pure bending of a curved bar. 


Example 4. Initial Stress or Residual Stress in a Welded Ring 

Fig. 9.3:3. Welding of a ring. 

If the ends of an opening in a ring, as shown in Fig. 9.3:3, are joined 
together and welded, the initial stresses can be obtained from the expressions 
in Eq. (14) by taking 

(22) 



See Eq. (17) for circumferential displacements. The constants $\alpha, \beta, \gamma$ are 
all zero. 


Problem 9.1. Determine the constant $C$ in the stress function 

$\Phi = C[r^2(\alpha - \theta) + r^2 \sin \theta \cos \theta - r^2 \cos^2 \theta \tan \alpha]$ 

required to satisfy the conditions on the upper and lower edges of the triangular 
plate shown in Fig. P9.1. Determine the components of displacement for points 
on the upper edge. 

Fig. P9.1. 
9.4. GENERAL CASE 

By direct substitution, it can be verified that a solution of the equation 

$\left(\frac{\partial^2}{\partial r^2} + \frac{1}{r}\frac{\partial}{\partial r} + \frac{1}{r^2}\frac{\partial^2}{\partial \theta^2}\right)\left(\frac{\partial^2 \Phi}{\partial r^2} + \frac{1}{r}\frac{\partial \Phi}{\partial r} + \frac{1}{r^2}\frac{\partial^2 \Phi}{\partial \theta^2}\right) = 0$ 

is (J. H. Michell, 1899) 

(1) $\Phi = a_0 \log r + b_0 r^2 + c_0 r^2 \log r + d_0 r^2 \theta + a_0' \theta$ 

    $+ \frac{a_1''}{2} r\theta \sin \theta + (a_1 r + b_1 r^3 + a_1' r^{-1} + b_1' r \log r) \cos \theta$ 

    $+ \frac{c_1''}{2} r\theta \cos \theta + (c_1 r + d_1 r^3 + c_1' r^{-1} + d_1' r \log r) \sin \theta$ 

    $+ \sum_{n=2}^{\infty} (a_n r^n + b_n r^{n+2} + a_n' r^{-n} + b_n' r^{-n+2}) \cos n\theta$ 

    $+ \sum_{n=2}^{\infty} (c_n r^n + d_n r^{n+2} + c_n' r^{-n} + d_n' r^{-n+2}) \sin n\theta.$ 

By adjusting the coefficients $a_0, b_0, a_1, \ldots$, etc., a number of important 
problems can be solved. The general form of Eq. (1) as a Fourier series 
in $\theta$ and power series in $r$ provides a powerful means for solving problems 
involving circular and radial boundaries. The individual terms of the 
series provide beautiful solutions to several important engineering problems. 
Many examples are given in Timoshenko and Goodier,$^{1.2}$ pp. 73-130 and 
Sechler,$^{1.2}$ pp. 149-171. 

Example 1. Bending of a Curved Bar By a Force at the End 

Consider a bar of a narrow rectangular cross section and with a circular 
axis, as shown in Fig. 9.4:1, loaded by a force $P$ in the radial direction. 
The bending moment at any cross section is proportional to $\sin \theta$. Since 
the elementary beam theory suggests that the normal stress $\sigma_{\theta\theta}$ is 
proportional to the bending moment, it is reasonable to try a solution for which 
$\sigma_{\theta\theta}$, and hence $\Phi$, is proportional to $\sin \theta$. Such a solution can be obtained 
from the term involving $\sin \theta$ in Eq. (1). 

Fig. 9.4:1. Bending of a curved bar. 

It can be verified that the solution is 

(2) $\Phi = \left(d_1 r^3 + \frac{c_1'}{r} + d_1' r \log r\right) \sin \theta,$ 

with 

$d_1 = \frac{P}{2N}, \quad c_1' = -\frac{P a^2 b^2}{2N}, \quad d_1' = -\frac{P}{N}(a^2 + b^2),$ 

where $N = a^2 - b^2 + (a^2 + b^2) \log (b/a)$. Note that the term $c_1 r \sin \theta$ in 
Eq. (1) does not contribute to the stresses and thus is not used here. The 
stresses are 

(3) $\sigma_{rr} = \frac{P}{N}\left(r + \frac{a^2 b^2}{r^3} - \frac{a^2 + b^2}{r}\right) \sin \theta,$ 

    $\sigma_{\theta\theta} = \frac{P}{N}\left(3r - \frac{a^2 b^2}{r^3} - \frac{a^2 + b^2}{r}\right) \sin \theta,$ 

    $\sigma_{r\theta} = -\frac{P}{N}\left(r + \frac{a^2 b^2}{r^3} - \frac{a^2 + b^2}{r}\right) \cos \theta.$ 

If the boundary stress distribution were exactly as prescribed by the 
equations above, namely, 

(4) $\sigma_{rr} = \sigma_{r\theta} = 0$ for $r = a$ and $r = b$, 

    $\sigma_{\theta\theta} = 0, \quad \sigma_{r\theta} = -\frac{P}{N}\left[r + \frac{a^2 b^2}{r^3} - \frac{1}{r}(a^2 + b^2)\right]$ for $\theta = 0$, 

    $\sigma_{r\theta} = 0, \quad \sigma_{\theta\theta} = \frac{P}{N}\left[3r - \frac{a^2 b^2}{r^3} - (a^2 + b^2)\frac{1}{r}\right]$ for $\theta = \frac{\pi}{2}$, 

then an exact solution is obtained. 

A detailed examination of the exact solution shows that a commonly 
used engineering approximation in the elementary beam theory, that plane 
cross sections of a beam remain plane during bending, gives very 
satisfactory results. 
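A quick numerical check of the stresses in Eq. (3) may be helpful. The sketch below, with hypothetical function names and sample dimensions, confirms that $\sigma_{rr}$ and $\sigma_{r\theta}$ vanish on $r = a$ and $r = b$, and integrates the shear stress over the end section $\theta = 0$ by the midpoint rule (unit thickness assumed) to recover the applied force.

```python
import math


def curved_bar_stresses(r, theta, a, b, P):
    """Stresses (srr, stt, srt) of Eq. (3) for inner radius a, outer radius b."""
    N = a**2 - b**2 + (a**2 + b**2) * math.log(b / a)
    g = r + a**2 * b**2 / r**3 - (a**2 + b**2) / r
    srr = P / N * g * math.sin(theta)
    stt = P / N * (3 * r - a**2 * b**2 / r**3 - (a**2 + b**2) / r) * math.sin(theta)
    srt = -P / N * g * math.cos(theta)
    return srr, stt, srt


def shear_resultant(a, b, P, theta=0.0, n=2000):
    """Midpoint-rule integral of sigma_r_theta over a <= r <= b."""
    dr = (b - a) / n
    total = 0.0
    for i in range(n):
        r = a + (i + 0.5) * dr
        total += curved_bar_stresses(r, theta, a, b, P)[2] * dr
    return total


if __name__ == "__main__":
    a, b, P = 1.0, 2.0, 5.0  # hypothetical sample values
    print(curved_bar_stresses(a, 0.4, a, b, P)[0], curved_bar_stresses(b, 0.4, a, b, P)[0])
    print(shear_resultant(a, b, P))
```

The integral of the bracket in $\sigma_{r\theta}$ from $a$ to $b$ equals $-N$, so the shear resultant on $\theta = 0$ is $P$, in agreement with the loading assumed in the example.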


Example 2. Concentrated Force at a Point on the Edge of a 
Semi-Infinite Plate 

Consider a concentrated vertical load $P$ acting on a horizontal straight 
boundary $AB$ of an infinitely large plate (Fig. 9.4:2). The distribution of 
the load along the thickness of the plate is uniform. The plate thickness is 
assumed to be unity, so that $P$ is the load per unit thickness. 

This is the static counterpart of Lamb’s problem (Secs. 8.15 and 9.7). 
Its solution was obtained by Flamant (1892) from Boussinesq’s (1885) 
three-dimensional solution (Sec. 8.10). In contrast to the dynamic or the 
three-dimensional cases, the solution to the present case is given simply by 
the Airy stress function 

(5) $\Phi = -\frac{P}{\pi} r\theta \sin \theta,$ 

which gives 

(6) $\sigma_{rr} = -\frac{2P}{\pi}\frac{\cos \theta}{r}, \quad \sigma_{\theta\theta} = 0, \quad \sigma_{r\theta} = 0.$ 

From Eq. (6), it is seen that an element at a distance $r$ from the point of 
application of the load is subjected to a simple compression in the radial 
direction. Equation (6) shows also that the locus of points where the radial 
stress is a constant $\sigma_{rr}$ is a circle $r = (-2P/\pi\sigma_{rr}) \cos \theta$, which is tangent 
to the $x$-axis, as shown in Fig. 9.4:2. 
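The Flamant solution lends itself to two simple numerical checks, sketched below under stated assumptions (function names, step sizes, and sample values are hypothetical). First, $\sigma_{rr}$ is recovered from the stress function (5) through Eq. (9.3:7) by central differences. Second, the vertical component of the radial stress is integrated over a semicircle of arbitrary radius $r$; the result should balance the load $P$ independently of $r$.

```python
import math


def flamant_srr(r, theta, P):
    """Radial stress of Eq. (6)."""
    return -2.0 * P * math.cos(theta) / (math.pi * r)


def srr_from_phi(r, theta, P, h=1e-3):
    """sigma_rr = (1/r) dPhi/dr + (1/r^2) d2Phi/dtheta2, with Phi from Eq. (5)."""
    phi = lambda rr, tt: -P / math.pi * rr * tt * math.sin(tt)
    d_r = (phi(r + h, theta) - phi(r - h, theta)) / (2 * h)
    d_tt = (phi(r, theta + h) - 2 * phi(r, theta) + phi(r, theta - h)) / h**2
    return d_r / r + d_tt / r**2


def vertical_resultant(r, P, n=2000):
    """Midpoint-rule integral of sigma_rr*cos(theta)*r over -pi/2 <= theta <= pi/2."""
    dth = math.pi / n
    total = 0.0
    for i in range(n):
        th = -math.pi / 2 + (i + 0.5) * dth
        total += flamant_srr(r, th, P) * math.cos(th) * r * dth
    return total


if __name__ == "__main__":
    P = 3.0  # hypothetical load per unit thickness
    print(srr_from_phi(0.7, 0.5, P), flamant_srr(0.7, 0.5, P))
    print(vertical_resultant(0.7, P), vertical_resultant(4.0, P))
```

The resultant equals $-P$ for every radius, which expresses the equilibrium of the semicircular region with the applied load.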

Using the relations (9.3:3) and (9.3:4) and Eq. (6), we can determine 
the displacement field $\xi_r, \xi_\theta$. If the constraint is such that the points on 
the $y$-axis have no lateral displacement ($\xi_\theta = 0$ when $\theta = 0$), then it can be 
shown that the elastic displacements are 

(7) $\xi_r = -\frac{2P}{\pi E} \cos \theta \log r - \frac{(1-\nu)P}{\pi E}\,\theta \sin \theta + B \cos \theta,$ 

    $\xi_\theta = \frac{2\nu P}{\pi E} \sin \theta + \frac{2P}{\pi E} \log r \sin \theta - \frac{(1-\nu)P}{\pi E}\,\theta \cos \theta + \frac{(1-\nu)P}{\pi E} \sin \theta - B \sin \theta.$ 

The constant $B$ can be fixed by fixing a point, say, $\xi_r = 0$ at $\theta = 0$ 
and $r = a$. But the characteristic logarithmic singularity at $\infty$ cannot 
be removed. This is a peculiarity of the two-dimensional problem. The 
corresponding three-dimensional or dynamic cases do not have such 
logarithmically infinite displacements at infinity. 

PROBLEMS 

9.2. Verify Eqs. (6) and (7) and show that Eq. (5) yields the exact solution 
of the problem posed in Example 2. 

9.3. Find the stresses in a semi-infinite plate ($-\infty < x < \infty$, $0 < y < \infty$) 
due to a shear load of intensity $\tau \cos \alpha x$ acting on the edge $y = 0$, where $\tau$ and $\alpha$ 
are given constants. Hint: Consider $\Phi = (A e^{\alpha y} + B e^{-\alpha y} + C y e^{\alpha y} + D y e^{-\alpha y}) \sin \alpha x$. 

9.4. Show that the function $(M_t/2\pi)\theta$, where $M_t$ is a constant, is a stress 
function. Consider some boundary value problems which may be solved by such 
a stress function and give a physical meaning to the constant $M_t$. 

9.5. Consider a two-dimensional wedge of perfectly elastic material as shown 
in Fig. P9.5. If one side of the wedge ($\theta = 0$) is loaded by a normal pressure 
distribution $p(r) = P r^m$, where $P$ and $m$ are constants, while the other side 
($\theta = \alpha$) is stress-free, show that the problem can be solved by an Airy stress 
function $\Phi(r, \theta)$ expressed in the following form. If $m \neq 0$ ($m$ may be $> 0$ 
or $< 0$), 

(1) $\Phi = r^{m+2}[a \cos (m+2)\theta + b \sin (m+2)\theta + c \cos m\theta + d \sin m\theta].$ 

If $m = 0$, 

(2) $\Phi = K r^2\left(-\tan \alpha \cos^2 \theta + \frac{1}{2} \sin 2\theta + \alpha - \theta\right).$ 

Determine the constants $a, b, c, d, K$. Discuss the boundedness (i.e., whether 
they are zero, finite, or infinite) of the stresses $\sigma_{rr}, \sigma_{\theta\theta}, \sigma_{r\theta}$, the slope $\partial v/\partial r$, 
and the second derivative $\partial^2 v/\partial r^2$, in the neighborhood of the tip of the wedge 
($r \to 0$). The symbol $v$ denotes the component of displacement in the direction 
of increasing $\theta$. 

When the wedge angle $\alpha$ is small, $v$ is approximately equal to the vertical 
displacement. This problem is of interest to the question of curling up of the sharp 
leading edge of a supersonic wing. [Reference: Fung, J. Aeronautical Sciences, 
20, 9 (1953).] 
9.6. A semi-infinite plate ($-\infty < x < \infty$, $0 < y < \infty$) is subjected to the 
edge conditions 


go 4 q 0 / 

2 7T 2 V 


7TX 1 37TX 1 57TX , 
CO S T +-CO S — + -COS— + 


■)' 


= 0 , 


on the edge $y = 0$ (Fig. P9.6). Find the Airy stress function that solves the 
problem. 

Fig. P9.6. 

Obtain expressions for $\sigma_{xx}$ and $\sigma_{yy}$. Sketch $\sigma_{yy}$ as a function of $x$ on the line 
$y = d$, where $d > L$. 


9.5. REPRESENTATION OF TWO-DIMENSIONAL 
BIHARMONIC FUNCTIONS BY ANALYTIC 
FUNCTIONS OF A COMPLEX VARIABLE 

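It is well known that the real and imaginary parts of any analytic function
of a complex variable $z = x + iy$ are harmonic, where $i = \sqrt{-1}$. Thus,
if f(z) is an analytic function of z, and u and v are the real and imaginary
parts of f(z), we have

(1) $f(z) = u + iv$,

(2) $\dfrac{df}{dz} = \dfrac{\partial f}{\partial x} = \dfrac{\partial u}{\partial x} + i\dfrac{\partial v}{\partial x} = -i\dfrac{\partial f}{\partial y} = -i\dfrac{\partial u}{\partial y} + \dfrac{\partial v}{\partial y}$,

(3) $\dfrac{\partial u}{\partial x} = \dfrac{\partial v}{\partial y}, \qquad \dfrac{\partial u}{\partial y} = -\dfrac{\partial v}{\partial x}$,

(4) $\nabla^2 u = 0, \qquad \nabla^2 v = 0, \qquad \nabla^2 f = 0$,

where

$\nabla^2 = \dfrac{\partial^2}{\partial x^2} + \dfrac{\partial^2}{\partial y^2}$

is the two-dimensional Laplace operator. Equation (3) is the well-known
Cauchy-Riemann relation.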
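
As a quick numerical illustration of Eq. (3), the sketch below estimates the two Cauchy-Riemann residuals by central differences. The helper name `cauchy_riemann_residuals`, the sample point, and the two trial functions are assumptions made for illustration only; they do not come from the text.

```python
import cmath

def cauchy_riemann_residuals(f, z, h=1e-6):
    """Return central-difference estimates of (u_x - v_y, u_y + v_x) at z."""
    fx = complex(f(z + h) - f(z - h)) / (2 * h)            # df/dx = u_x + i v_x
    fy = complex(f(z + 1j * h) - f(z - 1j * h)) / (2 * h)  # df/dy = u_y + i v_y
    return fx.real - fy.imag, fy.real + fx.imag

z0 = 0.7 + 0.4j                                            # arbitrary sample point
res_analytic = cauchy_riemann_residuals(cmath.exp, z0)     # exp(z) is analytic
res_modulus = cauchy_riemann_residuals(lambda z: abs(z) ** 2, z0)  # |z|^2 is not
```

For the analytic trial function both residuals vanish to within the differencing error, while for $|z|^2 = x^2 + y^2$ they equal 2x and 2y.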

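An alternative proof that an analytic function is harmonic is as follows.
Let $\bar z = x - iy$ be the complex conjugate of z; then the differential operators
in the x, y coordinates are related to those in the $z, \bar z$ coordinates by

(5) $\dfrac{\partial}{\partial x} = \dfrac{\partial}{\partial z} + \dfrac{\partial}{\partial \bar z}, \qquad \dfrac{\partial}{\partial y} = i\left(\dfrac{\partial}{\partial z} - \dfrac{\partial}{\partial \bar z}\right)$,

or

(6) $2\dfrac{\partial}{\partial z} = \dfrac{\partial}{\partial x} - i\dfrac{\partial}{\partial y}, \qquad 2\dfrac{\partial}{\partial \bar z} = \dfrac{\partial}{\partial x} + i\dfrac{\partial}{\partial y}$.

Equation (6) gives the Laplace operator in the $z, \bar z$ coordinates as

(7) $4\dfrac{\partial^2}{\partial z\,\partial \bar z} = \left(\dfrac{\partial}{\partial x} - i\dfrac{\partial}{\partial y}\right)\left(\dfrac{\partial}{\partial x} + i\dfrac{\partial}{\partial y}\right) = \dfrac{\partial^2}{\partial x^2} + \dfrac{\partial^2}{\partial y^2} = \nabla^2$.

Obviously any analytic function $\psi$ of z or of $\bar z$ is annihilated by the operator in Eq. (7), i.e., $\nabla^2 \psi(z) = 0$
and $\nabla^2 \psi(\bar z) = 0$; so are the real and imaginary parts of $\psi(z)$ and $\psi(\bar z)$.
Repeating the operation of Eq. (7) gives the biharmonic operator

(8) $\nabla^2 \nabla^2 = \left(\dfrac{\partial^2}{\partial x^2} + \dfrac{\partial^2}{\partial y^2}\right)\left(\dfrac{\partial^2}{\partial x^2} + \dfrac{\partial^2}{\partial y^2}\right) = 16\,\dfrac{\partial^4}{\partial z^2\,\partial \bar z^2}$.

Thus any biharmonic function can be written in the form of the real or
imaginary part of $\bar z\,\psi(z) + \chi(z)$ or $z\,\psi(\bar z) + \chi(\bar z)$, where $\psi(z)$, $\chi(z)$ are arbitrary
analytic functions, because

(9) $\nabla^2 \nabla^2\,[\bar z\,\psi(z) + \chi(z)] = \nabla^2 \nabla^2\,[z\,\psi(\bar z) + \chi(\bar z)] = 0$.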
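
The statement in Eq. (9) can be spot-checked numerically. The sketch below applies the standard 13-point finite-difference stencil for the biharmonic operator to the real part of $\bar z\,\psi(z) + \chi(z)$, with the trial choices $\psi = e^z$ and $\chi = z^3$, and to $r^4$, which is not biharmonic. The function names, the trial potentials, and the step size are assumptions made for this illustration.

```python
import cmath

def biharmonic(phi, x, y, h=0.05):
    """13-point finite-difference estimate of (nabla^2 nabla^2 phi) at (x, y)."""
    p = lambda i, j: phi(x + i * h, y + j * h)
    total = (20.0 * p(0, 0)
             - 8.0 * (p(1, 0) + p(-1, 0) + p(0, 1) + p(0, -1))
             + 2.0 * (p(1, 1) + p(1, -1) + p(-1, 1) + p(-1, -1))
             + (p(2, 0) + p(-2, 0) + p(0, 2) + p(0, -2)))
    return total / h ** 4

def goursat(x, y):
    """Real part of conj(z)*psi(z) + chi(z) with psi = exp(z), chi = z**3."""
    z = complex(x, y)
    return (z.conjugate() * cmath.exp(z) + z ** 3).real

def r_fourth(x, y):
    """r**4, whose biharmonic is the constant 64 (not a biharmonic function)."""
    return (x * x + y * y) ** 2

value_goursat = biharmonic(goursat, 0.3, 0.2)    # close to 0, up to O(h^2)
value_r_fourth = biharmonic(r_fourth, 0.3, 0.2)  # close to 64
```

The first value differs from zero only by the truncation error of the stencil, whereas the second reproduces $\nabla^2\nabla^2 r^4 = 64$.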


Problem 9.7. 

(a) Show that x — iy is not an analytic function of z = x + iy. 

(b) Determine the real functions of x and y which are real and imaginary
parts of the complex functions $z^n$ and $\tanh z$.

(c) Determine the real functions of r and $\theta$ which are the real and imaginary
parts of the complex function $z \ln z$. Note that $z = re^{i\theta}$.

(d) $z = x + iy$, $\bar z = x - iy$, $a = \alpha + i\beta$, $\bar a = \alpha - i\beta$, where x, y, $\alpha$, $\beta$ are real
numbers. Express the real and imaginary parts of the following functions
explicitly in terms of x, y, $\alpha$, $\beta$ to get acquainted with the notations
$f(z)$, $\bar f(z)$, $f(\bar z)$ and $\bar f(\bar z)$:

1. $f(z) = az$, $\bar f(z) = \bar a z$, $f(\bar z) = a\bar z$, $\bar f(\bar z) = \bar a \bar z$.

2. $f(z) = e^{iaz}$, $\bar f(z) = e^{-i\bar a z}$, $f(\bar z) = e^{ia\bar z}$, $\bar f(\bar z) = e^{-i\bar a \bar z}$.

3. $f(z) = az^n$, $\bar f(z) = \bar a z^n$, $f(\bar z) = a\bar z^n$, $\bar f(\bar z) = \bar a \bar z^n$.

Show that the complex conjugate of $f(z)$ is $\bar f(\bar z)$ in these examples.

(e) Show that the complex conjugate of $f(z) = \sum_{n=0} a_n z^n$ is

$\bar f(\bar z) = \sum_{n=0} \bar a_n \bar z^n$.

(f) Show that the derivative of $\bar f(\bar z)$ with respect to $\bar z$ is equal to the complex

(g) Find a function v(x,y) of two real variables x,y, such that ln(x 2 + y 2 ) + 
iv(x,y) is an analytic function of a complex variable x + iy. (Use the 
Cauchy-Riemann differential equations). 


9.6. KOLOSOFF-MUSKHELISHVILI METHOD 

The representation of biharmonic functions by analytic functions leads
to a general method of solving problems in plane stress and plane strain.

We consider a region, simply or multiply connected, on the x, y-plane
bounded by a number of contours (Fig. 9.6:1). The interior of the region
is considered to represent a disk of unit thickness. Surface traction is applied
on the boundaries of this region. Body forces will be assumed to be zero to
simplify the general solution. If body forces actually exist they can be
represented by a particular solution so that the general problem is reduced
to one without body force.

On introduction of the Airy stress function $\Phi(x, y)$ (Sec. 9.2), the plane
stress and plane strain problems are reduced to the solution of a biharmonic
equation subjected to appropriate boundary conditions. According to the
results of Sec. 9.5, the Airy stress function $\Phi(x, y)$ can be written as

(1) ▲ $2\Phi(x, y) = 2\,\mathrm{Rl}[\bar z\,\psi(z) + \chi(z)] = \bar z\,\psi(z) + z\,\bar\psi(\bar z) + \chi(z) + \bar\chi(\bar z)$,

where Rl(.) denotes the real part of the quantity in the parentheses, and
$\psi(z)$ and $\chi(z)$ are analytic functions, called the complex stress functions.
The formula (1) is due to the French mathematician Goursat.

The problem in plane elasticity is now simply to determine the complex
stress functions $\psi(z)$ and $\chi(z)$ that satisfy the boundary conditions. For
this purpose, we must express all the stresses and displacements in terms
of $\psi(z)$ and $\chi(z)$. We shall show that

(2) ▲ $\sigma_{xx} + \sigma_{yy} = 2[\psi'(z) + \bar\psi'(\bar z)]$,

(3) ▲ $\sigma_{yy} - \sigma_{xx} + 2i\sigma_{xy} = 2[\bar z\,\psi''(z) + \chi''(z)]$,

(4) ▲ $2G(u + iv) = \kappa\,\psi(z) - z\,\bar\psi'(\bar z) - \bar\chi'(\bar z)$,

where a prime denotes differentiation with respect to the independent
variable, i.e., $\psi'(z) = d\psi(z)/dz$ and $\bar\psi'(\bar z) = d\bar\psi(\bar z)/d\bar z$,

(5) $\kappa = 3 - 4\nu$ for plane strain,

(6) $\kappa = \dfrac{3 - \nu}{1 + \nu}$ for plane stress,

and $\nu$ is Poisson's ratio. In addition, we have the following expressions
for the resultant forces $F_x$, $F_y$ and the moment M about the origin of
the surface traction on an arc AB of the boundary from a point A to a
point B:

(7) ▲ $F_x + iF_y = \displaystyle\int_A^B \left(\overset{\nu}{T}_x + i\,\overset{\nu}{T}_y\right) ds = -i\left[\psi(z) + z\,\bar\psi'(\bar z) + \bar\chi'(\bar z)\right]_A^B$,

(8) $M = \mathrm{Rl}\left[\chi(z) - z\,\chi'(z) - z\bar z\,\psi'(z)\right]_A^B$.

Fig. 9.6:1. Notations.


The physical quantities $\sigma_{xx}$, $\sigma_{yy}$, $\sigma_{xy}$, u, v, etc., are of course real-valued.
If the complex-valued functions on the right-hand side of Eqs. (2)-(4)
are known, a separation into real and imaginary parts determines all
stresses and displacements.
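
The separation into real and imaginary parts can be sketched in a few lines. The helper below evaluates Eqs. (2) and (3) for user-supplied derivatives of the complex stress functions and splits the result into the three stress components. The function name `stresses`, the sample point, and the trial potentials $\psi = az$, $\chi' = -2az\,e^{-2i\alpha}$ (the form suggested in Problem 9.16(b) for uniaxial tension) are used here purely for illustration.

```python
import cmath

def stresses(d_psi, d2_psi, d2_chi, z):
    """Evaluate Eqs. (2) and (3) at z; return (sigma_xx, sigma_yy, sigma_xy)."""
    trace = 4.0 * complex(d_psi(z)).real                      # Eq. (2): 2[psi' + conj(psi')]
    dev = 2.0 * (z.conjugate() * d2_psi(z) + d2_chi(z))       # Eq. (3)
    dev = complex(dev)
    sigma_xx = 0.5 * (trace - dev.real)
    sigma_yy = 0.5 * (trace + dev.real)
    sigma_xy = 0.5 * dev.imag
    return sigma_xx, sigma_yy, sigma_xy

# Trial potentials (illustrative): psi = a z, chi' = -2 a z exp(-2 i alpha)
a, alpha = 0.25, cmath.pi / 6
T = 4.0 * a                                                   # resulting tension magnitude
sxx, syy, sxy = stresses(lambda z: a,                         # psi'
                         lambda z: 0.0,                       # psi''
                         lambda z: -2.0 * a * cmath.exp(-2j * alpha),  # chi''
                         1.3 - 0.4j)
```

With these potentials the stress state is uniform: $\sigma_{xx} = T\cos^2\alpha$, $\sigma_{yy} = T\sin^2\alpha$, $\sigma_{xy} = T\sin\alpha\cos\alpha$ with $T = 4a$, i.e., a uniaxial tension T inclined at the angle $\alpha$ to the x-axis.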

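The derivation of these relations is as follows. According to Eqs. (1)
and (9.2:17), for vanishing body force, we have

(9) $\sigma_{xx} + \sigma_{yy} = \dfrac{\partial^2 \Phi}{\partial y^2} + \dfrac{\partial^2 \Phi}{\partial x^2} = \nabla^2 \Phi = 4\dfrac{\partial^2 \Phi}{\partial z\,\partial \bar z} = 2[\psi'(z) + \bar\psi'(\bar z)]$,

(10) $\sigma_{xx} - i\sigma_{xy} = i\dfrac{\partial}{\partial y}\left(\dfrac{\partial \Phi}{\partial x} - i\dfrac{\partial \Phi}{\partial y}\right) = 2i\dfrac{\partial}{\partial y}\dfrac{\partial \Phi}{\partial z}$,

(11) $\sigma_{yy} + i\sigma_{xy} = \dfrac{\partial}{\partial x}\left(\dfrac{\partial \Phi}{\partial x} - i\dfrac{\partial \Phi}{\partial y}\right) = 2\dfrac{\partial^2 \Phi}{\partial x\,\partial z}$.

Taking the difference of Eqs. (10) and (11) and using Eq. (1) lead to Eq. (3).

To derive Eq. (4), we have to consider plane stress and plane strain
separately. For plane stress, Hooke's law states

(12) $e_{xx} = \dfrac{\partial u}{\partial x} = \dfrac{1}{E}(\sigma_{xx} - \nu\sigma_{yy}) = \dfrac{1}{E}\left(\dfrac{\partial^2 \Phi}{\partial y^2} - \nu\dfrac{\partial^2 \Phi}{\partial x^2}\right)$,

(13) $e_{yy} = \dfrac{\partial v}{\partial y} = \dfrac{1}{E}(\sigma_{yy} - \nu\sigma_{xx}) = \dfrac{1}{E}\left(\dfrac{\partial^2 \Phi}{\partial x^2} - \nu\dfrac{\partial^2 \Phi}{\partial y^2}\right)$,

(14) $e_{xy} = \dfrac{1}{2}\left(\dfrac{\partial u}{\partial y} + \dfrac{\partial v}{\partial x}\right) = \dfrac{1}{2G}\sigma_{xy} = -\dfrac{1}{2G}\dfrac{\partial^2 \Phi}{\partial x\,\partial y}$.

Note that

(15) $\dfrac{d\psi(z)}{dz} + \dfrac{d\bar\psi(\bar z)}{d\bar z} = \dfrac{\partial}{\partial x}\left[\psi(x + iy) + \bar\psi(x - iy)\right] = 2\dfrac{\partial}{\partial x}\mathrm{Rl}(\psi)$,

(16) $\dfrac{d\psi(z)}{dz} + \dfrac{d\bar\psi(\bar z)}{d\bar z} = -i\dfrac{\partial}{\partial y}\left[\psi(x + iy) - \bar\psi(x - iy)\right] = 2\dfrac{\partial}{\partial y}\mathrm{Im}(\psi)$,

where Rl(.) and Im(.) denote the real and imaginary parts of the quantity
in the parentheses. From Eqs. (9) and (15) and from Eqs. (9) and (16), we
have

(17) $\dfrac{\partial^2 \Phi}{\partial y^2} = 4\dfrac{\partial}{\partial x}\mathrm{Rl}(\psi) - \dfrac{\partial^2 \Phi}{\partial x^2}, \qquad \dfrac{\partial^2 \Phi}{\partial x^2} = 4\dfrac{\partial}{\partial y}\mathrm{Im}(\psi) - \dfrac{\partial^2 \Phi}{\partial y^2}$,

respectively. A substitution of Eq. (17) into Eq. (12) yields

$\dfrac{\partial u}{\partial x} = \dfrac{1}{E}\left[4\dfrac{\partial}{\partial x}\mathrm{Rl}(\psi) - (1 + \nu)\dfrac{\partial^2 \Phi}{\partial x^2}\right]$.

An integration with respect to x gives

(18) $u = \dfrac{1}{E}\left[4\,\mathrm{Rl}(\psi) - (1 + \nu)\dfrac{\partial \Phi}{\partial x}\right] + f(y)$,

where f(y) is an arbitrary function of y. Similarly, from Eq. (13), we obtain

(19) $v = \dfrac{1}{E}\left[4\,\mathrm{Im}(\psi) - (1 + \nu)\dfrac{\partial \Phi}{\partial y}\right] + g(x)$,

where g(x) is an arbitrary function of x. Substituting Eqs. (18) and (19)
into Eq. (14), and noticing that $\partial\,\mathrm{Rl}(\psi)/\partial y = -\partial\,\mathrm{Im}(\psi)/\partial x$, we obtain

$f'(y) + g'(x) = 0$.

Hence, $f(y) = \alpha y + \beta$, $g(x) = -\alpha x + \gamma$, where $\alpha$, $\beta$, $\gamma$ are constants. The
forms of f and g indicate that they represent a rigid motion and can thus be
disregarded in the analysis of deformation. If we set f = g = 0 in Eqs. (18)
and (19), use

$\dfrac{\partial \Phi}{\partial x} + i\dfrac{\partial \Phi}{\partial y} = 2\dfrac{\partial \Phi}{\partial \bar z} = \psi(z) + z\,\bar\psi'(\bar z) + \bar\chi'(\bar z)$,

and combine them properly, we obtain Eqs. (4) and (6). The plane strain
case is similar, and Eq. (5) can be obtained without further analysis by
applying a rule derived in Sec. 9.1, namely, replacing $\nu$ in Eq. (6) by an
apparent value $\nu/(1 - \nu)$.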


Fig. 9.6:2. Notations.


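It remains to prove formulas (7) and (8). Refer to Fig. 9.6:2. Let AB
be a continuous arc, ds be an arc length, $\nu$ be the unit vector normal to
the arc, and $\overset{\nu}{T}_x$, $\overset{\nu}{T}_y$ be the x- and y-components of the traction acting on
the arc. We note that

(20) $\cos\alpha = \dfrac{dy}{ds}, \qquad \sin\alpha = -\dfrac{dx}{ds}$,

(21) $\overset{\nu}{T}_x = \sigma_{xx}\cos\alpha + \sigma_{xy}\sin\alpha, \qquad \overset{\nu}{T}_y = \sigma_{yy}\sin\alpha + \sigma_{xy}\cos\alpha$.

Hence by Eq. (9.2:17),

(22) $\overset{\nu}{T}_x = \dfrac{\partial^2 \Phi}{\partial y^2}\dfrac{dy}{ds} + \dfrac{\partial^2 \Phi}{\partial x\,\partial y}\dfrac{dx}{ds} = \dfrac{d}{ds}\left(\dfrac{\partial \Phi}{\partial y}\right), \qquad \overset{\nu}{T}_y = -\dfrac{d}{ds}\left(\dfrac{\partial \Phi}{\partial x}\right)$.

Therefore, the resultant of forces acting on the arc AB can be written as:

(23) $F_x + iF_y = \displaystyle\int_A^B \left(\overset{\nu}{T}_x + i\,\overset{\nu}{T}_y\right) ds = \left[\dfrac{\partial \Phi}{\partial y} - i\dfrac{\partial \Phi}{\partial x}\right]_A^B = -i\left[\dfrac{\partial \Phi}{\partial x} + i\dfrac{\partial \Phi}{\partial y}\right]_A^B = -2i\left[\dfrac{\partial \Phi}{\partial \bar z}\right]_A^B$,

which is Eq. (7). The moment about the origin O of the forces acting on
the arc AB is

(24) $M = \displaystyle\int_A^B \left(x\,\overset{\nu}{T}_y - y\,\overset{\nu}{T}_x\right) ds = -\int_A^B \left[x\dfrac{d}{ds}\left(\dfrac{\partial \Phi}{\partial x}\right) + y\dfrac{d}{ds}\left(\dfrac{\partial \Phi}{\partial y}\right)\right] ds$

$\qquad = \displaystyle\int_A^B \left(\dfrac{\partial \Phi}{\partial x}\dfrac{dx}{ds} + \dfrac{\partial \Phi}{\partial y}\dfrac{dy}{ds}\right) ds - \left[x\dfrac{\partial \Phi}{\partial x} + y\dfrac{\partial \Phi}{\partial y}\right]_A^B = \Phi\Big|_A^B - \mathrm{Rl}\left[2z\dfrac{\partial \Phi}{\partial z}\right]_A^B$.

A substitution of Eq. (1) into Eqs. (23) and (24) verifies Eqs. (7) and (8).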

Equations (7) and (8), applied to a closed contour, show that if $\psi(z)$
and $\chi(z)$ are single-valued, the resultant forces and moment of the surface
traction acting on the contour vanish, since the functions in the brackets
return to their initial values when the circuit is completed. If the resultant
force of traction acting on a closed boundary does not vanish, the value of
the functions in the brackets must not return to their initial values when
the circuit is completed. For instance, the function $\ln z = \ln r + i\theta$ does not
return to its original value on completing a circuit around the origin, since
$\theta$ increases by $2\pi$. Thus, if $\psi(z) = C\ln z$, or $\chi(z) = Dz\ln z$, where C and
D are complex constants, Eq. (7) will yield a nonzero value of $F_x + iF_y$.
Similarly, $\chi(z) = D\ln z$ yields a nonzero value of M according to Eq. (8) if
D is imaginary, but a zero value if D is real.
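
The role of the logarithm can be made concrete with a short sketch. Taking the trial potentials $\psi(z) = C\ln z$ and $\chi = 0$, the bracket of Eq. (7) is followed continuously once around a circle, and the jump gives $F_x + iF_y = 2\pi C$. The function name `bracket`, the radius, and the value of C are assumptions made for this illustration.

```python
import cmath
import math

def bracket(C, r, theta):
    """psi + z*conj(psi') + conj(chi') for psi = C ln z, chi = 0.

    The branch ln z = ln r + i*theta is followed continuously, so theta
    is deliberately NOT reduced modulo 2*pi.
    """
    z = r * cmath.exp(1j * theta)
    psi = C * (math.log(r) + 1j * theta)
    d_psi = C / z
    return psi + z * d_psi.conjugate()

C = 0.3 - 0.1j                                   # arbitrary complex constant
# Eq. (7) evaluated from theta = 0 to theta = 2*pi on the circle r = 2
force = -1j * (bracket(C, 2.0, 2.0 * math.pi) - bracket(C, 2.0, 0.0))
```

Only the multi-valued term $C\ln z$ contributes to the jump; the term $z\,\bar\psi'(\bar z)$ returns to its starting value.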

The examples below will show that a number of problems in plane stress 
and plane strain can be solved by simple stress functions. By reducing the 
problem into the determination of two analytic functions of a complex vari¬ 
able, Muskhelishvili and his school have devised several general methods of 
approach. The principal tools for these general approaches are the confor¬ 
mal mapping and Cauchy integral equations. The details are well recorded 
in Muskhelishvili’s books. 


Example 1. Circular hole in a large plate 

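Consider a large plate with a circular hole of radius R at the origin. The
plate is subjected to uniform remote stresses $\sigma_{xx0}$, $\sigma_{yy0}$, $\sigma_{xy0}$. The traction
on the surface of the hole is zero. From Eq. (7), the traction-free condition
can be written as

(25) $-i(F_x - iF_y) = \bar\psi(\bar z) + \bar z\,\dfrac{d\psi(z)}{dz} + \varphi(z) = 0$,

at $z = Re^{i\theta}$, where $\varphi$ is a complex stress function defined as

(26) $\varphi(z) = \dfrac{d\chi(z)}{dz}$,

which is introduced to save writing. If we define

(27) $\varphi(z) = -\bar\psi\!\left(\dfrac{R^2}{z}\right) - \dfrac{R^2}{z}\dfrac{d\psi(z)}{dz}$,

then the resultant force on a circular arc connecting points P and Q is

$-i(F_x - iF_y) = \left[\bar\psi(\bar z) - \bar\psi\!\left(\dfrac{R^2}{z}\right) + \left(\bar z - \dfrac{R^2}{z}\right)\dfrac{d\psi(z)}{dz}\right]_P^Q$,

which is zero at the hole ($z\bar z = R^2$) for arbitrary $\psi(z)$.

We can use the remote stress conditions to determine $\psi(z)$. Assume
that

(28) $\psi = Az + \dfrac{B}{z} + \dfrac{C}{z^2} + \cdots$,

where A, B, C, ... are complex constants. From Eqs. (2) and (3), the leading
terms for large $|z|$ are

$\sigma_{xx} + \sigma_{yy} = 2(A + \bar A) + \cdots, \qquad \sigma_{yy} - \sigma_{xx} + 2i\sigma_{xy} = -\dfrac{2}{R^2}\left(\bar B + \dfrac{2\bar C z}{R^2} + \cdots\right) + \cdots$.

For $\sigma_{xx} + \sigma_{yy} = \sigma_{xx0} + \sigma_{yy0}$ and $\sigma_{yy} - \sigma_{xx} + 2i\sigma_{xy} = \sigma_{yy0} - \sigma_{xx0} + 2i\sigma_{xy0}$
as z approaches infinity, we obtain

(29) $A + \bar A = \dfrac{\sigma_{xx0} + \sigma_{yy0}}{2}, \qquad B = -\left(\dfrac{\sigma_{yy0} - \sigma_{xx0}}{2} - i\sigma_{xy0}\right)R^2, \qquad C = \cdots = 0$.

The imaginary part of A is indeterminate since $\psi = i\,\mathrm{Im}(A)\,z$ represents a
rigid body motion, and can be set to zero in the analysis of deformation.
Thus, we have

$\psi = \dfrac{\sigma_{xx0} + \sigma_{yy0}}{4}\,z - \left(\dfrac{\sigma_{yy0} - \sigma_{xx0}}{2} - i\sigma_{xy0}\right)\dfrac{R^2}{z}$,

$\varphi(z)$ can be calculated from Eq. (27), and then

$\dfrac{\sigma_{xx} + \sigma_{yy}}{2} = 2A - \dfrac{B}{z^2} - \dfrac{\bar B}{\bar z^2}$,

$\dfrac{\sigma_{yy} - \sigma_{xx} + 2i\sigma_{xy}}{2} = \dfrac{2B\bar z}{z^3} + \left(2A - \dfrac{\bar B z^2}{R^4} - \dfrac{3B}{z^2}\right)\dfrac{R^2}{z^2}$.

At $z = \pm R$, one can show that $\sigma_{yy} = 3\sigma_{yy0} - \sigma_{xx0}$, which is the well-known
result of stress concentration at a hole in a large plate for $\sigma_{xx0} = 0$.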
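
The closed-form stresses of Example 1 are easy to evaluate numerically. The sketch below codes the two expressions for $(\sigma_{xx} + \sigma_{yy})/2$ and $(\sigma_{yy} - \sigma_{xx} + 2i\sigma_{xy})/2$ with A and B taken from Eq. (29), Im(A) set to zero, and then checks the stress concentration at the edge of the hole. The function name `hole_stresses` and the sample loads are assumptions made for this illustration.

```python
import cmath

def hole_stresses(z, R, sxx0, syy0, sxy0):
    """Stresses (sigma_xx, sigma_yy, sigma_xy) at complex position z, |z| >= R."""
    A = (sxx0 + syy0) / 4.0                                  # Eq. (29), Im(A) = 0
    B = -((syy0 - sxx0) / 2.0 - 1j * sxy0) * R ** 2          # Eq. (29)
    zb = z.conjugate()
    half_sum = (2.0 * A - B / z ** 2 - B.conjugate() / zb ** 2).real
    half_dev = (2.0 * B * zb / z ** 3
                + (2.0 * A - B.conjugate() * z ** 2 / R ** 4 - 3.0 * B / z ** 2)
                * R ** 2 / z ** 2)
    return half_sum - half_dev.real, half_sum + half_dev.real, half_dev.imag

# Remote uniaxial tension sigma_yy0 = 1 with a hole of unit radius
at_side = hole_stresses(complex(1.0, 0.0), 1.0, 0.0, 1.0, 0.0)    # z = R
at_top = hole_stresses(complex(0.0, 1.0), 1.0, 0.0, 1.0, 0.0)     # z = iR
far_away = hole_stresses(complex(1000.0, 0.0), 1.0, 0.0, 1.0, 0.0)
```

At z = R the hoop stress is three times the remote tension, at z = iR it is compressive and equal in magnitude to the remote tension, and far from the hole the remote state is recovered.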

Example 2. Interaction of an elliptical hole and a concentrated load 
(Tong 1984) 

Consider a large plate with an elliptical hole of semi-axes a, b
along the x, y axes at the origin. A concentrated force $F_0\,(= F_{x0} + iF_{y0})$ is
applied at $z = z_0$. We introduce the transformation

(30) $z = R\left(\zeta + \dfrac{m}{\zeta}\right)$, where $R = \dfrac{a + b}{2}, \quad m = \dfrac{a - b}{a + b}$,

which maps the unit circle at the origin of the $\zeta$-plane to the ellipse in the
z-plane. Similar to Eq. (25), the traction-free condition on the ellipse can
be written as

(si) -i(F x - iF v )=m+mf z ~± +v(o = o 


on the unit circle in the C-plane where 



then, according to Eq. (7), the resultant force on an arc AB is 

(33) -HF. - iFy) = *0-^(1) + [,-(<) - J (1)] 

which equals to zero on the unit circle for arbitrary 

We shall choose $\psi(\zeta)$ to give a net force $F_0$ at $z = z_0$ (or $\zeta = \zeta_0$) and
zero stress at infinity. Assume $\psi(\zeta)$ in the form

(34) $\psi(\zeta) = A\ln(\zeta - \zeta_0) + \psi_1(\zeta)$,

where $\psi_1(\zeta)$ is analytic and bounded outside the unit circle. From Eq. (4),


2 G(u - iv) = Kip(() + ^ ) - 


*(0 - * (£ 


dC dip(z) 
dz d( 


— «[.Aln(C — Co) + Vh(C)] + >lln — Co \ + ipi ^ 


*(C) - 


a) 


dc 


+ V’i(C) 


dz [C-Co 

To assure that $u - iv$ is single-valued and has no singularity other than
$\ln|\zeta - \zeta_0|$ outside the unit circle, we require that


ipi ( ^ = *L41n(C - Co) + 




'fl l 


ra\ 

(!) 


^ 1 (0 = «Aln(i-Co)+ r ^ 


/.ml 

( < 0 + c 35 ~ 


mCo 


i _ 

Co 2 


Clearly $\psi_1(\zeta)$ is analytic and bounded outside the unit circle. To determine
A, we examine Eq. (33) in a small neighborhood around $\zeta_0$, i.e., $\zeta = \zeta_0 + \delta e^{i\theta}$,

$F_{x0} + iF_{y0} = -i\left[A(\ln\delta + i\theta) - \kappa A(\ln\delta - i\theta) + f(\delta, \theta)\right]_0^{2\pi} = 2\pi(\kappa + 1)A$,

where $f(\delta, \theta)$ is a single-valued function related to $\psi_1$. Therefore

$A = \dfrac{F_0}{2\pi(\kappa + 1)}$.

One can then evaluate all the stresses and displacements according to
Eqs. (2)-(4).

For b = 0, m equals 1 and the ellipse becomes a crack of length 2a. In
this case

$\dfrac{d\zeta}{dz} = \dfrac{2\zeta^2}{a(\zeta^2 - 1)} = \dfrac{1}{\sqrt{r}}\,O(1)$,

as $r \to 0$, where $re^{i\theta} = z \mp a$, which gives rise to a $1/\sqrt{r}$ stress singularity
at the crack tips $z = \pm a$ or $\zeta = \pm 1$. The stress intensity factors, a measure
of the intensity of the singularity, are defined as

(35) $K_I - iK_{II} = \displaystyle\lim_{|z| \to a^+} \sqrt{\dfrac{\pi}{a}(z^2 - a^2)}\,(\sigma_{yy} - i\sigma_{xy}) = \lim_{|z| \to a^+} 2\sqrt{\dfrac{\pi}{a}(z^2 - a^2)}\,\dfrac{d\psi}{dz} = 2\sqrt{\dfrac{\pi}{a}}\lim_{|\zeta| \to 1^+}\dfrac{d\psi}{d\zeta}$,

which is proportional to the coefficients of the terms of singular stress. In
the equation above, z and $\zeta$ approach the limit along the real axis. The
$1/\sqrt{r}$ stress singularity at the crack tip is the fundamental characteristic of
linear fracture mechanics. Interested readers are referred to books on the
subject. From Eqs. (34) and (35), we obtain

/I K \ f- , Co(CoCo-l)(Co-Co) c, 

\wrw«r 0+ c.(w.ra-D Fo J • 

PROBLEMS 

9.8. Discuss the conformal mapping specified by $\zeta = \tfrac{1}{2}(z + 1/z)$. Obtain the
inverse transformation. This transformation is the basis of airplane wing theory.

9.9. Consider a multiply connected region in the x, y-plane. Express the
condition for the single-valuedness of displacements in terms of stresses.

9.10. If $\partial\Phi/\partial x + i\,\partial\Phi/\partial y = \psi(z) + z\,\bar\psi'(\bar z) + \bar\chi'(\bar z)$ is to be single-valued, how
arbitrary are the functions $\psi(z)$, $\chi(z)$?

9.11. Consider a doubly connected region R bounded by two concentric circles.
If the stress is single-valued in R, how arbitrary are the functions $\psi(z)$, $\chi(z)$
in the Airy stress function? If both stresses and displacements are single-valued
in R, how arbitrary are $\psi(z)$, $\chi(z)$? [Muskhelishvili,$^{1.2}$ pp. 116-128.]

9.12. Show that, if all stress and displacement components $\sigma_{ij}$ and $u_i$ vanish
on an arc AB, then the stresses and displacements vanish identically in the entire
region R containing the arc AB. [Muskhelishvili,$^{1.2}$ p. 132.]

9 . 13 . Show that the transformation z = defines a transformation from 
rectangular coordinates to polar coordinates. 

9.14. A ring (a < r < b) is subjected to uniformly distributed shearing stress
in the circumferential direction on the inner and outer surfaces as in Fig. P9.14.
The couple of the shearing forces over the circumference is of magnitude M.
Determine the stress and displacement components by means of complex potentials

$\psi(z) = 0, \qquad \chi(z) = A\log z$,

where A is a constant which can be complex-valued.


Ki + iK n = 


1 


(1 + «)- v /S? 



P9.14. Twisting of a disk. 


P9.15. Pure bending of a curved bar.

9.15. A curved bar is subjected to a bending moment M at each end. The
bar is defined by circular arcs of radius a and b and radial lines with an opening
angle $\alpha$ ($\alpha < 2\pi$) as in Fig. P9.15. Show that the problem can be solved by
complex potentials of the form

$\psi(z) = Az\log z + Bz$,

$\chi(z) = C\log z$.

Determine the constants A, B, and C.

9.16. Consider a rectangular plate of sides 2l and 2L, and small thickness
2h. Various loads are applied on the edges, Fig. P9.16. There is no body force.
Determine the stresses and displacements in the plate with the suggested stress
functions $\psi(z)$ and $\chi(z)$. Express the constants a, b in terms of the external loads.

[Fig. P9.16.]

(a) All-round tension, Fig. P9.16(a):

$\psi(z) = az, \qquad \chi(z) = 0$.

(b) Uniaxial tension at an angle $\alpha$ to the x-axis, Fig. P9.16(b):

$\psi(z) = az, \qquad \chi'(z) = -2az\,e^{-2i\alpha}$.

(c) Pure bending, Fig. P9.16(c):

$\psi(z) = aiz^2, \qquad \chi(z) = -\tfrac{1}{3}aiz^3$.

(d) Bending with shear, Fig. P9.16(d):

$\psi(z) = 2aiz^3, \qquad \chi'(z) = -4ai(z^3 + 6zb^2)$.

Note: In comparison with the bending of a beam in the three-dimensional
case discussed in Sec. 7.7, the example in (d) shows how much simpler the
two-dimensional problem is.

9.17. Consider plane-stress or plane-strain states, parallel to the x, y-plane, 
of elastic bodies occupying regions bounded by circles. Determine the stresses 
and displacements for the following physical problems with the suggested stress 
functions (Fig. P9.17): 



<#> 

Fig. P9.17. 


(a) Cylinder under internal pressure $p_1$ and external pressure $p_2$
[Fig. P9.17(a)]


i/i(z) = Az , 


X(z) = f . 
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(b) A disk of radius a rotating about its axis at a constant angular speed u> 
[Fig. P9.17(b)]: the centrifugal force constitutes a body forces per unit 
mass = oj 2 r, which has a potential V = — ~uj 2 r 2 . Find a particular 
integral to Eqs. (9.2:18) and (9.2:19). 

(c) Rotating hollow disk [Fig. P9.17(c)], Solve by combining (a) and (b). 

(d) Large plate under tension and containing an unstressed circular hole 
[Fig. P9.17(d)]: 



Show that A = i a 2 T, B = -± Ta 2 , C = | To 4 . 

(e) Large plate containing a circular hole under uniform pressure 
[Fig. P9.17(e)]: 

V’(2) = 0, *'(*) = 
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VARIATIONAL CALCULUS, 
ENERGY THEOREMS, 
SAINT-VENANT’S PRINCIPLE 


There are at least three important reasons for taking up the calculus of 
variations in the study of continuum mechanics. 

1. Because basic minimum principles exist, which are among the most 
beautiful of theoretical physics. 

2. The field equations (ordinary or partial differential equations) and 
the associated boundary conditions of many problems can be 
derived from variational principles. In formulating an approxi¬ 
mate theory, the shortest and clearest derivation is usually obtained 
through variational calculus. 

3. The computational methods of solution of variational problems are 
the most powerful tools for obtaining numerical results in practical 
problems of engineering importance. 

In this chapter we shall discuss several variational principles and their 
applications. 

A brief introduction to the calculus of variations is furnished below. 
Those readers who are familiar with the mathematical techniques of the 
calculus of variations may skip over the first six sections. 

10.1. MINIMIZATION OF FUNCTIONALS 

The calculus of variations is concerned with the minimization of functionals.
If u(x) is a function of x, defined for x in the interval (a, b), and if
I is a quantity defined by the integral



then the value of I depends on the function u(x) as a whole. We may
indicate this dependence by writing I[u(x)]. Such a quantity I is said to be
a functional of u(x). Physical examples of functionals are the total kinetic
energy of a flow field and the strain energy of an elastic body.

The basic problem of the calculus of variations may be illustrated by the
following example. Let us consider a functional J[u] defined by the integral

(1) $J[u] = \displaystyle\int_a^b F(x, u, u')\,dx$.

We shall give our attention to all functions u(x) which are continuous and
differentiable, with continuous derivatives u'(x) and u''(x) in the interval
$a \le x \le b$, and satisfying the boundary conditions

(2) $u(a) = u_0, \qquad u(b) = u_1$,

where $u_0$ and $u_1$ are given numbers. We assume that the function F(x, u, u')
in Eq. (1) is continuous and differentiable with respect to x, u, and u' for all
such u, with continuous partial derivatives up to the second order.

Among all functions u(x) satisfying these continuity conditions and
boundary values, we try to find a special one u(x) = y(x), with the property
that J[u] attains a minimum when u(x) = y(x), with respect to a
sufficiently small neighborhood of y(x). The neighborhood (h) of y(x) is
defined as follows. If h is a positive quantity, a function u(x) is said to lie
in the neighborhood (h) of y(x) if the inequality

(3) $|y(x) - u(x)| < h$

holds for all x in (a, b). The situation is illustrated in Fig. 10.1:1.



Let us assume that the problem posed above has a solution, which
will be designated by y(x); i.e., there exists a function y(x) such that the
inequality

(4) $J[y] \le J[u]$

holds for all functions u(x) in a sufficiently small neighborhood (h) of y(x).
Let us exploit the necessary consequences of this assumption.

Let $\eta(x)$ be an arbitrary function with the properties that $\eta(x)$ and its
derivatives $\eta'(x)$, $\eta''(x)$ are continuous in the interval $a \le x \le b$, and that

(5) $\eta(a) = \eta(b) = 0$.

Then the function

(6) $u(x) = y(x) + \epsilon\,\eta(x)$

satisfies all the continuity conditions and boundary values specified at the
beginning of this section. In fact, any function u(x) satisfying these conditions
can be represented with some function $\eta(x)$ in this manner. For a
sufficiently small $\delta > 0$, this function u(x) belongs, for all $\epsilon$ with $|\epsilon| < \delta$, to
a prescribed neighborhood (h) of y(x). Now we introduce the function

(7) $\phi(\epsilon) = J[y + \epsilon\eta] = \displaystyle\int_a^b F[x,\, y(x) + \epsilon\eta(x),\, y'(x) + \epsilon\eta'(x)]\,dx$.

Since y(x) is assumed to be known, $\phi(\epsilon)$ is a function of $\epsilon$ for any specific
$\eta(x)$. According to (4), the inequality

(8) $\phi(0) \le \phi(\epsilon)$

must hold for all $\epsilon$ with $|\epsilon| < \delta$. In other words, $\phi(\epsilon)$ attains a minimum at
$\epsilon = 0$. The function $\phi(\epsilon)$ is differentiable with respect to $\epsilon$. Therefore, the
necessary condition for $\phi(\epsilon)$ to attain a minimum at $\epsilon = 0$ must follow,

(9) $\phi'(0) = 0$,

where a prime indicates a differentiation with respect to $\epsilon$. Now, a
differentiation under the sign of integration yields

(10) $\phi'(\epsilon) = \displaystyle\int_a^b \left[F_u(x, y + \epsilon\eta, y' + \epsilon\eta')\,\eta(x) + F_{u'}(x, y + \epsilon\eta, y' + \epsilon\eta')\,\eta'(x)\right] dx$,

where $F_u$, $F_{u'}$ indicate $\partial F/\partial u$, $\partial F/\partial u'$, respectively. Integrating the last
integral by parts, we obtain

(11) $\phi'(\epsilon) = \displaystyle\int_a^b \left[F_u(x, y + \epsilon\eta, y' + \epsilon\eta') - \dfrac{d}{dx}F_{u'}(x, y + \epsilon\eta, y' + \epsilon\eta')\right]\eta(x)\,dx + F_{u'}(x, y + \epsilon\eta, y' + \epsilon\eta')\,\eta(x)\Big|_a^b$.

The last term vanishes according to Eq. (5). $\phi'(\epsilon)$ must vanish at $\epsilon = 0$, at
which u equals y. Hence, we obtain the equation

(12) $0 = \phi'(0) = \displaystyle\int_a^b \left[F_y(x, y, y') - \dfrac{d}{dx}F_{y'}(x, y, y')\right]\eta(x)\,dx$,

which must be valid for an arbitrary function $\eta(x)$.

Equation (12) leads immediately to Euler’s differential equation, by 
virtue of the following “fundamental lemma of the calculus of variations:” 

Lemma. Let $\psi(x)$ be a continuous function in $a \le x \le b$. If the relation

(13) $\displaystyle\int_a^b \psi(x)\,\eta(x)\,dx = 0$

holds for all functions $\eta(x)$ which vanish at x = a and b and are continuous
together with their first 2n derivatives, where n is a positive integer, then
$\psi(x) = 0$.

Proof. This lemma is easily proved indirectly. We shall show first that
$\psi(x) = 0$ in the open interval a < x < b. Let us suppose that this statement
is not true, that $\psi(x)$ is different from zero, say positive, at $x = \xi$, where $\xi$
lies in the open interval. Then, according to the continuity of $\psi(x)$, there
must exist an interval $\xi - \delta \le x \le \xi + \delta$ (with $a < \xi - \delta$, $\xi + \delta < b$, $\delta > 0$),
in which $\psi(x)$ is positive. Now we take the function

(14) $\eta(x) = \begin{cases} (x - \xi + \delta)^{4n}(x - \xi - \delta)^{4n} & \text{in } \xi - \delta \le x \le \xi + \delta, \\ 0 & \text{elsewhere,} \end{cases}$

where n is a positive integer. (See Fig. 10.1:2.) This function $\eta(x)$ satisfies
the continuity and boundary conditions specified above. But the
choice of $\eta(x)$ as defined by Eq. (14) will make $\int_a^b \psi(x)\,\eta(x)\,dx > 0$,
in contradiction to the hypothesis. Thus, the hypothesis is untenable,
and $\psi(x) = 0$ in a < x < b.

Fig. 10.1:2. The function $\eta(x)$.

According to the continuity of $\psi(x)$, we get $\psi(x) = 0$ also in $a \le x \le b$.

Q.E.D.


This lemma can be extended to hold equally well for multiple integrals.
From Eq. (12), it follows immediately from the lemma that the necessary
condition for the functional $J[y + \epsilon\eta]$ in Eq. (7) to have an extremum is
that the factor in front of $\eta(x)$ must vanish:

(15a) ▲ $F_y(x, y, y') - \dfrac{d}{dx}F_{y'}(x, y, y') = 0, \qquad a \le x \le b$.

This is a differential equation that y(x) must satisfy and is known as Euler's
equation. Written out in extenso, we have

(15b) ▲ $\dfrac{d^2 y}{dx^2}\dfrac{\partial^2 F}{\partial y'\,\partial y'}(x, y, y') + \dfrac{dy}{dx}\dfrac{\partial^2 F}{\partial y'\,\partial y}(x, y, y') + \dfrac{\partial^2 F}{\partial y'\,\partial x}(x, y, y') - \dfrac{\partial F}{\partial y}(x, y, y') = 0$.

Should the problem be changed to finding the necessary condition for
J[u] to attain a maximum with respect to a sufficiently small neighborhood
(h) of y(x), the same result would be obtained. Hence, the result: The
validity of Euler's differential Eq. (15) is a necessary condition for a function
y(x) to furnish an extremum of the functional J[u] with respect to a
sufficiently small neighborhood (h) of y(x).

The satisfaction of Euler's equation is a necessary condition for J[u] to
attain an extremum; but it is not a sufficient condition. The question of
sufficiency is rather involved; an interested reader must refer to treatises on
calculus of variations such as those listed in Bibliography 10.1.†

Now a point of notation. It is customary to call $\epsilon\eta(x)$ the variation of
u(x) and write

(16) $\epsilon\,\eta(x) = \delta u(x)$.

It is also customary to define the first variation of the functional J[u] as

(17) $\delta J = \epsilon\,\phi'(\epsilon)$.

On multiplying both sides of Eq. (11) by $\epsilon$, it is evident that it can be
written as

(18) $\delta J = \displaystyle\int_a^b \left[F_u(x, u, u') - \dfrac{d}{dx}F_{u'}(x, u, u')\right]\delta u(x)\,dx + F_{u'}(x, u, u')\,\delta u(x)\Big|_a^b$.

This is analogous to the notation of the differential calculus, in which
the expression $df = \epsilon f'(x)$ for an arbitrary small parameter $\epsilon$ is called
the "differential of the function f(x)." It is obvious that $\delta J$ depends on the
function u(x) and its variation $\delta u(x)$. Thus, a necessary condition for
J[u] to attain an extremum when u(x) = y(x) is the vanishing of the first
variation $\delta J$ for all variations $\delta u$ with $\delta u(a) = \delta u(b) = 0$.

† Any consideration of the sufficient conditions requires the concept of the second
variation and the examination of its positive or negative definiteness. Several conditions
are known; they are similar to, but more complex than, the corresponding conditions for
maxima or minima of ordinary functions. It is useful to remember that many subtleties
exist in the calculus of variations. A physicist or an engineer rarely worries about the
mathematical details. His minimum principles are established on physical grounds and
the existence of a solution is usually taken for granted. Mathematically, however, many
examples can be constructed to show that a functional may not have a maximum or a
minimum, or that a solution of the Euler equation may not minimize the functional. An
engineer should be aware of these possibilities.


Example 1.

$J[u] = \displaystyle\int_a^b (1 + u'^2)\,dx = \min, \qquad u(a) = 0, \qquad u(b) = 1$.

The Euler equation is

$\dfrac{d}{dx}F_{y'} = \dfrac{d}{dx}\,2y' = 2y'' = 0$.

Hence y(x) is a straight line passing through the points (a, 0) and (b, 1).
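
The minimizing property in Example 1 can be observed directly. The sketch below takes a = 0, b = 1, evaluates J by the midpoint rule for the straight line y = x and for the varied curve $y + \epsilon\eta$ with $\eta = \sin \pi x$, and compares the increase with the exact value $\epsilon^2\pi^2/2$. The interval, the choice of $\eta$, and the function names are assumptions made for this illustration.

```python
import math

def J(u, a=0.0, b=1.0, n=2000):
    """Midpoint-rule estimate of the integral of 1 + u'(x)**2 over [a, b].

    u' is approximated on each cell by a central difference across the cell.
    """
    h = (b - a) / n
    total = 0.0
    for k in range(n):
        x = a + (k + 0.5) * h
        du = (u(x + 0.5 * h) - u(x - 0.5 * h)) / h
        total += (1.0 + du * du) * h
    return total

def varied(eps):
    """Admissible comparison curve y + eps*eta with eta = sin(pi x), eta(0) = eta(1) = 0."""
    return lambda x: x + eps * math.sin(math.pi * x)

J_line = J(lambda x: x)                  # extremal y = x
increase_pos = J(varied(0.1)) - J_line   # J rises for eps = +0.1
increase_neg = J(varied(-0.1)) - J_line  # and equally for eps = -0.1
```

The increase is of second order in $\epsilon$ and has the same sign for $\pm\epsilon$, in agreement with $\phi'(0) = 0$.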

Example 2.

Which curve minimizes the following functional?

$J[u] = \displaystyle\int_0^{\pi/2} \left[(u')^2 - u^2\right] dx, \qquad u(0) = 0, \qquad u\!\left(\dfrac{\pi}{2}\right) = 1$.

Ans. $u = y = \sin x$.
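
The answer to Example 2 can be checked against Euler's equation (15a). Assuming that the integrand reads $F = (u')^2 - u^2$, the Euler expression is $F_y - \frac{d}{dx}F_{y'} = -2y - 2y''$; the sketch below evaluates it by central differences for $y = \sin x$ and, for contrast, for the straight line $y = 2x/\pi$, which meets the same end values if the interval is $0 \le x \le \pi/2$. The integrand, the interval, and the function names are assumptions of this sketch.

```python
import math

def euler_residual(y, x, h=1e-3):
    """F_y - d/dx F_y' for F = y'**2 - y**2, i.e. -2*y - 2*y'' (y'' by central differences)."""
    d2y = (y(x + h) - 2.0 * y(x) + y(x - h)) / h ** 2
    return -2.0 * y(x) - 2.0 * d2y

sample_points = [0.2, 0.6, 1.0, 1.4]
residual_sine = max(abs(euler_residual(math.sin, x)) for x in sample_points)
residual_line = euler_residual(lambda x: 2.0 * x / math.pi, 1.0)
```

The residual of the sine curve is at the level of the differencing error, while the straight line leaves a residual of order one.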

Example 3. Minimum surface of revolution.

Find y(x) that minimizes the functional

$I[u] = 2\pi\displaystyle\int_a^b u\sqrt{1 + u'^2}\,dx, \qquad u(a) = A, \qquad u(b) = B$.

Ans. The Euler equation can be integrated to give

$y\sqrt{1 + y'^2} - \dfrac{y\,y'^2}{\sqrt{1 + y'^2}} = c_1$.

Let $y' = \sinh t$, then $y = c_1\cosh t$ and

$dx = \dfrac{dy}{y'} = \dfrac{c_1\sinh t\,dt}{\sinh t} = c_1\,dt, \qquad x = c_1 t + c_2$.

Hence, the minimum surface is obtained by revolving a curve with the
following parametric equations about the x-axis,

$x = c_1 t + c_2, \qquad y = c_1\cosh t, \qquad \text{or} \qquad y = c_1\cosh\dfrac{x - c_2}{c_1}$,

which is a family of catenaries. The constants $c_1$, $c_2$ can be determined
from the values of y at the end points.
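
As a small check on Example 3, the first integral of the Euler equation simplifies to $y/\sqrt{1 + y'^2} = c_1$, and the catenary should reproduce $c_1$ at every x. The sketch below evaluates this quantity; the function name and the sample constants are assumptions made for illustration.

```python
import math

def first_integral(c1, c2, x):
    """y / sqrt(1 + y'^2) along the catenary y = c1*cosh((x - c2)/c1)."""
    t = (x - c2) / c1
    y = c1 * math.cosh(t)
    dy = math.sinh(t)                    # y' = sinh((x - c2)/c1)
    return y / math.sqrt(1.0 + dy * dy)

c1, c2 = 0.8, 0.3                        # arbitrary sample constants
values = [first_integral(c1, c2, x) for x in (-1.0, 0.0, 0.5, 2.0)]
```

Every entry of `values` equals $c_1$ to rounding error, since $\sqrt{1 + \sinh^2 t} = \cosh t$.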


10.2. FUNCTIONAL INVOLVING HIGHER DERIVATIVES 
OF THE DEPENDENT VARIABLE 

In an analogous manner one can treat the variational problem connected
with the functional

(1) $J[u] = \displaystyle\int_a^b F(x, u, u^{(1)}, \ldots, u^{(n)})\,dx$

involving u(x) and its successive derivatives $u^{(1)}(x), u^{(2)}(x), \ldots, u^{(n)}(x)$, for
$a \le x \le b$. To state the problem concisely we denote by D the set of all
real functions u(x) with the following properties:

(2a) $u(x), \ldots, u^{(2n)}(x)$ continuous in $a \le x \le b$,

(2b) $u(a) = \alpha_0, \quad u^{(\nu)}(a) = \alpha_\nu, \quad \nu = 1, \ldots, n - 1$,

(2c) $u(b) = \beta_0, \quad u^{(\nu)}(b) = \beta_\nu, \quad \nu = 1, \ldots, n - 1$,

where $\alpha_0$, $\beta_0$, $\alpha_\nu$ and $\beta_\nu$ are given numbers. A function u(x) which possesses
these continuity properties and boundary values is said to be admissible
or in the set D. We assume that the function $F(x, u, \ldots, u^{(n)})$ has
continuous partial derivatives up to the order n with respect to the n + 2
arguments $x, u, \ldots, u^{(n)}$ for u in the set D. We seek a function u(x) =
y(x) in D so that J[u] is a minimum (or a maximum) for u(x) = y(x), with
respect to a sufficiently small neighborhood (h) of y(x).

Let $\eta(x)$ be an arbitrary function with the properties

(3) $\eta(x), \eta^{(1)}(x), \ldots, \eta^{(2n)}(x)$ continuous in $a \le x \le b$,
    $\eta(a) = \cdots = \eta^{(n-1)}(a) = \eta(b) = \cdots = \eta^{(n-1)}(b) = 0$.

Then the function $u(x) = y(x) + \epsilon\eta(x)$ belongs to the set D for all $\epsilon$. For
sufficiently small $\epsilon$'s this function belongs to a prescribed neighborhood (h)
of y(x). Once again, we introduce the function

(4) $\phi(\epsilon) = J[y + \epsilon\eta] = \displaystyle\int_a^b F(x,\, y + \epsilon\eta,\, y^{(1)} + \epsilon\eta^{(1)}, \ldots, y^{(n)} + \epsilon\eta^{(n)})\,dx$.

The assumption that y(x) minimizes J[u] leads to the necessary condition
$\phi'(0) = 0$. So we obtain by differentiating under the sign of integration,
integrating by parts, and using Eqs. (3), the result

(5) $0 = \phi'(0) = \displaystyle\int_a^b \left\{\sum_{\nu=0}^{n} \dfrac{\partial}{\partial y^{(\nu)}}F(x, y, y^{(1)}, \ldots, y^{(n)})\,\eta^{(\nu)}(x)\right\} dx$

$\qquad = \displaystyle\int_a^b \left\{\sum_{\nu=0}^{n} (-1)^\nu \dfrac{d^\nu}{dx^\nu}\left[\dfrac{\partial}{\partial y^{(\nu)}}F(x, y, y^{(1)}, \ldots, y^{(n)})\right]\right\}\eta(x)\,dx$,

which holds for an arbitrary function $\eta(x)$ satisfying Eq. (3). According to
the lemma proved in Sec. 10.1, we obtain the Euler equation

(6) ▲ $\displaystyle\sum_{\nu=0}^{n} (-1)^\nu \dfrac{d^\nu}{dx^\nu}\left[\dfrac{\partial}{\partial y^{(\nu)}}F(x, y, y^{(1)}, \ldots, y^{(n)})\right] = 0$,

which is a necessary condition for the function u = y(x) to minimize (or
maximize) the functional J[u] given in Eq. (1).

The notations $\delta u$, $\delta J$, etc., introduced in Sec. 10.1, can be extended to
this case by obvious changes.


Example

Find the extremal of $J[u] = \int_0^1 (1 + u''^2)\,dx$ satisfying the boundary
conditions

$u(0) = 0, \quad u'(0) = 1, \quad u(1) = 1, \quad u'(1) = 1$.

Ans. $u = y(x) = x$.
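
For this example Eq. (6) reduces to $\frac{d^2}{dx^2}(2y'') = 0$, so the extremal is a cubic fixed by the four end conditions. The sketch below builds that cubic with the standard Hermite basis on $0 \le x \le 1$ and confirms that it collapses to y = x. The function name `hermite_cubic` and the sample abscissae are assumptions made for illustration.

```python
def hermite_cubic(x, y0, m0, y1, m1):
    """Cubic on 0 <= x <= 1 with y(0) = y0, y'(0) = m0, y(1) = y1, y'(1) = m1."""
    h00 = 2 * x ** 3 - 3 * x ** 2 + 1    # weight of y(0)
    h10 = x ** 3 - 2 * x ** 2 + x        # weight of y'(0)
    h01 = -2 * x ** 3 + 3 * x ** 2       # weight of y(1)
    h11 = x ** 3 - x ** 2                # weight of y'(1)
    return h00 * y0 + h10 * m0 + h01 * y1 + h11 * m1

# End conditions of the example: u(0) = 0, u'(0) = 1, u(1) = 1, u'(1) = 1
samples = [0.0, 0.25, 0.5, 0.9, 1.0]
extremal = [hermite_cubic(x, 0.0, 1.0, 1.0, 1.0) for x in samples]
```

The quadratic and cubic contributions cancel for these end values, leaving the straight line quoted in the answer.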


10.3. SEVERAL UNKNOWN FUNCTIONS 

The method used in the previous sections can be extended to more 
complicated functionals. For example, let 


$$
J[u_1, u_2, \ldots, u_m] = \int_a^b F(x, u_1, \ldots, u_m; u_1', \ldots, u_m')\,dx
\tag{1}
$$

be a functional depending on m functions $u_1(x), \ldots, u_m(x)$. We assume
that the functions $F(x, u_1, \ldots, u_m')$, $u_1(x), \ldots, u_m(x)$ are twice
differentiable, and that the boundary values of $u_1, \ldots, u_m$ are given at
x = a and b. We seek a special set of functions $u_\mu(x) = y_\mu(x)$,
$\mu = 1, \ldots, m$, in order that $J[u_1, \ldots, u_m]$ attains a minimum (or a
maximum) when $u_\mu(x) = y_\mu(x)$, with respect to a sufficiently small
neighborhood of the $y_\mu(x)$; i.e., for all $u_\mu(x)$ satisfying the relation

$$
|y_\mu(x) - u_\mu(x)| < h_\mu, \qquad h_\mu > 0, \qquad \mu = 1, 2, \ldots, m.
$$

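Again it is easy to obtain the necessary conditions. Let the set of
functions $y_1(x), \ldots, y_m(x)$ be a solution of the variational problem. Let
$\eta_1(x), \ldots, \eta_m(x)$ be an arbitrary set of functions with the properties

$$
\begin{aligned}
&\eta_\mu(x),\ \eta_\mu'(x),\ \eta_\mu''(x) \ \text{continuous in } a \le x \le b, \\
&\eta_\mu(a) = \eta_\mu(b) = 0, \qquad \mu = 1, \ldots, m.
\end{aligned}
\tag{2}
$$

Then consider the function

$$
\begin{aligned}
\phi(\epsilon_1, \ldots, \epsilon_m)
&= J[y_1 + \epsilon_1\eta_1, \ldots, y_m + \epsilon_m\eta_m] \\
&= \int_a^b F(x, y_1 + \epsilon_1\eta_1, \ldots, y_m + \epsilon_m\eta_m;\ y_1' + \epsilon_1\eta_1', \ldots, y_m' + \epsilon_m\eta_m')\,dx.
\end{aligned}
\tag{3}
$$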


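Since $y_1(x), \ldots, y_m(x)$ minimize (or maximize)
$J[y_1 + \epsilon_1\eta_1, \ldots, y_m + \epsilon_m\eta_m]$,
the following inequality must hold for sufficiently small
$\epsilon_1, \ldots, \epsilon_m$:

$$
\phi(\epsilon_1, \ldots, \epsilon_m) \ge \phi(0, \ldots, 0)
\qquad [\text{or } \phi(\epsilon_1, \ldots, \epsilon_m) \le \phi(0, \ldots, 0)].
\tag{4}
$$

The corresponding necessary conditions are

$$
\frac{\partial\phi}{\partial\epsilon_\mu} = 0
\quad \text{for } \epsilon_1 = \epsilon_2 = \cdots = \epsilon_m = 0,
\qquad \mu = 1, \ldots, m,
\tag{5}
$$

which, again with $F_{y_\mu}$, $F_{y_\mu'}$ denoting $\partial F/\partial y_\mu$,
$\partial F/\partial y_\mu'$, lead to

$$
0 = \int_a^b \big[ F_{y_\mu}(x, y_1, \ldots, y_m; y_1', \ldots, y_m')\,\eta_\mu(x)
+ F_{y_\mu'}(x, y_1, \ldots, y_m; y_1', \ldots, y_m')\,\eta_\mu'(x) \big]\,dx,
\qquad \mu = 1, \ldots, m.
\tag{6}
$$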
Using integration by parts and the conditions (2), we obtain

$$
\begin{aligned}
0 = {} & \int_a^b \Big[ F_{y_\mu}(x, y_1, \ldots, y_m; y_1', \ldots, y_m')
 - \frac{d}{dx} F_{y_\mu'}(x, y_1, \ldots, y_m; y_1', \ldots, y_m') \Big]\, \eta_\mu(x)\,dx \\
& + F_{y_\mu'}(x, y_1, \ldots, y_m; y_1', \ldots, y_m')\,\eta_\mu(x) \Big|_a^b,
\qquad \mu = 1, \ldots, m.
\end{aligned}
\tag{7}
$$

Equation (7) must hold for any set of functions $\eta_\mu(x)$ satisfying (2). By the
lemma of Sec. 10.1 we have the following Euler equations which must be
satisfied by $y_1(x), \ldots, y_m(x)$ minimizing (or maximizing) the functional (1):

$$
F_{y_\mu}(x, y_1, \ldots, y_m; y_1', \ldots, y_m')
- \frac{d}{dx} F_{y_\mu'}(x, y_1, \ldots, y_m; y_1', \ldots, y_m') = 0,
\qquad \mu = 1, \ldots, m.
\tag{8}
$$

As a generalization of the variational notations given in Sec. 10.1, the
expression

$$
\delta J = \sum_{\mu=1}^{m} \epsilon_\mu
\left( \frac{\partial\phi}{\partial\epsilon_\mu} \right)_{\epsilon_1 = \cdots = \epsilon_m = 0}
$$

is called the first variation of the functional.

Variational problems for functionals involving higher derivatives of
$u_1, \ldots, u_m$ can be treated in the same way.

Example

Find the extremal of the functional

$$
J[y, z] = \int_0^{\pi/2} (y'^2 + z'^2 + 2yz)\,dx,
$$

$$
y(0) = 0, \quad y(\pi/2) = 1, \quad z(0) = 0, \quad z(\pi/2) = -1.
$$

The Euler equations are

$$
y'' - z = 0, \qquad z'' - y = 0.
$$

Eliminating z, we have $y^{iv} - y = 0$. Hence,

$$
\begin{aligned}
y &= c_1 e^{x} + c_2 e^{-x} + c_3 \cos x + c_4 \sin x, \\
z &= y'' = c_1 e^{x} + c_2 e^{-x} - c_3 \cos x - c_4 \sin x.
\end{aligned}
$$

From the boundary conditions we obtain the solution

$$
y = \sin x, \qquad z = -\sin x.
$$
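
The solution of this example can be verified numerically. The following sketch evaluates the residuals of the two Euler equations y'' − z = 0 and z'' − y = 0 with a central-difference second derivative, and checks the four boundary values. The sample points and the step size are assumptions of the check, not part of the example.

```python
import math

def y(x):
    return math.sin(x)

def z(x):
    return -math.sin(x)

def d2(f, x, h=1e-4):
    """Central-difference estimate of f''(x)."""
    return (f(x + h) - 2.0 * f(x) + f(x - h)) / (h * h)

# Interior sample points of the interval (0, pi/2)
xs = [i * (math.pi / 2.0) / 20.0 for i in range(1, 20)]

residual_y = max(abs(d2(y, x) - z(x)) for x in xs)   # Euler equation y'' - z = 0
residual_z = max(abs(d2(z, x) - y(x)) for x in xs)   # Euler equation z'' - y = 0

boundary_errors = [
    abs(y(0.0) - 0.0),
    abs(y(math.pi / 2.0) - 1.0),
    abs(z(0.0) - 0.0),
    abs(z(math.pi / 2.0) + 1.0),
]
print(residual_y, residual_z, max(boundary_errors))
```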


10.4. SEVERAL INDEPENDENT VARIABLES 
Consider the functional

$$
J[u] = \iint_G F(x, y, u, u_x, u_y)\,dx\,dy,
\tag{1}
$$

where G is a finite, closed domain of the x, y-plane with a boundary curve C
which has a piecewise continuously turning tangent, u(x, y) is a continuous
twice differentiable function of x, y in G, and $u_x$, $u_y$ denote partial
derivatives $\partial u/\partial x$, $\partial u/\partial y$ respectively. The
function $F(x, y, u, u_x, u_y)$ is assumed to be twice continuously
differentiable with respect to its five arguments.
Let D be the set of all functions u(x, y) with the following properties:

$$
\begin{cases}
\text{(a)}\ u(x, y),\ u_x(x, y),\ u_y(x, y),\ u_{xx}(x, y),\ u_{xy}(x, y),\ u_{yy}(x, y) \ \text{continuous in } G, \\
\text{(b)}\ u(x, y) \ \text{prescribed on } C.
\end{cases}
\tag{2}
$$

These are the admissible conditions for u. We now seek a special function
u(x, y) = v(x, y) in the set D, which minimizes (or maximizes) the
functional J[u].

Let v(x, y) be a solution of this variational problem. Let $\eta(x, y)$ denote
an arbitrary function with the properties

$$
\begin{cases}
\text{(a)}\ \eta(x, y) \ \text{has continuous derivatives up to the second order in } G, \\
\text{(b)}\ \eta(x, y) = 0 \ \text{for } (x, y) \text{ on the boundary } C.
\end{cases}
\tag{3}
$$

A consideration of the function

$$
\phi(\epsilon) = J[v + \epsilon\eta],
\tag{4}
$$

which attains an extremum when $\epsilon = 0$, again leads to the necessary
condition

$$
\frac{d\phi}{d\epsilon}(0) = 0,
\tag{5}
$$

i.e., explicitly, with subscripts to F denoting partial derivatives of F,

$$
0 = \iint_G \{ F_v(x, y, v, v_x, v_y)\,\eta + F_{v_x}(x, y, v, v_x, v_y)\,\eta_x
+ F_{v_y}(x, y, v, v_x, v_y)\,\eta_y \}\,dx\,dy.
\tag{6}
$$

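The last two terms can be simplified by Gauss' theorem after rewriting
Eq. (6) as*

$$
0 = \iint_G \Big\{ F_v\,\eta + \frac{\partial}{\partial x}(F_{v_x}\,\eta) - \eta\,\frac{\partial}{\partial x}F_{v_x}
+ \frac{\partial}{\partial y}(F_{v_y}\,\eta) - \eta\,\frac{\partial}{\partial y}F_{v_y} \Big\}\,dx\,dy.
$$

An application of Gauss' theorem to the sum of the second and fourth terms
in the integrand gives

$$
0 = \iint_G \Big\{ F_v - \frac{\partial}{\partial x}F_{v_x} - \frac{\partial}{\partial y}F_{v_y} \Big\}\,\eta(x, y)\,dx\,dy
+ \int_C \{ F_{v_x}\,n_1(s) + F_{v_y}\,n_2(s) \}\,\eta\,ds.
\tag{7}
$$

Here $n_1(s)$ and $n_2(s)$ are the components of the unit outward-normal vector
n(s) of C.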

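The line integral in Eq. (7) vanishes according to Eq. (3). The function
$\eta(x, y)$ in the surface integral is arbitrary. The generalized lemma of
Sec. 10.1 then leads to the Euler equations

$$
F_v - \frac{\partial}{\partial x}F_{v_x} - \frac{\partial}{\partial y}F_{v_y} = 0,
\tag{8}
$$

i.e.,

$$
\begin{aligned}
&\frac{\partial F}{\partial v}
- \frac{\partial^2 F}{\partial v_x\,\partial x}
- \frac{\partial^2 F}{\partial v_y\,\partial y}
- \frac{\partial^2 F}{\partial v_x\,\partial v}\frac{\partial v}{\partial x}
- \frac{\partial^2 F}{\partial v_y\,\partial v}\frac{\partial v}{\partial y} \\
&\quad - \frac{\partial^2 F}{\partial v_x^2}\frac{\partial^2 v}{\partial x^2}
- 2\,\frac{\partial^2 F}{\partial v_x\,\partial v_y}\frac{\partial^2 v}{\partial x\,\partial y}
- \frac{\partial^2 F}{\partial v_y^2}\frac{\partial^2 v}{\partial y^2} = 0.
\end{aligned}
\tag{9}
$$

Equation (9) is a necessary condition for a function v(x, y) in the set D
minimizing (or maximizing) the functional (1).

*The symbol $(\partial/\partial x)F_{v_x}$ means that $F_{v_x}$ should be
considered as a function of x and y, e.g.,

$$
\frac{\partial}{\partial x}F_{v_x}
= \frac{\partial F_{v_x}}{\partial x}
+ \frac{\partial F_{v_x}}{\partial v}\frac{\partial v}{\partial x}
+ \frac{\partial F_{v_x}}{\partial v_x}\frac{\partial v_x}{\partial x}
+ \frac{\partial F_{v_x}}{\partial v_y}\frac{\partial v_y}{\partial x}.
$$

In the same way one may treat variational problems connected with
functionals which involve more than two independent variables, higher
derivatives, and several unknown functions.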

Example 1

$$
J[u] = \iint_G (u_x^2 + u_y^2)\,dx\,dy,
$$

with a boundary condition that u is equal to an assigned function f(x, y)
on the boundary C of the domain G. The Euler equation is the Laplace
equation $v_{xx} + v_{yy} = 0$.

Example 2

$$
J[u] = \iint_G (u_{xx}^2 + u_{yy}^2 + 2u_{xy}^2)\,dx\,dy = \min.
$$

Then v must satisfy the biharmonic equation

$$
\frac{\partial^4 v}{\partial x^4} + 2\,\frac{\partial^4 v}{\partial x^2\,\partial y^2} + \frac{\partial^4 v}{\partial y^4} = 0.
$$
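
Example 1 has a simple discrete analogue, sketched below under assumptions that are not in the text: a uniform square grid, a discrete energy equal to the sum of squared differences between neighboring nodal values, and the sample harmonic function u = x² − y². The derivative of this discrete energy with respect to an interior nodal value is proportional to the five-point Laplacian, so it vanishes for the harmonic sample, and any change of a single interior value raises the energy.

```python
def dirichlet_energy(u):
    """Discrete analogue of the integral of u_x^2 + u_y^2.

    On a uniform grid of spacing h, (du/dx)^2 * h^2 is approximated by
    (u[i+1][j] - u[i][j])^2, so the sum runs over all grid edges.
    """
    n, m = len(u), len(u[0])
    energy = 0.0
    for i in range(n):
        for j in range(m):
            if i + 1 < n:
                energy += (u[i + 1][j] - u[i][j]) ** 2
            if j + 1 < m:
                energy += (u[i][j + 1] - u[i][j]) ** 2
    return energy

def energy_gradient(u, i, j):
    """Derivative of the discrete energy with respect to an interior value u[i][j]."""
    return 2.0 * (4.0 * u[i][j] - u[i + 1][j] - u[i - 1][j]
                  - u[i][j + 1] - u[i][j - 1])

n = 9
h = 1.0 / (n - 1)
# Sample harmonic function u = x^2 - y^2 on the unit square (an assumed test case)
harmonic = [[(i * h) ** 2 - (j * h) ** 2 for j in range(n)] for i in range(n)]

max_gradient = max(abs(energy_gradient(harmonic, i, j))
                   for i in range(1, n - 1) for j in range(1, n - 1))

# Change one interior value while keeping the boundary values fixed
bumped = [row[:] for row in harmonic]
bumped[4][4] += 0.01
energy_increase = dirichlet_energy(bumped) - dirichlet_energy(harmonic)
print(max_gradient, energy_increase)
```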


10.5. SUBSIDIARY CONDITIONS — LAGRANGIAN 
MULTIPLIERS 

In many problems, we are interested in the extremum of a function or
functional under certain subsidiary conditions. As an elementary example,
let us consider a function f(x, y) defined for all (x, y) in a certain domain
G. Suppose that we are interested in the extremum of f(x, y), not for
all points in G, but only for those points (x, y) in G which satisfy the
relation

$$
\phi(x, y) = 0.
\tag{1}
$$

Thus, if the domain G and the curve $\phi(x, y) = 0$ are as shown in Fig. 10.5:1,
then we are interested in finding the extremum of f(x, y) among all points
that lie on the segment of the curve AB. For conciseness of expression, we
shall designate such a subdomain, the segment AB, by a symbol $G_\phi$.

Let us find the necessary conditions for f(x, y) to attain an extreme
value at a point $(\bar{x}, \bar{y})$ in $G_\phi$, with respect to all points of a
sufficiently small neighborhood of $(\bar{x}, \bar{y})$ belonging to $G_\phi$.
Let us assume that the function $\phi(x, y)$ has continuous partial derivatives
with respect to x and y, $\phi_x = \partial\phi/\partial x$,
$\phi_y = \partial\phi/\partial y$, and that at the point $(\bar{x}, \bar{y})$
not both derivatives are zero; say,

$$
\phi_y(\bar{x}, \bar{y}) \ne 0.
\tag{2}
$$

Then, according to a fundamental theorem on implicit functions, there
exists a neighborhood of $\bar{x}$, say $\bar{x} - \delta \le x \le \bar{x} + \delta$
$(\delta > 0)$, where the equation $\phi(x, y) = 0$ can be solved uniquely in
the form

$$
y = g(x).
\tag{3}
$$

The function g(x) so defined is single-valued and differentiable, and
$\phi[x, g(x)] = 0$ is an identity in x. Hence

$$
0 = \phi_x[x, g(x)] + \phi_y[x, g(x)]\,\frac{dg}{dx}.
\tag{4}
$$

Now, let us consider the problem of the extremum value of f(x, y) in
$G_\phi$. According to (3), in a sufficiently small neighborhood of
$(\bar{x}, \bar{y})$, y is an implicit function of x, and f(x, y) becomes

$$
\mathcal{F}(x) = f[x, g(x)].
\tag{5}
$$

If $\mathcal{F}(x)$ attains an extreme value at $\bar{x}$, then the first
derivative of $\mathcal{F}(x)$ vanishes at $\bar{x}$, i.e.,

$$
0 = \frac{d\mathcal{F}}{dx}(\bar{x}) = f_x(\bar{x}, \bar{y}) + f_y(\bar{x}, \bar{y})\,\frac{dg}{dx}(\bar{x}).
\tag{6}
$$

But the derivative dg/dx can be eliminated between Eqs. (6) and (4). The
result, together with Eq. (1), constitutes the necessary condition for an
extremum of f(x, y) in $G_\phi$.
The formalism will be more elegant by introducing a number $\lambda$, called
Lagrange's multiplier, defined by

$$
\lambda = -\frac{f_y(\bar{x}, \bar{y})}{\phi_y(\bar{x}, \bar{y})}.
\tag{7}
$$

A combination of Eqs. (4), (6), and (7) gives

$$
f_x(\bar{x}, \bar{y}) + \lambda\,\phi_x(\bar{x}, \bar{y}) = 0,
\tag{8}
$$

while Eq. (7) may be written as

$$
f_y(\bar{x}, \bar{y}) + \lambda\,\phi_y(\bar{x}, \bar{y}) = 0.
\tag{9}
$$

Equations (1), (8), and (9) are necessary conditions for the function f(x, y)
to attain an extreme at a point $(\bar{x}, \bar{y})$, where
$\phi_x^2 + \phi_y^2 > 0$. These conditions constitute three equations for the
three "unknowns" $\bar{x}$, $\bar{y}$, and $\lambda$.

These results can be summarized in the following manner. We introduce
a new function

$$
F(x, y; \lambda) = f(x, y) + \lambda\,\phi(x, y).
\tag{10}
$$

If the function f(x, y) has an extreme value at the point $(\bar{x}, \bar{y})$
with respect to $G_\phi$, and if

$$
[\phi_x(\bar{x}, \bar{y})]^2 + [\phi_y(\bar{x}, \bar{y})]^2 > 0,
\tag{11}
$$

then there exists a certain number $\lambda$ so that the three partial
derivatives of $F(x, y; \lambda)$ with respect to x, y, and $\lambda$ are zero at
$(\bar{x}, \bar{y}, \lambda)$:

$$
\begin{aligned}
\frac{\partial F}{\partial x}(\bar{x}, \bar{y}, \lambda) &= f_x(\bar{x}, \bar{y}) + \lambda\,\phi_x(\bar{x}, \bar{y}) = 0, \\
\frac{\partial F}{\partial y}(\bar{x}, \bar{y}, \lambda) &= f_y(\bar{x}, \bar{y}) + \lambda\,\phi_y(\bar{x}, \bar{y}) = 0, \\
\frac{\partial F}{\partial \lambda}(\bar{x}, \bar{y}, \lambda) &= \phi(\bar{x}, \bar{y}) = 0.
\end{aligned}
\tag{12}
$$

If there exist points (x', y') in G with $\phi(x', y') = 0$ and
$\phi_x(x', y') = \phi_y(x', y') = 0$, additional considerations are necessary.

In the formulation (10) and (12), the theorem can be generalized to
the case of n variables $x_1, \ldots, x_n$ and several subsidiary conditions
$0 = \phi_1(x_1, \ldots, x_n) = \cdots = \phi_m(x_1, \ldots, x_n)$, $m < n$.
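
The conditions (12) can be checked on a small example. The sketch below assumes the illustrative choice f(x, y) = x + y with the subsidiary condition φ(x, y) = x² + y² − 1 = 0, neither of which comes from the text. It computes λ from Eq. (7) at the candidate point (1/√2, 1/√2), evaluates the three residuals of Eq. (12), and compares f at neighboring points of the constraint curve.

```python
import math

# Assumed illustration: f(x, y) = x + y on the unit circle phi = x^2 + y^2 - 1 = 0.
def f_x(x, y):
    return 1.0

def f_y(x, y):
    return 1.0

def phi(x, y):
    return x * x + y * y - 1.0

def phi_x(x, y):
    return 2.0 * x

def phi_y(x, y):
    return 2.0 * y

x_bar = y_bar = 1.0 / math.sqrt(2.0)             # candidate extremum point
lam = -f_y(x_bar, y_bar) / phi_y(x_bar, y_bar)   # Lagrange multiplier, Eq. (7)

residuals = (
    f_x(x_bar, y_bar) + lam * phi_x(x_bar, y_bar),   # dF/dx in Eq. (12)
    f_y(x_bar, y_bar) + lam * phi_y(x_bar, y_bar),   # dF/dy in Eq. (12)
    phi(x_bar, y_bar),                               # dF/dlambda in Eq. (12)
)

def f_on_circle(theta):
    """f restricted to the constraint curve, parametrized by the polar angle."""
    return math.cos(theta) + math.sin(theta)

theta_bar = math.pi / 4.0
is_local_max = (f_on_circle(theta_bar) > f_on_circle(theta_bar + 0.1)
                and f_on_circle(theta_bar) > f_on_circle(theta_bar - 0.1))
print(lam, residuals, is_local_max)
```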

The application of the Lagrange multiplier method to the minimization 
of functionals follows a similar reasoning. 
As an example, let us consider the classical problem of geodesics: to find
the line of minimal length lying on a given surface $\phi(x, y, z) = 0$ and
joining two given points on this surface (Fig. 10.5:2). Here we must minimize
the functional

$$
I = \int_{x_0}^{x_1} \sqrt{1 + y'^2 + z'^2}\,dx,
\tag{13}
$$

with y(x), z(x) satisfying the condition $\phi(x, y, z) = 0$. This problem
was solved in 1697 by Johann Bernoulli, but a general method of solution
was given by L. Euler and J. Lagrange.

We consider a new functional

$$
I^*[y, z, \lambda] = \int_{x_0}^{x_1} \Big[ \sqrt{1 + y'^2 + z'^2} + \lambda(x)\,\phi(x, y, z) \Big]\,dx
\tag{14}
$$

and use the method of Sec. 10.3 to determine the functions y(x), z(x), and
$\lambda(x)$ that minimize $I^*$. The necessary conditions are

$$
\lambda\,\frac{\partial\phi}{\partial y} - \frac{d}{dx}\,\frac{y'}{\sqrt{1 + y'^2 + z'^2}} = 0,
\tag{15}
$$

$$
\lambda\,\frac{\partial\phi}{\partial z} - \frac{d}{dx}\,\frac{z'}{\sqrt{1 + y'^2 + z'^2}} = 0,
\tag{16}
$$

$$
\phi(x, y, z) = 0.
\tag{17}
$$

This system of equations determines the functions y(x), z(x), and $\lambda(x)$.
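
As a quick plausibility check of Eqs. (15)-(17), consider the simplest surface, a horizontal plane. The choice φ = z − c with a constant c is an assumption made here for illustration; the system then reduces to the expected straight line with a vanishing multiplier.

```latex
% Assumed surface: \phi(x, y, z) = z - c, so \phi_y = 0 and \phi_z = 1.
% Eq. (17): z = c, hence z' = 0.
% Eq. (15): \frac{d}{dx}\,\frac{y'}{\sqrt{1 + y'^2}} = 0
%           \;\Longrightarrow\; y' = \text{const}
%           \;\Longrightarrow\; y = \alpha x + \beta \quad (\text{a straight line}).
% Eq. (16): \lambda = \frac{d}{dx}\,\frac{z'}{\sqrt{1 + y'^2 + z'^2}} = 0 .
\phi = z - c:
\qquad z = c, \qquad y = \alpha x + \beta, \qquad \lambda(x) = 0 .
```

The constants α and β are fixed by the two given end points.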

10.6. NATURAL BOUNDARY CONDITIONS 

In previous sections we considered variational problems in which the
admissible functions have prescribed values on the boundary. We shall now
consider problems in which no boundary values are prescribed for the
admissible functions. Such problems lead to the natural boundary conditions.

Consider again the functional (10.1:1), but now omit the boundary
condition (10.1:2). Following the arguments of Sec. 10.1, we obtain the
necessary condition (10.1:12),

$$
0 = \int_a^b \Big\{ F_y - \frac{d}{dx}F_{y'} \Big\}\,\eta(x)\,dx + F_{y'} \cdot \eta(x) \Big|_a^b.
\tag{1}
$$

Fig. 10.5:2. A problem of geodesics.
This equation must hold for all arbitrary functions $\eta(x)$ and, in particular,
for arbitrary functions $\eta(x)$ with $\eta(a) = \eta(b) = 0$. This leads at once
to the Euler equation (10.1:15)

$$
F_y(x, y, y') - \frac{d}{dx}F_{y'}(x, y, y') = 0, \qquad \text{in } a \le x \le b.
\tag{2}
$$

In contrast to Sec. 10.1, however, the last term $F_{y'} \cdot \eta(x)\big|_a^b$ does
not vanish by prescription. Hence, by Eqs. (1) and (2), we must have

$$
(F_{y'} \cdot \eta)_{x=b} - (F_{y'} \cdot \eta)_{x=a} = 0
\tag{3}
$$

for all functions $\eta(x)$. But now $\eta(a)$ and $\eta(b)$ are arbitrary. Taking
two functions $\eta_1(x)$ and $\eta_2(x)$ with

$$
\eta_1(a) = 1, \quad \eta_1(b) = 0; \qquad \eta_2(a) = 0, \quad \eta_2(b) = 1;
\tag{4}
$$

we get, from (3),

$$
F_{y'}[a, y(a), y'(a)] = 0, \qquad F_{y'}[b, y(b), y'(b)] = 0.
\tag{5}
$$

The conditions (5) are called the natural boundary conditions of our problem.
They are the boundary conditions which must be satisfied by the
function y(x) if the functional J[u] reaches an extremum at u(x) = y(x),
provided that y(a) and y(b) are entirely arbitrary.

Thus, if the first variation of a functional J[u] vanishes at u(x) = y(x),
and if the boundary values of u(x) at x = a and x = b are arbitrary, then
y(x) must satisfy not only the Euler equation but also the natural boundary
conditions which, in general, involve the derivatives of y(x). In contrast
to the natural boundary conditions, the conditions $u(a) = \alpha$, $u(b) = \beta$,
which specify the boundary values of u(x) at x = a and b, are called rigid
boundary conditions.
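
A small numerical experiment makes the natural boundary condition concrete. The sketch assumes the illustrative functional J[u] = ∫₀¹ (½u'² − u) dx with the rigid condition u(0) = 0 and no condition at x = 1; this functional is not taken from the text. Every member of the trial family u = cx − x²/2 satisfies the Euler equation u'' = −1 and the rigid condition, so the family differs only in the end slope u'(1) = c − 1. Minimizing over c should therefore select the member with F_{u'} = u'(1) = 0, as Eq. (5) predicts.

```python
def J(c, n=1000):
    """Midpoint-rule value of J[u] for the trial function u(x) = c*x - x**2/2."""
    dx = 1.0 / n
    total = 0.0
    for i in range(n):
        x = (i + 0.5) * dx
        u = c * x - 0.5 * x * x
        du = c - x
        total += (0.5 * du * du - u) * dx
    return total

# Scan the free parameter c (the slope at x = 0) between 0.5 and 1.5
candidates = [0.5 + 0.01 * k for k in range(101)]
c_best = min(candidates, key=J)

end_slope = c_best - 1.0   # u'(1) for the minimizing member of the family
J_min = J(c_best)
print(c_best, end_slope, J_min)
```

Within this family the minimum occurs at c = 1, where the end slope vanishes and J takes the value −1/6.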

The concept of natural boundary conditions is also important in a more
general type of variational problem in which boundary values occur explicitly
in the functionals. It can be generalized also to functionals involving
several dependent and independent variables and higher derivatives of the
dependent variables.

The idea of deriving natural boundary conditions for a physical problem 
is of great importance and will be illustrated again and again later. See, 
for example, Secs. 10.8 and 11.2. 
10.7. THEOREM OF MINIMUM POTENTIAL ENERGY UNDER SMALL
VARIATIONS OF DISPLACEMENTS

Let a body be in static equilibrium under the action of specified body and
surface forces (Fig. 10.7:1). The boundary surface S shall be assumed to
consist of two parts, $S_\sigma$ and $S_u$, with the following boundary conditions.

Over $S_\sigma$: The surface traction $\overset{\nu}{T}_i$ is prescribed.
Over $S_u$: The displacement $u_i$ is prescribed.

We assume that there exists a system of displacements $u_1, u_2, u_3$ satisfying
the Navier's equations of equilibrium and the given boundary conditions.
Let us consider a class of arbitrary displacements $u_i + \delta u_i$ consistent
with the constraints imposed on the body. Thus $\delta u_i$ must vanish over
$S_u$, but it is arbitrary over $S_\sigma$. We further restrict $\delta u_i$ to be
triply differentiable and to be of such an order of magnitude that the material
remains elastic. Such arbitrary displacements $\delta u_i$ are called virtual
displacements.

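Let us assume that static equilibrium prevails and compute the virtual
work done by the body force $F_i$ per unit volume and the surface force
$\overset{\nu}{T}{}^{*}_i$ per unit area:

$$
\int_V F_i\,\delta u_i\,dv + \int_S \overset{\nu}{T}{}^{*}_i\,\delta u_i\,dS.
$$

On substituting $\overset{\nu}{T}{}^{*}_i = \sigma_{ij}\nu_j$ and transforming
according to Gauss' theorem, we have

$$
\begin{aligned}
\int_S \overset{\nu}{T}{}^{*}_i\,\delta u_i\,dS
&= \int_S \sigma_{ij}\,\delta u_i\,\nu_j\,dS
 = \int_V (\sigma_{ij}\,\delta u_i)_{,j}\,dv \\
&= \int_V \sigma_{ij,j}\,\delta u_i\,dv + \int_V \sigma_{ij}\,\delta u_{i,j}\,dv.
\end{aligned}
\tag{1}
$$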
According to the equation of equilibrium the first integral on the right-hand
side is equal to $-\int F_i\,\delta u_i\,dv$. On account of the symmetry of
$\sigma_{ij}$, the second integral may be written as

$$
\int_V \sigma_{ij}\,\tfrac{1}{2}(\delta u_{i,j} + \delta u_{j,i})\,dv = \int_V \sigma_{ij}\,\delta e_{ij}\,dv.
$$

Therefore, Eq. (1) becomes

$$
\int_V F_i\,\delta u_i\,dv + \int_{S_\sigma} \overset{\nu}{T}{}^{*}_i\,\delta u_i\,dS = \int_V \sigma_{ij}\,\delta e_{ij}\,dv.
\tag{2}
$$

This equation expresses the principle of virtual work. The surface integral
needs only be integrated over $S_\sigma$, since $\delta u_i$ vanishes over the
surface $S_u$ where boundary displacements are given.

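If the strain-energy function $W(e_{11}, e_{12}, \ldots)$ exists, so that
$\sigma_{ij} = \partial W/\partial e_{ij}$ (Sec. 6.1), then it can be introduced
into the right-hand side of Eq. (2). Since

$$
\int_V \sigma_{ij}\,\delta e_{ij}\,dv = \int_V \frac{\partial W}{\partial e_{ij}}\,\delta e_{ij}\,dv = \delta \int_V W\,dv,
\tag{3}
$$

the principle of virtual work can be stated as

$$
\delta \int_V W\,dv - \int_V F_i\,\delta u_i\,dv - \int_{S_\sigma} \overset{\nu}{T}{}^{*}_i\,\delta u_i\,dS = 0.
\tag{4}
$$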

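Further simplification is possible if the body force $F_i$ and the surface
tractions $\overset{\nu}{T}{}^{*}_i$ are conservative so that

$$
F_i = -\frac{\partial G}{\partial u_i}, \qquad \overset{\nu}{T}{}^{*}_i = -\frac{\partial g}{\partial u_i}.
\tag{5}
$$

The functions $G(u_1, u_2, u_3)$ and $g(u_1, u_2, u_3)$ are called the potential of
$F_i$ and $\overset{\nu}{T}{}^{*}_i$, respectively. In this case,

$$
-\int_V F_i\,\delta u_i\,dv - \int_{S_\sigma} \overset{\nu}{T}{}^{*}_i\,\delta u_i\,dS
= \delta \int_V G\,dv + \delta \int_{S_\sigma} g\,dS.
\tag{6}
$$

Then Eq. (4) may be written as

$$
\delta V = 0,
\tag{7}
$$

where

$$
V = \int_V (W + G)\,dv + \int_{S_\sigma} g\,dS.
\tag{8}
$$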
The function V is called the potential energy of the system. This equation
states that the potential energy has a stationary value in a class of
admissible variations $\delta u_i$ of the displacements $u_i$ in the equilibrium
state. Formulated in another way, it states that, of all displacements satisfying
the given boundary conditions, those which satisfy the equations of
equilibrium are distinguished by a stationary (extreme) value of the potential
energy. For a rigid body, W vanishes and the familiar form is recognized.
We emphasize that the linearity of the stress-strain relationship has not
been invoked in the above derivation, so that this principle is valid for
nonlinear, as well as linear, stress-strain law, as long as the body remains
elastic.

That this stationary value is a minimum in the neighborhood of the
natural, unstrained state follows from the assumption that the strain energy
function is positive definite in such a neighborhood (see Secs. 12.4 and
12.5). This can be shown by comparing the potential energy V of the actual
displacements $u_i$ with the energy V' of another system of displacements
$u_i + \delta u_i$ satisfying the condition $\delta u_i = 0$ over $S_u$. We have

$$
V' - V = \int_V [W(e_{11} + \delta e_{11}, \ldots) - W(e_{11}, \ldots)]\,dv
- \int_V F_i\,\delta u_i\,dv - \int_{S_\sigma} \overset{\nu}{T}{}^{*}_i\,\delta u_i\,dS.
\tag{9}
$$

Expanding $W(e_{11} + \delta e_{11}, \ldots)$ into a power series, we have

$$
W(e_{11} + \delta e_{11}, \ldots) = W(e_{11}, \ldots) + \frac{\partial W}{\partial e_{ij}}\,\delta e_{ij}
+ \frac{1}{2}\,\frac{\partial^2 W}{\partial e_{ij}\,\partial e_{kl}}\,\delta e_{ij}\,\delta e_{kl} + \cdots
\tag{10}
$$

A substitution into Eq. (9) yields, up to the second order in $\delta e_{ij}$,

$$
\begin{aligned}
V' - V = {} & \int_V \frac{\partial W}{\partial e_{ij}}\,\delta e_{ij}\,dv - \int_V F_i\,\delta u_i\,dv
 - \int_{S_\sigma} \overset{\nu}{T}{}^{*}_i\,\delta u_i\,dS \\
& + \int_V \frac{1}{2}\,\frac{\partial^2 W}{\partial e_{ij}\,\partial e_{kl}}\,\delta e_{ij}\,\delta e_{kl}\,dv.
\end{aligned}
\tag{11}
$$

The sum of the terms in the first line on the right-hand side vanishes on
account of Eq. (4). The sum in the second line is positive for sufficiently
small values of strain $\delta e_{ij}$, as can be seen as follows. Let us set
$e_{ij} = 0$ in Eq. (10). The constant term W(0) is immaterial. The linear term
must vanish because $\partial W/\partial e_{ij} = \sigma_{ij}$, which must vanish as
$e_{ij} \to 0$. Hence, up to the second order,

$$
\frac{1}{2}\,\frac{\partial^2 W}{\partial e_{ij}\,\partial e_{kl}}\,\delta e_{ij}\,\delta e_{kl} = W(\delta e_{ij}).
\tag{12}
$$

Therefore, Eq. (11) becomes

$$
V' - V = \int_V W(\delta e_{ij})\,dv.
\tag{13}
$$

If $W(\delta e_{ij})$ is positive definite, then the last line in Eq. (11) is
positive, and

$$
V' - V \ge 0,
\tag{14}
$$

and that V is a minimum is proved. Accordingly, our principle is called
the principle of minimum potential energy. The equality sign holds only if
all $\delta e_{ij}$ vanish, i.e., if the virtual displacements consist of a virtual
rigid-body motion. If there were three or more points of the body fixed in space,
such a rigid-body motion would be excluded, and V is a strong minimum;
otherwise it is a weak minimum.

To recapitulate, we remark again that the variational principle (2) is
generally valid; Eq. (4) is established whenever the strain energy function
$W(e_{11}, e_{12}, \ldots)$ exists; and Eq. (7) is established when the potential
energy V can be meaningfully defined, but the fact that V is a minimum for
"actual" displacements is established only in the neighborhood of the stable
natural state, where W is positive definite.
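
The structure of Eqs. (9)-(14) can be seen in a discrete analogue. The sketch below is an illustration under stated assumptions, not a model from the text: two linear springs in series, fixed at one end and loaded by a constant force P at the other, with hypothetical stiffnesses K1 and K2. For this system W is a positive definite quadratic form, the first-order terms cancel at equilibrium, and V' − V equals the strain energy of the variation alone, as in Eq. (13).

```python
K1, K2, P = 2.0, 3.0, 6.0   # hypothetical spring stiffnesses and end load

def strain_energy(u1, u2):
    """W for two springs in series; u1, u2 are the displacements of the two nodes."""
    return 0.5 * K1 * u1 ** 2 + 0.5 * K2 * (u2 - u1) ** 2

def potential_energy(u1, u2):
    """V = W minus the work of the constant end load, compare Eq. (20)."""
    return strain_energy(u1, u2) - P * u2

# Equilibrium displacements: each spring carries the force P
u1_eq = P / K1
u2_eq = P / K1 + P / K2
V_eq = potential_energy(u1_eq, u2_eq)

# Arbitrary virtual displacements (d1, d2) of the two free nodes
trial_variations = [(0.1, 0.0), (0.0, -0.2), (0.3, 0.3), (-0.05, 0.4)]
differences = [potential_energy(u1_eq + d1, u2_eq + d2) - V_eq
               for d1, d2 in trial_variations]
second_order_terms = [strain_energy(d1, d2) for d1, d2 in trial_variations]
print(V_eq, differences, second_order_terms)
```

Each difference V' − V is positive and coincides with the strain energy of the variation, which is the discrete counterpart of Eqs. (13) and (14).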

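Conversely, we may show that the variational principle gives the equations
of elasticity. In fact, starting from Eq. (7) and varying $u_i$, we have

$$
\delta V = \int_V \frac{\partial W}{\partial e_{ij}}\,\delta e_{ij}\,dv - \int_V F_i\,\delta u_i\,dv
- \int_{S_\sigma} \overset{\nu}{T}{}^{*}_i\,\delta u_i\,dS = 0.
\tag{15}
$$

But

$$
\begin{aligned}
\int_V \frac{\partial W}{\partial e_{ij}}\,\delta e_{ij}\,dv
&= \frac{1}{2} \int_V \sigma_{ij}\,(\delta u_{i,j} + \delta u_{j,i})\,dv \\
&= -\int_V \sigma_{ij,j}\,\delta u_i\,dv + \int_S \sigma_{ij}\,\nu_j\,\delta u_i\,dS.
\end{aligned}
$$

Hence,

$$
\int_V (\sigma_{ij,j} + F_i)\,\delta u_i\,dv - \int_S (\sigma_{ij}\nu_j - \overset{\nu}{T}{}^{*}_i)\,\delta u_i\,dS = 0,
\tag{16}
$$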
which can be satisfied for arbitrary $\delta u_i$ if

$$
\sigma_{ij,j} + F_i = 0 \quad \text{in } V,
\tag{17}
$$

$$
\delta u_i = 0 \quad \text{on } S_u \quad \text{(rigid boundary condition)},
\tag{18}
$$

$$
\overset{\nu}{T}{}^{*}_i = \sigma_{ij}\nu_j \quad \text{on } S_\sigma \quad \text{(natural boundary condition)}.
\tag{19}
$$

The one-to-one correspondence between the differential equations of
equilibrium and the variational equation is thus demonstrated; for we have
first derived Eq. (7) from the equation of equilibrium and then have shown
that, conversely, Eqs. (17)-(19) necessarily follow from Eq. (7).

The commonly encountered external force systems in elasticity are
conservative systems in which the body force $F_i$ and surface tractions
$\overset{\nu}{T}_i$ are independent of the elastic deformation of the body. In
this case, V is more commonly written as

$$
V = \int_V W\,dv - \int_V F_i u_i\,dv - \int_{S_\sigma} \overset{\nu}{T}{}^{*}_i u_i\,dS.
\tag{20}
$$

A branch of mechanics in which the external forces are in general
nonconservative is the theory of aeroelasticity. In aeroelasticity one is
concerned with the interaction of aerodynamic forces and elastic deformation.
The aerodynamic forces depend on the flow and the deformation of the
entire body, not just the local deformation; thus in general they cannot be
derived from a potential function. The principle of virtual work, in the
form of Eq. (4), is still applicable to aeroelasticity.

Problem 10.1. The minimum potential energy principle states that elastic
equilibrium is equivalent to the condition J = min, where

$$
\text{(a)} \quad J = \int_V [W(e_{ij}) - F_i u_i]\,dV - \int_{S_\sigma} \overset{\nu}{T}_i u_i\,dS, \qquad (\text{varying } u_i);
$$

$$
\text{(b)} \quad e_{ij} = \tfrac{1}{2}(u_{i,j} + u_{j,i}).
$$

By the method of Lagrange multipliers, the subsidiary condition (b) can be
incorporated into the functional J, and we are led to consider the variational
equation

$$
\text{(c)} \quad \delta J' = 0, \qquad (\text{varying } e_{ij},\ u_i,\ \lambda_{ij} \text{ independently}),
$$

where

$$
\text{(d)} \quad J' = J + \int_V \lambda_{ij} \big[ e_{ij} - \tfrac{1}{2}(u_{i,j} + u_{j,i}) \big]\,dV.
$$

Since the quantity in [ ] is symmetric in i, j, we may restrict the Lagrange
multipliers $\lambda_{ij}$ to be symmetric, $\lambda_{ij} = \lambda_{ji}$, so that only
six independent multipliers are needed. Derive the Euler equations for (c) and
show that $-\lambda_{ij}$ should be interpreted as the stress tensor.

Note: Once Lagrange's multipliers are employed, the phrase "minimum
conditions" used in the principle of minimum potential energy has to be
replaced by "stationary conditions."


10.8. EXAMPLE OF APPLICATION: STATIC LOADING ON 
A BEAM —NATURAL AND RIGID END CONDITIONS 

As an illustration of the application of the minimum potential energy
principle in formulating approximate theories of elasticity, let us consider
the approximate theory of bending of a slender beam under static loading.
Let the beam be perfectly straight and lying along the x-axis before the
application of external loading, which consists of a distributed lateral load
p(x) per unit length, a bending moment $M_0$, and a shearing force $Q_0$ at the
end x = 0, and a moment $M_l$ and a shear $Q_l$ at the end x = l (Fig. 10.8:1).
We assume that the principal axes of inertia of every cross section of the
beam lie in two mutually orthogonal principal planes and that the loading
p, M, Q are applied in one of the principal planes. In accordance with the
approximate beam theory, we assume that every plane cross section of the
beam remains plane during bending. If the deformation is small, then
the analysis given in Sec. 7.7 applies.

Fig. 10.8:1. Applications to a simple beam.

Let ds be the arc length along the longitudinal neutral axis. Under
these assumptions, we see that when the neutral axis of the beam is bent
from the initial straight line into a curve with radius of curvature R, the
length of a filament, initially ds and parallel to the neutral axis, is altered
by the bending in the ratio $1 : (1 + \eta/R)$, where $\eta$ is the distance between
the filament and the neutral axis. The strain is $\eta/R$, and the force acting
on the filament is $E\eta\,dA/R$, where dA is the cross section of the filament.
The resultant moment of these forces about the neutral axis is denoted by
M, which is

$$
M = \int_A \frac{E\eta}{R}\,\eta\,dA = \frac{E}{R} \int_A \eta^2\,dA = \frac{EI}{R},
$$

where I is the moment of inertia of the area of the beam cross section.

The bending moment acting on the cross sections at the ends of a beam
segment of length ds does work if the cross sections rotate relative to each
other. Let $\theta$ be the angle between a tangent to the neutral axis and the
x-axis. In a segment of length ds, the tangent rotates by an amount $d\theta$.
The rate of change $d\theta/ds$ is, by definition, the curvature of the neutral axis.
R being the radius of curvature (see Fig. 10.8:1), $d\theta/ds = 1/R$. Therefore,
the angle through which the cross sections rotate relative to each other is
ds/R. Hence, the work to bend a segment ds of the beam from a curvature
of zero to a curvature 1/R is

$$
\frac{1}{2}\,M\,\frac{ds}{R} = \frac{1}{2}\,\frac{EI}{R^2}\,ds,
$$

where the factor $\tfrac{1}{2}$ is added since the mean work is half the value of the
product of the final moment and angle of rotation. Hence, integrating
throughout the beam, we have the strain energy

$$
U = \frac{1}{2} \int_0^l \frac{EI}{R^2}\,ds.
\tag{1}
$$

If we now assume that the deflection of the beam is infinitesimal so that
if y denotes the lateral deflection, the curvature 1/R is approximated by
$d^2y/dx^2$, and ds is approximated by dx, then

$$
U = \frac{1}{2} \int_0^l EI \left( \frac{d^2y}{dx^2} \right)^2 dx.
$$

The potential energy of the external loading is, with the sign convention
specified in Fig. 10.8:1 and with $y_0$, $y_l$, $(dy/dx)_0$, $(dy/dx)_l$ denoting the
value of y and dy/dx at x = 0, l, respectively,

$$
-\int_0^l p(x)\,y(x)\,dx + M_0 \left( \frac{dy}{dx} \right)_0 - M_l \left( \frac{dy}{dx} \right)_l - Q_0 y_0 + Q_l y_l.
\tag{2}
$$
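Hence, the total potential energy is

$$
V = \frac{1}{2} \int_0^l EI \left( \frac{d^2y}{dx^2} \right)^2 dx - \int_0^l p\,y\,dx
+ M_0 \left( \frac{dy}{dx} \right)_0 - M_l \left( \frac{dy}{dx} \right)_l - Q_0 y_0 + Q_l y_l.
\tag{3}
$$

At equilibrium, the variation of V with respect to the virtual displacement
$\delta y$ must vanish. Hence,

$$
\delta V = \int_0^l EI\,\frac{d^2y}{dx^2}\,\delta\!\left( \frac{d^2y}{dx^2} \right) dx - \int_0^l p\,\delta y\,dx
+ M_0\,\delta\!\left( \frac{dy}{dx} \right)_0 - M_l\,\delta\!\left( \frac{dy}{dx} \right)_l - Q_0\,\delta y_0 + Q_l\,\delta y_l = 0.
\tag{4}
$$

Integrating the first term by parts twice and collecting terms, we obtain

$$
\begin{aligned}
& \int_0^l \left[ \frac{d^2}{dx^2} \left( EI\,\frac{d^2y}{dx^2} \right) - p \right] \delta y\,dx
+ \left[ EI \left( \frac{d^2y}{dx^2} \right)_l - M_l \right] \delta\!\left( \frac{dy}{dx} \right)_l
- \left[ \frac{d}{dx} \left( EI\,\frac{d^2y}{dx^2} \right)_l - Q_l \right] \delta y_l \\
& \quad - \left[ EI \left( \frac{d^2y}{dx^2} \right)_0 - M_0 \right] \delta\!\left( \frac{dy}{dx} \right)_0
+ \left[ \frac{d}{dx} \left( EI\,\frac{d^2y}{dx^2} \right)_0 - Q_0 \right] \delta y_0 = 0.
\end{aligned}
$$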

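Since $\delta y$ is arbitrary in the interval (0, l), we obtain the differential
equation of the beam

$$
\frac{d^2}{dx^2} \left( EI\,\frac{d^2y}{dx^2} \right) - p = 0, \qquad 0 < x < l.
\tag{5}
$$

In order that the remaining terms may vanish, we impose the end conditions

$$
\text{Either} \quad EI \left( \frac{d^2y}{dx^2} \right)_l - M_l = 0 \quad \text{or} \quad \delta\!\left( \frac{dy}{dx} \right)_l = 0.
\tag{6a}
$$

$$
\text{Either} \quad EI \left( \frac{d^2y}{dx^2} \right)_0 - M_0 = 0 \quad \text{or} \quad \delta\!\left( \frac{dy}{dx} \right)_0 = 0.
\tag{6b}
$$

$$
\text{Either} \quad \frac{d}{dx} \left( EI\,\frac{d^2y}{dx^2} \right)_l - Q_l = 0 \quad \text{or} \quad \delta y_l = 0.
\tag{6c}
$$

$$
\text{Either} \quad \frac{d}{dx} \left( EI\,\frac{d^2y}{dx^2} \right)_0 - Q_0 = 0 \quad \text{or} \quad \delta y_0 = 0.
\tag{6d}
$$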

If the deflection $y_0$ is prescribed at the end x = 0, then $\delta y_0 = 0$. If the
slope $(dy/dx)_0$ is prescribed at the end x = 0, then $\delta(dy/dx)_0 = 0$. These
are called rigid boundary conditions. On the other hand, if the value of $y_0$
is unspecified and perfectly free, then $\delta y_0$ is arbitrary and we must have

$$
\frac{d}{dx} \left( EI\,\frac{d^2y}{dx^2} \right)_0 - Q_0 = 0
\tag{7}
$$

as an end condition for a free end subjected to a shear load $Q_0$; otherwise
$\delta V$ cannot vanish for arbitrary variations $\delta y_0$. Equation (7) is called a
natural boundary condition. Similarly, all the left-hand equations in
Eqs. (6a)-(6d) are natural boundary conditions, and all the right-hand equations
are rigid boundary conditions.

The distinction between natural and rigid boundary conditions assumes
great importance in the application of the direct methods of solution of
variational problems; the assumed functions in the direct methods must
satisfy the rigid boundary conditions. See Sec. 11.8.
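
The role of the two kinds of boundary conditions in a direct method can be sketched for a cantilever. The example rests on assumptions that are not in the text: uniform EI, no distributed load, an end load P whose potential is taken as −P y(l), and the two-term trial deflection y = a x² + b x³. The trial function satisfies only the rigid conditions y(0) = 0 and y'(0) = 0 at the clamped end; the natural conditions at the free end are not imposed and are recovered by making V stationary.

```python
def ritz_cantilever(P, EI, l):
    """Stationary point of V(a, b) for the trial deflection y = a*x**2 + b*x**3.

    V(a, b) = EI*(2*a**2*l + 6*a*b*l**2 + 6*b**2*l**3) - P*(a*l**2 + b*l**3),
    obtained from V = (1/2) * integral of EI*y''**2 dx - P*y(l).
    """
    # dV/da = 0 and dV/db = 0 form a 2x2 linear system, solved by Cramer's rule.
    a11, a12, r1 = 4.0 * EI * l, 6.0 * EI * l ** 2, P * l ** 2
    a21, a22, r2 = 6.0 * EI * l ** 2, 12.0 * EI * l ** 3, P * l ** 3
    det = a11 * a22 - a12 * a21
    a = (r1 * a22 - a12 * r2) / det
    b = (a11 * r2 - a21 * r1) / det
    return a, b

P, EI, l = 1.0, 2.0, 3.0   # hypothetical load, bending stiffness, and length
a, b = ritz_cantilever(P, EI, l)

tip_deflection = a * l ** 2 + b * l ** 3
tip_moment = EI * (2.0 * a + 6.0 * b * l)   # EI * y''(l)
tip_shear = EI * 6.0 * b                    # d/dx (EI * y'') at x = l
print(tip_deflection, tip_moment, tip_shear)
```

For these assumed values the tip deflection equals P l³/(3EI), the end moment EI y''(l) vanishes, and d/dx(EI y'') at the tip equals −P. The natural conditions of this particular V are thus met without having been built into the trial function.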

It is worthwhile to consider the following question. The reader must
be familiar with the fact that in the engineering beam theory, the end
conditions often considered are:

$$
\text{Clamped end:} \qquad y = 0, \qquad \frac{dy}{dx} = 0,
\tag{8a}
$$

$$
\text{Free end:} \qquad EI\,\frac{d^2y}{dx^2} = 0, \qquad \frac{d}{dx} \left( EI\,\frac{d^2y}{dx^2} \right) = 0,
\tag{8b}
$$

$$
\text{Simply supported end:} \qquad y = 0, \qquad EI\,\frac{d^2y}{dx^2} = 0,
\tag{8c}
$$

where y(x) is the deflection function of the beam. May we ask why the
other two combinations, namely

$$
y = 0, \qquad \frac{d}{dx} \left( EI\,\frac{d^2y}{dx^2} \right) = 0,
\tag{9a}
$$

$$
\frac{dy}{dx} = 0, \qquad \frac{d^2y}{dx^2} = 0,
\tag{9b}
$$

are never considered?
An acceptable answer is perhaps that the boundary conditions (9a) and
(9b) cannot be realized easily in the laboratory. But a more satisfying
answer is that they are not proper sets of boundary conditions. If the
conditions (9a) or (9b) were imposed, then, according to Eq. (4), it cannot
at all be assured that the equation $\delta V = 0$ will be satisfied. Thus, a basic
physical law might be violated. These boundary conditions are, therefore,
inadmissible.

From the point of view of the theory of differential equations, one may
feel that the end conditions (9a) or (9b) are legitimate for Eq. (5).
Nevertheless, they are ruled out by the minimum potential energy principle on
physical grounds. In fact, in the theory of differential equations, Eq. (5)
and the end conditions (8) are known to form a so-called self-adjoint
differential system, whereas Eqs. (5) and (9) would form a nonself-adjoint
differential system. A very great difference in mathematical character exists
between these two categories. For example, a free vibration problem of a
nonself-adjoint system may not have an eigenvector, or it may have complex
eigenvalues or complex eigenvectors.

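There are other conceivable admissible boundary conditions, such as to
require

$$
\frac{dy}{dx} = 0, \qquad \frac{d}{dx} \left( EI\,\frac{d^2y}{dx^2} \right) = 0.
\tag{10}
$$

Such an end, with zero slope and zero shear, cannot be easily established
in the laboratory. Similarly, it is conceivable that one may require that at
the end x = 0, the following ratios hold:

$$
\begin{aligned}
& \delta\!\left( \frac{dy}{dx} \right) : \delta y = c, \quad \text{a constant}, \\
& \left[ \frac{d}{dx} \left( EI\,\frac{d^2y}{dx^2} \right) - Q_0 \right] : \left[ EI\,\frac{d^2y}{dx^2} - M_0 \right] = c, \quad \text{the same constant}.
\end{aligned}
\tag{11}
$$

This pair of conditions is also admissible, but is unlikely to be encountered
in practice.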


10.9. THE COMPLEMENTARY ENERGY THEOREM UNDER 
SMALL VARIATIONS OF STRESSES 

In contrast to the previous sections let us now consider the variation
of stresses in order to investigate whether the "actual" stresses satisfy a
minimum principle. We pose the problem as in Sec. 10.7 with a body held in
equilibrium under the body force per unit volume, $F_i$, and surface tractions
per unit area, $\overset{\nu}{T}{}^{*}_i$, over the boundary $S_\sigma$, whereas over the
boundary $S_u$ the displacements are prescribed. Let $\sigma_{ij}$ be the "actual"
stress field which satisfies the equations of equilibrium and boundary conditions

$$
\begin{aligned}
\sigma_{ij,j} + F_i &= 0 \quad \text{in } V, \\
\sigma_{ij}\nu_j &= \overset{\nu}{T}{}^{*}_i \quad \text{on } S_\sigma.
\end{aligned}
\tag{1}
$$
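Let us now consider a system of variations of stresses which also satisfy the
equations of equilibrium and the stress boundary conditions

$$
\begin{aligned}
& (\delta\sigma_{ij})_{,j} + \delta F_i = 0 \quad \text{in } V, \\
& (\delta\sigma_{ij})\,\nu_j = \delta\overset{\nu}{T}_i \quad \text{on } S_\sigma, \\
& \delta\sigma_{ij} \ \text{are arbitrary on } S_u.
\end{aligned}
\tag{2}
$$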
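In contrast to the previous sections, we shall now consider the
complementary virtual work,

$$
\int_V u_i\,\delta F_i\,dv + \int_S u_i\,\delta\overset{\nu}{T}_i\,dS,
$$

which, by virtue of Eq. (2) and through integration by parts, becomes

$$
\begin{aligned}
& = -\int_V u_i\,(\delta\sigma_{ij})_{,j}\,dv + \int_S u_i\,(\delta\sigma_{ij})\,\nu_j\,dS \\
& = \int_V (\delta\sigma_{ij})\,u_{i,j}\,dv - \int_S u_i\,\nu_j\,(\delta\sigma_{ij})\,dS + \int_S u_i\,(\delta\sigma_{ij})\,\nu_j\,dS \\
& = \frac{1}{2} \int_V (\delta\sigma_{ij})(u_{i,j} + u_{j,i})\,dv \\
& = \int_V e_{ij}\,\delta\sigma_{ij}\,dv.
\end{aligned}
$$

Hence,

$$
\int_V e_{ij}\,\delta\sigma_{ij}\,dv = \int_V u_i\,\delta F_i\,dv + \int_S u_i\,\delta\overset{\nu}{T}_i\,dS.
\tag{3}
$$

This equation may be called the principle of virtual complementary work.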
Now, if we introduce the complementary strain energy $W_c$,† which is a
function of the stress components $\sigma_{11}, \sigma_{12}, \ldots$, and which has the
property that

$$
\frac{\partial W_c}{\partial \sigma_{ij}} = e_{ij},
\tag{4}
$$

then the complementary virtual work may be written as

$$
\int_V u_i\,\delta F_i\,dv + \int_S u_i\,\delta\overset{\nu}{T}_i\,dS
= \int_V \frac{\partial W_c}{\partial \sigma_{ij}}\,\delta\sigma_{ij}\,dv = \delta \int_V W_c\,dv.
\tag{5}
$$

†The Gibbs' thermodynamic potential (Sec. 12.3) per unit volume, $\rho\Phi$, is
equal to the negative of the complementary strain energy function. If the
stress-strain law were linear, then $W_c(\sigma_{ij})$ and $W(e_{ij})$ are equal:
$-\rho\Phi = W_c = W$ (for linear stress-strain law).
Since the volume is fixed and $u_i$ are not varied, the result above can be
written as

$$
\delta V^* = 0,
\tag{6}
$$

where $V^*$ as a function of the stresses $\sigma_{11}, \sigma_{12}, \ldots$, the surface
traction $\overset{\nu}{T}_i$ and the body force per unit volume $F_i$, is defined as
the complementary energy

$$
V^*(\sigma_{11}, \ldots, F_i) = \int_V W_c\,dv - \int_V u_i F_i\,dv - \int_S u_i\,\overset{\nu}{T}_i\,dS.
\tag{7}
$$

In practice, we would like to compare stress fields which all satisfy the
equations of equilibrium, but not necessarily the conditions of compatibility.
In other words, we would have $\delta F_i = 0$ in $V$ and
$\delta\overset{\nu}{T}_i = 0$ on $S_\sigma$. In this case $\delta\sigma_{ij}$
and, hence, $\delta\overset{\nu}{T}_i$ are arbitrary only on that portion of
the boundary where displacements are prescribed, $S_u$. Therefore, only a
surface integral over $S_u$ is left in the left-hand side of Eq. (5) and we have

(8) ▲  $\mathcal{V}^*(\sigma_{11}, \ldots, \sigma_{33}) = \int_V W_c\,dv - \int_{S_u} u_i\overset{\nu}{T}_i\,dS$.

Therefore, we have the following theorem.

Theorem. Of all stress tensor fields $\sigma_{ij}$ that satisfy the equation of
equilibrium and the boundary conditions where stresses are prescribed, the
“actual” one is distinguished by a stationary (extreme) value of the
complementary energy $\mathcal{V}^*(\sigma_{11}, \ldots, \sigma_{33})$ as given
by Eq. (8).

In this formulation, the linearity of the stress-strain relationship is not
required; only the existence of the complementary strain energy function is
assumed. However, if the stress-strain law is linear, and the material is
isotropic and obeys Hooke’s law, then

(9)  $W_c(\sigma_{11}, \ldots, \sigma_{33}) = -\dfrac{\nu}{2E}(\sigma_{\alpha\alpha})^2 + \dfrac{1+\nu}{2E}\,\sigma_{ij}\sigma_{ij}$.
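The defining property of Eq. (4), that differentiating $W_c$ with respect to the stresses returns the strains, can be checked numerically for the isotropic form of Eq. (9). The following is a minimal sketch, not part of the original text: the values of `E` and `nu`, the stress state, and all function names are assumptions made for illustration, and the nine stress components are treated as independent when differentiating.

```python
# Sketch: check Eq. (4), e_ij = dW_c/d(sigma_ij), for the isotropic W_c of Eq. (9).
# E, nu and the stress state are illustrative values, not taken from the text.

E, nu = 200.0e3, 0.3  # assumed Young's modulus and Poisson's ratio


def w_c(s):
    """Complementary strain energy density of Eq. (9) for a 3x3 stress s."""
    trace = s[0][0] + s[1][1] + s[2][2]
    s_dot_s = sum(s[i][j] * s[i][j] for i in range(3) for j in range(3))
    return -nu / (2.0 * E) * trace**2 + (1.0 + nu) / (2.0 * E) * s_dot_s


def hooke_strain(s):
    """Strain from Hooke's law: e_ij = [(1+nu) s_ij - nu s_kk delta_ij] / E."""
    trace = s[0][0] + s[1][1] + s[2][2]
    return [[((1.0 + nu) * s[i][j] - nu * trace * (i == j)) / E
             for j in range(3)] for i in range(3)]


def strain_from_w_c(s, h=1.0e-3):
    """Central-difference dW_c/d(sigma_ij), all nine components independent."""
    e = [[0.0] * 3 for _ in range(3)]
    for i in range(3):
        for j in range(3):
            plus = [row[:] for row in s]
            minus = [row[:] for row in s]
            plus[i][j] += h
            minus[i][j] -= h
            e[i][j] = (w_c(plus) - w_c(minus)) / (2.0 * h)
    return e


sigma = [[120.0, 30.0, -10.0], [30.0, -50.0, 20.0], [-10.0, 20.0, 80.0]]
e_hooke = hooke_strain(sigma)
e_numeric = strain_from_w_c(sigma)
max_error = max(abs(e_hooke[i][j] - e_numeric[i][j])
                for i in range(3) for j in range(3))
print(max_error)  # small: the two strain tensors agree up to rounding
```

Because $W_c$ is quadratic in the stresses, the central difference is exact apart from rounding, so the comparison does not depend on a careful choice of the step `h`.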
We must remark that the variational Eq. (10.7:2) and Eq. (3) of the
present section are applicable even if the body is not elastic, for which the
energy functional cannot be defined. These variational equations are used
in the analysis of inelastic bodies.

Before we proceed further, it may be well worthwhile to consider the
concept of complementary work and complementary strain energy. Consider
a simple, perfectly elastic bar subjected to a tensile load. Let the
relationship between the load $P$ and the elongation of the bar $u$ be given
by a unique curve as shown in Fig. 10.9:1. Then the work $W$ is the area
between the displacement axis and the curve, while the complementary work
$W_c$ is that included between the force axis and the curve. Thus, the two
areas complement each other in the rectangular area (force) · (displacement),
which would be the work if the force were acting with its full intensity from
the beginning of the displacement. Naturally, $W$ and $W_c$ are equal if the
material follows Hooke’s law.
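The two areas described above can be computed for a concrete load-elongation curve. The sketch below is illustrative only: the power-law curve $P = k u^n$, the parameter values, and the function name are assumptions, not taken from the text. It integrates $P\,du$ and $u\,dP$ with the trapezoidal rule and compares their sum with the rectangle (force) · (displacement).

```python
# Sketch: work W and complementary work W_c for a bar with load-elongation
# curve P(u). The power-law curve P = k * u**n is an assumed illustration.


def work_and_complementary_work(k, n, u_max, steps=20000):
    """Return (W, W_c, P_max * u_max) using the trapezoidal rule.

    W   = integral of P du  (area between the curve and the displacement axis)
    W_c = integral of u dP  (area between the curve and the force axis)
    """
    w = 0.0
    w_c = 0.0
    u_prev, p_prev = 0.0, 0.0
    for step in range(1, steps + 1):
        u = u_max * step / steps
        p = k * u**n
        w += 0.5 * (p + p_prev) * (u - u_prev)
        w_c += 0.5 * (u + u_prev) * (p - p_prev)
        u_prev, p_prev = u, p
    return w, w_c, p_prev * u_prev


# Linear (Hooke's law) bar: the two areas coincide.
w_lin, wc_lin, rect_lin = work_and_complementary_work(k=2.0, n=1.0, u_max=0.5)
# Nonlinear bar: the areas differ but still add up to the rectangle.
w_non, wc_non, rect_non = work_and_complementary_work(k=2.0, n=0.5, u_max=0.5)
print(w_lin, wc_lin, rect_lin)
print(w_non, wc_non, rect_non)
```

For the assumed power law the ratio $W_c/W$ equals the exponent $n$, so the nonlinear case with $n = 0.5$ gives a complementary work that is half of the work.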




Fig. 10.9:1. Complementary work and strain energy. 

The principle of minimum potential energy was formulated by Willard
Gibbs, and many beautiful applications were shown by Lord Rayleigh.
The complementary energy concept was introduced by F. Z. Engesser; its
applications were developed by H. M. Westergaard. In the hands of
Kirchhoff, the minimum potential energy theorem becomes the foundation
of the approximate theories of plates and shells. Argyris has made the
complementary energy theorem the starting point for practical methods of
analysis of complex elastic structures using modern digital computers. See
Bibliography 10.2, p. 494.

Let us now return to the complementary energy theorem. We shall
show that in the neighborhood of the natural state, the extreme value of
the complementary energy $\mathcal{V}^*$ is actually a minimum. By a natural
state is meant a state of stable thermodynamic existence (see Sec. 12.4). In
the neighborhood of a natural state, the thermodynamic potential per unit
volume $\rho\Phi$ can be approximated by a homogeneous quadratic form of the
stresses. For an isotropic material, $-\rho\Phi$ is given by Eq. (9), in which
$W_c$ is positive definite.

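The proof that $\mathcal{V}^*$ is a minimum is analogous to that in the
previous section, and it can be sketched as follows. Comparing
$\mathcal{V}^*(\sigma_{ij} + \delta\sigma_{ij})$ with
$\mathcal{V}^*(\sigma_{ij})$, we have

(10)  $\mathcal{V}^*(\sigma_{11} + \delta\sigma_{11}, \ldots) - \mathcal{V}^*(\sigma_{11}, \ldots)$

$= \int_V [W_c(\sigma_{11} + \delta\sigma_{11}, \ldots) - W_c(\sigma_{11}, \ldots)]\,dv - \int_{S_u} [\overset{\nu}{T}_i(\sigma_{11} + \delta\sigma_{11}, \ldots) - \overset{\nu}{T}_i(\sigma_{11}, \ldots)]\,u_i\,dS$

$= \int_V \left\{\left[\dfrac{1+\nu}{E}\sigma_{ij} - \dfrac{\nu}{E}\sigma_{\alpha\alpha}\delta_{ij}\right]\delta\sigma_{ij} + W_c(\delta\sigma_{11}, \ldots)\right\}dv - \int_{S_u} (\delta\sigma_{ij})\nu_j u_i\,dS$

$= \int_V e_{ij}\,\delta\sigma_{ij}\,dv - \int_{S_u} (\delta\sigma_{ij})\nu_j u_i\,dS + \int_V W_c(\delta\sigma_{11}, \ldots)\,dv$

$= \delta\mathcal{V}^* + \int_V W_c(\delta\sigma_{11}, \delta\sigma_{12}, \ldots)\,dv$,

where

(11)  $W_c(\delta\sigma_{11}, \ldots) = \dfrac{1+\nu}{2E}(\delta\sigma_{ij})(\delta\sigma_{ij}) - \dfrac{\nu}{2E}(\delta\sigma_{\alpha\alpha})^2 \geq 0$.

$W_c(\delta\sigma_{11}, \ldots)$ is positive definite for infinitesimal
variations $\delta\sigma_{ij}$. Hence, when $\delta\mathcal{V}^* = 0$,

(12)  $\mathcal{V}^*(\sigma_{11} + \delta\sigma_{11}, \ldots) - \mathcal{V}^*(\sigma_{11}, \ldots) \geq 0$,

and that $\mathcal{V}^*(\sigma_{11}, \ldots)$ is a minimum is proved.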

The converse theorem reads:

Let $\mathcal{V}^*$ be the complementary energy defined by Eq. (8). If the
stress tensor field $\sigma_{ij}$ is such that $\delta\mathcal{V}^* = 0$ for
all variations of stresses $\delta\sigma_{ij}$ which satisfy the equations of
equilibrium in the body and on the boundary where surface tractions are
prescribed, then $\sigma_{ij}$ also satisfies the equations of compatibility.
In other words, the conditions of compatibility are the Euler equation for the
variational equation $\delta\mathcal{V}^* = 0$.

The proof was given by Richard V. Southwell (1936; see Bibliography 10.2)
through the application of Maxwell and Morera stress functions. We begin with
the variational equation

(13)  $\delta\mathcal{V}^* = \int_V e_{ij}\,\delta\sigma_{ij}\,dv - \int_{S_u} u_i\,\delta\overset{\nu}{T}_i\,dS = 0$.

The variations $\delta\sigma_{ij}$ are subjected to the restrictions

(14)  $(\delta\sigma_{ij})_{,j} = 0$ in $V$,

(15)  $(\delta\sigma_{ij})\nu_j = 0$ on $S_\sigma$.


To accommodate the restrictions (14) into Eq. (13), we make use of the
celebrated result that the equations of equilibrium (14) are satisfied
formally, as can be easily verified, by taking

(16)  $\delta\sigma_{11} = \phi_{22,33} + \phi_{33,22} - 2\phi_{23,23}$,
      $\delta\sigma_{22} = \phi_{33,11} + \phi_{11,33} - 2\phi_{31,31}$,
      $\delta\sigma_{33} = \phi_{11,22} + \phi_{22,11} - 2\phi_{12,12}$,
      $\delta\sigma_{23} = \phi_{31,12} + \phi_{12,13} - \phi_{11,23} - \phi_{23,11}$,
      $\delta\sigma_{31} = \phi_{12,23} + \phi_{23,21} - \phi_{22,31} - \phi_{31,22}$,
      $\delta\sigma_{12} = \phi_{23,31} + \phi_{31,32} - \phi_{33,12} - \phi_{12,33}$,

where $\phi_{ij} = \phi_{ji}$ are arbitrary stress functions. On setting
$\phi_{12} = \phi_{23} = \phi_{31} = 0$, we obtain the solutions proposed by
James Clerk Maxwell. On taking $\phi_{11} = \phi_{22} = \phi_{33} = 0$, we
obtain the solutions proposed by G. Morera.
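The statement that Eq. (16) satisfies the equilibrium equations (14) identically can be checked by direct substitution. As a worked illustration (added here, assuming only that the stress functions are smooth enough for the order of differentiation to be interchanged), the first equilibrium equation reads:

```latex
\begin{aligned}
(\delta\sigma_{1j})_{,j}
 &= \delta\sigma_{11,1} + \delta\sigma_{12,2} + \delta\sigma_{31,3} \\
 &= (\phi_{22,331} + \phi_{33,221} - 2\phi_{23,231})
  + (\phi_{23,312} + \phi_{31,322} - \phi_{33,122} - \phi_{12,332}) \\
 &\quad + (\phi_{12,233} + \phi_{23,213} - \phi_{22,313} - \phi_{31,223}) \\
 &= 0 .
\end{aligned}
```

Each stress function appears with cancelling third derivatives: the $\phi_{22}$, $\phi_{33}$, $\phi_{31}$, and $\phi_{12}$ terms cancel in pairs, and the two positive $\phi_{23,123}$ terms cancel $-2\phi_{23,123}$. The other two equilibrium equations follow by cyclic permutation of the indices.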

Let us use the Maxwell system of arbitrary stress functions for the
variations $\delta\sigma_{ij}$. Equation (13) may be written as

(17)  $\delta\mathcal{V}^* = \int_V [e_{11}(\phi_{22,33} + \phi_{33,22}) + e_{22}(\phi_{33,11} + \phi_{11,33})$
      $\quad + e_{33}(\phi_{11,22} + \phi_{22,11}) - 2e_{23}\phi_{11,23} - 2e_{31}\phi_{22,31} - 2e_{12}\phi_{33,12}]\,dv - \int_{S_u} u_i\,\delta\overset{\nu}{T}_i\,dS = 0$.

Integrating by parts twice, we obtain

(18)  $\delta\mathcal{V}^* = \int_V [(e_{22,33} + e_{33,22} - 2e_{23,23})\phi_{11}$
      $\quad + (e_{33,11} + e_{11,33} - 2e_{31,31})\phi_{22}$
      $\quad + (e_{11,22} + e_{22,11} - 2e_{12,12})\phi_{33}]\,dv$ + a surface integral $= 0$.

Inasmuch as the stress functions $\phi_{11}$, $\phi_{22}$, $\phi_{33}$ are
arbitrary in the volume $V$, the Euler equations are

(19)  $e_{22,33} + e_{33,22} - 2e_{23,23} = 0$,

etc., which are Saint-Venant’s compatibility equations (see Sec. 4.6). The
treatment of the surface integral is cumbersome, but it says only that over
$S_u$ the values of $u_i$ are prescribed and the stresses are arbitrary; the
derivation concerns a certain relationship between $\phi_{ij}$ and their
derivatives to be satisfied on the boundary.
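It may help to see why Eq. (19) is called a compatibility equation. As a short worked check (added for illustration, using the strain-displacement relation $e_{ij} = \frac{1}{2}(u_{i,j} + u_{j,i})$ of the linear theory and assuming the displacements are three times continuously differentiable), any strain field derived from a displacement field satisfies Eq. (19) identically:

```latex
\begin{aligned}
e_{22,33} + e_{33,22} - 2e_{23,23}
 &= u_{2,233} + u_{3,322} - (u_{2,3} + u_{3,2})_{,23} \\
 &= u_{2,233} + u_{3,322} - u_{2,323} - u_{3,223} \\
 &= 0 .
\end{aligned}
```

The variational argument runs in the opposite direction: stationarity of the complementary energy forces the strains computed from the stresses to satisfy these equations, so that a displacement field exists.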

Similarly, the use of the Morera system of arbitrary stress functions leads
to the other set of Saint-Venant’s compatibility equations

(20)  $e_{11,23} = -e_{23,11} + e_{31,21} + e_{12,31}$,

etc. [Eq. (4.6:4)].

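If we start with $\delta\mathcal{V}^*$ in the form

(21)  $\delta\mathcal{V}^* = 0 = \int_V \left[\dfrac{1+\nu}{E}\sigma_{ij} - \dfrac{\nu}{E}\sigma_{\alpha\alpha}\delta_{ij}\right]\delta\sigma_{ij}\,dv - \int_{S_u} u_i\,\delta\overset{\nu}{T}_i\,dS$

and introduce the stress functions, the Beltrami-Michell compatibility
equations

(22)  $\nabla^2\sigma_{ij} + \dfrac{1}{1+\nu}\theta_{,ij} + \dfrac{\nu}{1-\nu}\delta_{ij}F_{k,k} + F_{i,j} + F_{j,i} = 0$ in $V$,

where $\theta = \sigma_{\alpha\alpha}$, can be obtained directly.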

In concluding this section, we remark once more that when we consider 
the variation of the stress field of a body in equilibrium, the principle of 
virtual complementary work has broad applicability. The introduction of 
the complementary energy functional V*, however, limits the principle to 
elastic bodies. That V* actually is a minimum with respect to all admissible 
variations of stress field is established only if the complementary strain 
energy function is positive definite.
There are many fascinating applications of the minimum complementary
energy principle. In Sec. 10.11, we shall consider its application in proving
Saint-Venant’s principle in Zanaboni’s formulation.

Problem 10.2. The principle of virtual complementary work states that

(23)  $\int_V e_{ij}\,\delta\sigma_{ij}\,dv - \int_{S_u} u_i^*\,\delta\overset{\nu}{T}_i\,dS = 0$

under the restrictions that

(24)  $\delta\sigma_{ij,j} = 0$ in $V$,

(25)  $\delta\sigma_{ij}\nu_j = 0$ on $S_\sigma$,

(26)  $u_i = u_i^*$ prescribed on $S_u$, but $\delta\overset{\nu}{T}_i = \delta\sigma_{ij}\nu_j$ arbitrary on $S_u$.

Using Lagrange multipliers, we may restate this principle as

(27)  $\int_V e_{ij}\,\delta\sigma_{ij}\,dv - \int_{S_u} u_i^*\,\delta\overset{\nu}{T}_i\,dS + \int_V \lambda_i\,\delta\sigma_{ij,j}\,dv - \int_{S_\sigma} \mu_i\,\delta\sigma_{ij}\nu_j\,dS = 0$.

The six equations (24) and (25), $i = 1, 2, 3$, require six Lagrange
multipliers $\lambda_i$, $\mu_i$, which are functions of $(x_1, x_2, x_3)$.
Show that the Euler equations for Eq. (27) yield the physical interpretation

(28)  $\lambda_i = u_i$,  $\mu_i = u_i$.

10.10. VARIATIONAL FUNCTIONALS FREQUENTLY USED IN 
COMPUTATIONAL MECHANICS 

We shall introduce hybrid and mixed variational principles, which are
used extensively in finite element applications. The finite element method
is one of the most powerful numerical methods for solving linear and
nonlinear problems. The method approximates the field variables in terms of
unknown parameters in sub-regions called “elements” and then arrives at
an approximate solution for the whole domain by enforcing certain relations
among the unknowns. Variational principles are often used to establish the
relations. The details of finite element applications will be discussed in
Chapters 18-21.

Variational principles are classified as irreducible, hybrid, or mixed
on the basis of the nature of the set of unknown functions in the
functionals. If the set of unknown functions (called field variables) is
irreducible in some sense or cannot be eliminated from the functional until
the solution of the problem has been obtained, then the functional is said to
be irreducible. For example, in three dimensions, a displacement has
components $u_i$, $i = 1, 2, 3$. In most cases, none of the components can be
expressed in terms of the others and eliminated from the functional a priori.
Thus the minimum potential energy principle based on $u_i$ is irreducible. So
are the stress components $\sigma_{ij}$, and thus the complementary energy
theorem based on $\sigma_{ij}$ is also irreducible. On the other hand, a
functional of both $u_i$ and $\sigma_{ij}$ is reducible because stresses can
be expressed in terms of strains through the constitutive law and thus be
eliminated from the functional. Such a functional is said to be mixed.
Further, in finite element analysis a continuum is divided into elements. One
could choose the field variables on the boundary of the element different
from those over the domain of the element. Then the functional and the
associated variational principle are said to be hybrid.

In the subsequent discussion of mixed and hybrid functionals we shall
use interchangeably $F_i$, $b_i$ for the body force; $\partial V$, $S$ for the
boundaries of $V$; $\nu_i$, $n_i$ for the unit normal to a boundary surface;
$\partial V_u$, $S_u$ for the boundaries where displacements are prescribed;
$\partial V_\sigma$, $S_\sigma$ for the boundaries where tractions are
prescribed; $\bar{u}_i$, $u_i^*$ for the prescribed boundary displacements;
and $\bar{T}_i$, $\overset{\nu}{T}{}^*_i$ for the prescribed traction.

Reissner-Hellinger Principle. For materials satisfying Hooke’s law, the
Reissner-Hellinger principle uses the functional

(1)  $\Pi_R(\sigma_{ij}, u_i) = \int_V \left(\sigma_{ij}e_{ij} - \tfrac{1}{2}C_{ijkl}\sigma_{ij}\sigma_{kl} - b_i u_i\right)dV - \int_{\partial V_\sigma} \bar{T}_i u_i\,dS$,

where $V$ is the volume of the domain, $C_{ijkl}$ is the elastic flexibility
tensor of rank 4, $\sigma_{ij}$ and $e_{ij}$ are Cartesian stress and strain
tensors of rank 2, $u_i$ are the displacements, $b_i$ are the body force
components, and $\partial V_\sigma$ is the portion of the boundaries of $V$
over which the tractions $\bar{T}_i$ are prescribed. The strains are in terms
of the displacements

(2)  $e_{ij} = (u_{i,j} + u_{j,i})/2$,

and the displacements are prescribed over $\partial V_u$
($= \partial V - \partial V_\sigma$) so that

(3)  $u_i = \bar{u}_i$,

in which $\bar{u}_i$ are known functions. The independent field variables
$\sigma_{ij}$ and $u_i$ are limited by the admissibility conditions that
either the displacements or the traction $T_i$ ($= n_j\sigma_{ji}$) are $C^0$
continuous† over any internal surfaces and that $u_i$ satisfy Eq. (3). Since
Eq. (3) is part of the admissibility requirements, it must be satisfied a
priori and is called a rigid condition.
The first variation of $\Pi_R$ with respect to the field variables is

$\delta\Pi_R(\sigma_{ij}, u_i) = \int_V \sigma_{ij}\,\delta e_{ij}\,dV - \int_V b_i\,\delta u_i\,dV + \int_V (e_{ij} - C_{ijkl}\sigma_{kl})\,\delta\sigma_{ij}\,dV - \int_{\partial V_\sigma} \bar{T}_i\,\delta u_i\,dS$.

The necessary condition for $\Pi_R$ to be stationary is that $\delta\Pi_R$
vanishes for all admissible variations of the independent field variables.
Using the Gauss theorem for the first integral yields the stationarity
condition

(4)  $\delta\Pi_R(\sigma_{ij}, u_i) = -\int_V (\sigma_{ji,j} + b_i)\,\delta u_i\,dV + \int_V (e_{ij} - C_{ijkl}\sigma_{kl})\,\delta\sigma_{ij}\,dV + \int_{\partial V_\sigma} (n_j\sigma_{ji} - \bar{T}_i)\,\delta u_i\,dS = 0$,

where the $n$’s are the components of a unit normal to the element
boundaries. Since $\delta\Pi_R$ must be zero for arbitrary admissible
$\delta u_i$ and $\delta\sigma_{ij}$, the integrands of all three integrals
must vanish. Vanishing of the three integrals implies, respectively,

(5)  $\sigma_{ji,j} + b_i = 0$   equations of equilibrium in $V$,

(6)  $e_{ij} = C_{ijkl}\sigma_{kl}$   strain-stress relations in $V$,

(7)  $n_j\sigma_{ji} = \bar{T}_i$   traction boundary conditions on $\partial V_\sigma$.

The functional $\Pi_R$ has both stresses and displacements as independent
fields. Since the stresses can be expressed readily in terms of the
displacements from Eq. (6), the functional is not irreducible. The variational
formulation based on such a functional is a mixed principle.
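A one-dimensional example may make the mixed principle concrete. The sketch below is an illustration only and is not taken from the text: it assumes a single bar element of unit cross-sectional area, fixed at $x = 0$, with uniform body force `b` and end traction `t_bar`, and it uses the trial fields $u = a x$ and $\sigma = s$ (constant). All names and values are hypothetical. Setting the derivatives of the functional with respect to `s` and `a` to zero reproduces the strain-stress relation and an averaged equilibrium condition.

```python
# Sketch: one-element Reissner-Hellinger functional for a bar, fixed at x = 0.
# Trial fields (assumed for illustration): u = a*x, sigma = s (constant).
# E, L, b, t_bar are illustrative values.

E, L, b, t_bar = 100.0, 2.0, 3.0, 5.0


def pi_r(s, a):
    """Pi_R = int_0^L (s*e - s**2/(2E) - b*u) dx - t_bar*u(L), with e = a."""
    return s * a * L - s**2 * L / (2.0 * E) - b * a * L**2 / 2.0 - t_bar * a * L


# Stationarity conditions, linear in (s, a):
#   dPi/ds = a*L - s*L/E              = 0  (strain-stress relation, Eq. (6))
#   dPi/da = s*L - b*L**2/2 - t_bar*L = 0  (equilibrium and traction condition)
s_star = t_bar + b * L / 2.0
a_star = s_star / E


def gradient(s, a, h=1.0e-6):
    """Central-difference gradient of pi_r with respect to (s, a)."""
    d_s = (pi_r(s + h, a) - pi_r(s - h, a)) / (2.0 * h)
    d_a = (pi_r(s, a + h) - pi_r(s, a - h)) / (2.0 * h)
    return d_s, d_a


grad_s, grad_a = gradient(s_star, a_star)
print(s_star, a_star, grad_s, grad_a)
```

With these trial fields the stationary stress equals the mean of the exact stress $\bar{T} + b(L - x)$ over the bar. The stationary point is a saddle rather than a minimum, since the functional is concave in `s`.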

If the domain $V$ is divided into subdomains, called elements, for
admissible $\sigma_{ij}$, $u_i$ we can write $\Pi_R$ in the form (Pian and
Tong 1969)

(8)  $\Pi_R = \sum_{\text{all elements}} \Pi_{Re}(\sigma_{ij}, u_i)$,

where $\Pi_{Re}$ is the corresponding functional for an element

(9a)  $\Pi_{Re}(\sigma_{ij}, u_i) = \int_{V_e} \left(\sigma_{ij}e_{ij} - \tfrac{1}{2}C_{ijkl}\sigma_{ij}\sigma_{kl} - b_i u_i\right)dV - \int_{\partial V_{\sigma e}} \bar{T}_i u_i\,dS$

with $V_e$ and $\partial V_{\sigma e}$ being, respectively, the element volume
and the portion of element boundaries over which traction is prescribed. For
isotropic materials, Eq. (9a) reduces to

(9b)  $\Pi_{Re}(\sigma_{ij}, u_i) = \int_{V_e} \left[\sigma_{ij}e_{ij} - \dfrac{1}{2}\left(\dfrac{1}{2G}\sigma_{ij}\sigma_{ij} + \dfrac{2G - 3K}{18GK}\sigma_{kk}^2\right) - b_i u_i\right]dV - \int_{\partial V_{\sigma e}} \bar{T}_i u_i\,dS$.

† A function $u(x, y, z)$, which is continuous in a domain $V$, is said to be
continuous of the order of zero, and is denoted by $C^0$. If $u$ and its first
partial derivatives are all continuous in $V$, then $u$ is said to be
continuous of the order of 1, and denoted by $C^1$. If $u$ and all of its
partial derivatives up to order $n$ are continuous in $V$, then $u$ is said to
be $C^n$ continuous.

In finite element applications to be discussed in later chapters, one assumes
admissible $u$’s and $\sigma$’s in terms of unknown parameters within each
element and uses the stationarity condition Eq. (4) to establish the equations
for the unknown parameters. Note that $(2G - 3K)/(9K) = -\nu/(1 + \nu)$.
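The identity quoted at the end of the paragraph follows from the standard isotropic relations $G = E/[2(1+\nu)]$ and $K = E/[3(1-2\nu)]$. The short sketch below checks it numerically for a few Poisson's ratios; the function names and the sample values are assumptions made for illustration.

```python
# Sketch: check (2G - 3K)/(9K) = -nu/(1 + nu) using the standard isotropic
# relations G = E/(2(1+nu)) and K = E/(3(1-2nu)). Values are illustrative.


def moduli(E, nu):
    """Shear modulus G and bulk modulus K from Young's modulus and Poisson's ratio."""
    return E / (2.0 * (1.0 + nu)), E / (3.0 * (1.0 - 2.0 * nu))


def identity_gap(E, nu):
    """Difference between the two sides of the identity."""
    G, K = moduli(E, nu)
    return (2.0 * G - 3.0 * K) / (9.0 * K) + nu / (1.0 + nu)


gaps = [identity_gap(70.0e3, nu) for nu in (0.0, 0.25, 0.3, 0.45, 0.499)]
print(gaps)  # every entry is zero up to rounding
```

The right-hand side stays finite as $\nu \to 0.5$, where it tends to $-1/3$, even though $K$ itself grows without bound.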


A Four-Field Hybrid Variational Principle. One can relax the
admissibility requirements at the element boundaries $\partial V_e$ by
introducing Lagrangian multipliers $\tilde{u}_i$, $\tilde{T}_i$ along
$\partial V_e$ of each element (Tong and Pian 1969, Pian and Tong 1972) to
modify $\Pi_{Re}$ in Eq. (9) such that

(10)  $\Pi_H = \sum_{\text{all elements}} \Pi_{He}(\sigma_{ij}, u_i, \tilde{u}_i, \tilde{T}_i)$,

where

$\Pi_{He}(\sigma_{ij}, u_i, \tilde{u}_i, \tilde{T}_i) = \int_{\partial V_e} \tilde{T}_i(\tilde{u}_i - u_i)\,dS + \int_{V_e} \left(\sigma_{ij}e_{ij} - \tfrac{1}{2}C_{ijkl}\sigma_{ij}\sigma_{kl} - b_i u_i\right)dV - \int_{\partial V_{\sigma e}} \bar{T}_i\tilde{u}_i\,dS$.

The admissibility conditions for the Lagrangian multipliers are: $\tilde{u}_i$
are the same for any two adjacent elements along their common boundaries and
satisfy the prescribed condition Eq. (3), i.e., $\tilde{u}_i = \bar{u}_i$. The
functional $\Pi_H$ has four independent fields $\sigma_{ij}$, $u_i$,
$\tilde{u}_i$, $\tilde{T}_i$, of which $\sigma_{ij}$, $u_i$ are defined over
$V_e$, and $\tilde{u}_i$, $\tilde{T}_i$ are defined on $\partial V_e$ only. As
the field variables over $\partial V_e$ differ from those inside $V_e$,
$\Pi_H$ is a hybrid functional. The variational formulation based on such a
functional is called a hybrid principle.

The stationarity condition of $\Pi_H$ is

(11)  $\delta\Pi_H = \sum_{\text{all elements}} \delta\Pi_{He} = 0$,

where

(11a)  $\delta\Pi_{He} = -\int_{V_e} (\sigma_{ji,j} + b_i)\,\delta u_i\,dV + \int_{V_e} (e_{ij} - C_{ijkl}\sigma_{kl})\,\delta\sigma_{ij}\,dV$
       $\quad + \int_{\partial V_e} (\tilde{u}_i - u_i)\,\delta\tilde{T}_i\,dS + \int_{\partial V_e} (n_j\sigma_{ji} - \tilde{T}_i)\,\delta u_i\,dS$
       $\quad + \int_{\partial V_{\sigma e}} (\tilde{T}_i - \bar{T}_i)\,\delta\tilde{u}_i\,dS + \int_{\partial V_e - \partial V_{\sigma e}} \tilde{T}_i\,\delta\tilde{u}_i\,dS$.

Since $\sigma_{ij}$, $u_i$ over $V_e$, $\tilde{T}_i$ on $\partial V_e$, and
$\tilde{u}_i$ on $\partial V_{\sigma e}$ of all elements are independent of
each other, Eq. (11) requires that the first five integrals above are zero for
each element. The vanishing of the first two integrals implies the equations
of equilibrium and the strain-stress relations within each element. The third
through fifth integrals give, respectively,

(12)  $\tilde{u}_i = u_i$   on $\partial V_e$,

(13)  $\tilde{T}_i = n_j\sigma_{ji}$   on $\partial V_e$,

(14)  $\tilde{T}_i = \bar{T}_i$   on $\partial V_{\sigma e}$.

Since $\delta\tilde{u}_i$ are common for elements along their common
boundaries, the last integral of Eq. (11a) requires that

(14a)  $\sum_{\text{all elements}} \int_{\partial V_e - \partial V_{\sigma e}} \tilde{T}_i\,\delta\tilde{u}_i\,dS = 0$,  or  $\int_{S_e} [(\tilde{T}_i)_1 + (\tilde{T}_i)_2]\,\delta\tilde{u}_i\,dS = 0$,

where $S_e$ denotes the common boundary of two adjacent elements and
$(\tilde{T}_i)_1$, $(\tilde{T}_i)_2$ are the tractions of the two elements.
Equation (14a) implies that the traction $\tilde{T}_i$ is continuous across
inter-element boundaries and is called the inter-element equilibrium
condition.

One may write Eq. (10) in a slightly different form

$\Pi_{H1} = \sum_{\text{all elements}} \Pi_{H1e}(e_{ij}, u_i, \tilde{u}_i, \tilde{T}_i)$,

where

$\Pi_{H1e}(e_{ij}, u_i, \tilde{u}_i, \tilde{T}_i) = \int_{\partial V_e} \tilde{T}_i(\tilde{u}_i - u_i)\,dS + \int_{V_e} \left[D_{ijkl}e_{kl}\dfrac{u_{i,j} + u_{j,i}}{2} - \dfrac{1}{2}D_{ijkl}e_{ij}e_{kl} - b_i u_i\right]dV - \int_{\partial V_{\sigma e}} \bar{T}_i\tilde{u}_i\,dS$,

in which $e_{ij}$, $u_i$, $\tilde{u}_i$, $\tilde{T}_i$ are the four
independent fields and $D_{ijkl}$ ($= C_{ijkl}^{-1}$) is the elastic
coefficient or modulus tensor, the inverse of the elastic flexibility tensor
$C_{ijkl}$. The variational formulation associated with $\Pi_{H1}$ is called
the hybrid strain principle.

Hybrid Stress Principle. If the stresses satisfy the equilibrium equations
Eq. (5) and the tractions $\tilde{T}_i$ satisfy Eq. (13) a priori, one can
eliminate the $u$’s and $\tilde{T}$’s from Eq. (10) and derive a new
functional (Tong and Pian 1969)

(15)  $\Pi_{HS} = \sum_{\text{all elements}} \Pi_{HSe}(\sigma_{ij}, \tilde{u}_i)$,

where

$\Pi_{HSe}(\sigma_{ij}, \tilde{u}_i) = \int_{\partial V_e} n_j\sigma_{ji}\tilde{u}_i\,dS - \dfrac{1}{2}\int_{V_e} C_{ijkl}\sigma_{ij}\sigma_{kl}\,dV - \int_{\partial V_{\sigma e}} \bar{T}_i\tilde{u}_i\,dS$.

$\Pi_{HS}$ is a two-field hybrid functional, which is the basis for the hybrid
stress finite element model proposed by Pian (1964). The admissibility
conditions require that: $\tilde{u}_i$ are common along the common boundaries
of any two adjacent elements and satisfy the prescribed displacement condition
Eq. (3); and the stresses satisfy the equilibrium equations Eq. (5). The first
variation of $\Pi_{HSe}$ is

$\delta\Pi_{HSe} = -\int_{V_e} C_{ijkl}\sigma_{kl}\,\delta\sigma_{ij}\,dV + \int_{\partial V_e} n_j\tilde{u}_i\,\delta\sigma_{ji}\,dS + \int_{\partial V_{\sigma e}} (n_j\sigma_{ji} - \bar{T}_i)\,\delta\tilde{u}_i\,dS + \int_{\partial V_e - \partial V_{\sigma e}} n_j\sigma_{ji}\,\delta\tilde{u}_i\,dS$.

Thus the stationarity condition $\delta\Pi_{HS} = 0$ requires

$-\int_{V_e} C_{ijkl}\sigma_{kl}\,\delta\sigma_{ij}\,dV + \int_{\partial V_e} n_j\tilde{u}_i\,\delta\sigma_{ji}\,dS = 0$,

$\int_{\partial V_{\sigma e}} (n_j\sigma_{ji} - \bar{T}_i)\,\delta\tilde{u}_i\,dS = 0$,

$\sum_{\text{all elements}} \int_{\partial V_e - \partial V_{\sigma e}} n_j\sigma_{ji}\,\delta\tilde{u}_i\,dS = 0$.

The second and third equalities give the traction boundary condition (7)
on $\partial V_{\sigma e}$ and the inter-element equilibrium condition (14a),
respectively. From Sec. 10.9, the first equality gives the Beltrami-Michell
compatibility equations as the Euler equation with $\tilde{u}_i$ as the
element boundary displacements for every element. The compatibility equations
for isotropic materials are given in Eq. (10.9:22). An alternative proof of
stress compatibility is as follows. For given $\tilde{u}_i$, let $u_i$ be the
elastic solution in $V_e$ with $u_i = \tilde{u}_i$ on $\partial V_e$. Then the
first equality above can be written as

$\int_{V_e} \left[\tfrac{1}{2}(u_{i,j} + u_{j,i}) - C_{ijkl}\sigma_{kl}\right]\delta\sigma_{ij}\,dV = 0$.

For arbitrary equilibrium $\delta\sigma_{ij}$, the stresses $\sigma_{kl}$ must
satisfy the strain-stress relations

$e_{ij} = \tfrac{1}{2}(u_{i,j} + u_{j,i}) = C_{ijkl}\sigma_{kl}$.

Since $e_{ij}$ are derived from the displacements $u_i$, it follows that
$\sigma_{kl}$ satisfy the Beltrami-Michell compatibility equations. The
variational formulation associated with $\Pi_{HS}$ is called the hybrid stress
principle, which forms the basis for the hybrid stress finite element model.

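Hybrid Displacement Principle. If the stresses satisfy the strain-stress
relations Eq. (6), one can express the stresses in terms of the strains or
displacements and obtain the stress-strain relations

(16)  $\sigma_{ij} = D_{ijkl}e_{kl} = D_{ijkl}(u_{k,l} + u_{l,k})/2$.

One can then eliminate the $\sigma$’s from Eq. (10) and derive a three-field
hybrid functional (Tong 1970)

(17)  $\Pi_{HD} = \sum_{\text{all elements}} \Pi_{HDe}(u_i, \tilde{u}_i, \tilde{T}_i)$,

where

$\Pi_{HDe}(u_i, \tilde{u}_i, \tilde{T}_i) = \int_{\partial V_e} \tilde{T}_i(\tilde{u}_i - u_i)\,dS + \int_{V_e} \left(\tfrac{1}{2}D_{ijkl}e_{ij}e_{kl} - b_i u_i\right)dV - \int_{\partial V_{\sigma e}} \bar{T}_i\tilde{u}_i\,dS$,

in which the $e$’s are in terms of the $u$’s as given in Eq. (2). The
admissibility conditions require that $\tilde{u}_i$ are common along the
common boundary of any two adjacent elements and satisfy the prescribed
displacement condition Eq. (3). The stationarity condition of $\Pi_{HD}$ is

$\delta\Pi_{HD}(u_i, \tilde{u}_i, \tilde{T}_i) = \sum_{\text{all elements}} \Big\{ -\int_{V_e} [(D_{ijkl}e_{kl})_{,j} + b_i]\,\delta u_i\,dV$
$\quad + \int_{\partial V_e} (n_j D_{ijkl}e_{kl} - \tilde{T}_i)\,\delta u_i\,dS + \int_{\partial V_e} (\tilde{u}_i - u_i)\,\delta\tilde{T}_i\,dS$
$\quad + \int_{\partial V_{\sigma e}} (\tilde{T}_i - \bar{T}_i)\,\delta\tilde{u}_i\,dS + \int_{\partial V_e - \partial V_{\sigma e}} \tilde{T}_i\,\delta\tilde{u}_i\,dS \Big\} = 0$.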

The vanishing of the first two integrals gives

(18)  $(D_{ijkl}e_{kl})_{,j} + b_i = 0$ in $V_e$,

(19)  $\tilde{T}_i = n_j D_{ijkl}e_{kl}$ on $\partial V_e$,

which are the equilibrium equations and the stress-traction relations
Eqs. (5) and (13), respectively, with the stresses being in terms of strains
as given by Eq. (16). Equation (18) is the Navier equation. The zero
conditions of the last three integrals of $\delta\Pi_{HD}$ imply the matching
conditions Eq. (12), the prescribed boundary traction conditions Eq. (14), and
the continuity of $\tilde{T}_i$ across inter-element boundaries. The
variational formulation associated with the functional above is called the
hybrid displacement principle, which forms the basis for the hybrid
displacement finite element model.

The functional $\Pi_{HD}$ can be simplified if one requires that the $u$’s
satisfy the Navier equation and that the $\tilde{T}$’s satisfy Eq. (19) a
priori. Applying the Gauss theorem, one obtains a two-field hybrid functional

(20)  $\Pi_{HD1} = \sum_{\text{all elements}} \Pi_{HD1e}(u_i, \tilde{u}_i)$,

where

$\Pi_{HD1e}(u_i, \tilde{u}_i) = \int_{\partial V_e} n_j D_{ijkl}e_{kl}\tilde{u}_i\,dS - \dfrac{1}{2}\int_{V_e} D_{ijkl}e_{ij}e_{kl}\,dV - \int_{\partial V_{\sigma e}} \bar{T}_i\tilde{u}_i\,dS$,

in which the $e$’s are strains in terms of the displacements. If the body
force is zero, $\Pi_{HD1e}$ reduces to

$\Pi_{HD1e}(u_i, \tilde{u}_i) = \int_{\partial V_e} n_j D_{ijkl}e_{kl}\left(\tilde{u}_i - \dfrac{u_i}{2}\right)dS - \int_{\partial V_{\sigma e}} \bar{T}_i\tilde{u}_i\,dS$,

which involves integration over $\partial V_e$ only. One can show that the
stationarity condition of $\Pi_{HD1}$ gives the matching conditions Eq. (12),
the prescribed boundary traction conditions Eq. (14), and the continuity
of $n_j D_{ijkl}e_{kl}$ ($= \tilde{T}_i$) across inter-element boundaries. The
last formulation is very useful for problems involving singularities in finite
element applications.

Pian (1995, 1998) reviewed various variational principles of mixed/hybrid
formulations. Interested readers are referred to the published literature.

Incompressible or Nearly Incompressible Materials. If the material
is incompressible, one cannot use the principle of minimum potential energy
directly because certain components of the elastic modulus tensor $D_{ijkl}$
become infinite. However, the Reissner or hybrid stress functional is still
valid, which can be seen from Eq. (9b) for isotropic materials as the bulk
modulus $K \to \infty$.

We can establish a modified potential energy principle valid for
incompressible or nearly incompressible materials. Consider an isotropic
material with the elastic modulus tensor being

$D_{ijkl} = \left(K - \tfrac{2}{3}G\right)\delta_{ij}\delta_{kl} + G(\delta_{ik}\delta_{jl} + \delta_{il}\delta_{jk}) = 2G\left(\dfrac{\nu\,\delta_{ij}\delta_{kl}}{1 - 2\nu} + \dfrac{\delta_{ik}\delta_{jl} + \delta_{il}\delta_{jk}}{2}\right)$.

Here $K$ is the bulk modulus, which becomes infinite if the material is
incompressible, i.e., when $\nu = 0.5$. Using the contact transformation,

$\dfrac{1}{2}D_{ijkl}e_{ij}e_{kl} + p\,e_{kk} = G e'_{ij}e'_{ij} - \dfrac{p^2}{2K}$,

where $p$ is the hydrostatic pressure and $e'_{ij}$
($= e_{ij} - \delta_{ij}e_{kk}/3$) is the deviatoric strain, we can write the
strain energy density as

(21)  $\dfrac{1}{2}D_{ijkl}e_{ij}e_{kl} = G e'_{ij}e'_{ij} - p\,e_{kk} - \dfrac{p^2}{2K}$,

which is a function of both the $u$’s and $p$. The deviatoric energy
($= G e'_{ij}e'_{ij}$) is bounded for all real materials.
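Equation (21) can be verified numerically for an arbitrary symmetric strain. The sketch below is illustrative and not part of the original text: it assumes the pressure takes the value $p = -K e_{kk}$ (the hydrostatic pressure of the isotropic law), and the moduli, the strain state, and the function names are hypothetical choices.

```python
# Sketch: numerical check of Eq. (21) for an isotropic material, taking
# p = -K * e_kk (the hydrostatic pressure of the isotropic law).
# G, K and the strain state are illustrative values.

G, K = 80.0, 160.0


def delta(i, j):
    """Kronecker delta."""
    return 1.0 if i == j else 0.0


def modulus(i, j, k, l):
    """D_ijkl = (K - 2G/3) d_ij d_kl + G (d_ik d_jl + d_il d_jk)."""
    return ((K - 2.0 * G / 3.0) * delta(i, j) * delta(k, l)
            + G * (delta(i, k) * delta(j, l) + delta(i, l) * delta(j, k)))


def strain_energy(e):
    """Left side of Eq. (21): (1/2) D_ijkl e_ij e_kl."""
    r = range(3)
    return 0.5 * sum(modulus(i, j, k, l) * e[i][j] * e[k][l]
                     for i in r for j in r for k in r for l in r)


def mixed_form(e, p):
    """Right side of Eq. (21): G e'_ij e'_ij - p e_kk - p**2 / (2K)."""
    e_kk = e[0][0] + e[1][1] + e[2][2]
    dev = [[e[i][j] - delta(i, j) * e_kk / 3.0 for j in range(3)]
           for i in range(3)]
    dev_sq = sum(dev[i][j] ** 2 for i in range(3) for j in range(3))
    return G * dev_sq - p * e_kk - p**2 / (2.0 * K)


strain = [[0.010, 0.002, -0.001], [0.002, -0.004, 0.003], [-0.001, 0.003, 0.006]]
pressure = -K * (strain[0][0] + strain[1][1] + strain[2][2])
lhs = strain_energy(strain)
rhs = mixed_form(strain, pressure)
print(lhs, rhs)  # the two values agree up to rounding
```

The bulk modulus enters the right-hand side only through $p^2/(2K)$, which vanishes as $K \to \infty$ for bounded $p$; this is what makes the mixed form usable in the incompressible limit.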

We can now construct a mixed functional with both the $u$’s and $p$ as
independent field variables. The functional is suitable for applications to
incompressible or nearly incompressible materials (Tong 1969):

(22)  $\Pi_{Me}(u_i, p) = \int_{V_e} \left(G e'_{ij}e'_{ij} - p\,e_{kk} - \dfrac{p^2}{2K} - b_i u_i\right)dV - \int_{\partial V_{\sigma e}} \bar{T}_i u_i\,dS$.

Penalty Functional. A simple way to relax constraints in the integral
formulation is to introduce a penalty function in the functional. For
instance, if a material is incompressible, one can require that

(23)  $e_{kk} = 0$

as a constraint equation. Equation (22) becomes

(24)  $\Pi_{Me}(u_i) = \int_{V_e} (G e'_{ij}e'_{ij} - b_i u_i)\,dV - \int_{\partial V_{\sigma e}} \bar{T}_i u_i\,dS$

with Eq. (23) as the constraint condition. One can relax the constraint by
using the functional

(25)  $\Pi_{Me}(u_i, p) = \int_{V_e} (G e'_{ij}e'_{ij} + p\,e_{kk} - b_i u_i)\,dV - \int_{\partial V_{\sigma e}} \bar{T}_i u_i\,dS$,

or introducing a penalty function and expressing the functional in the form

(26)  $\Pi_{Mc}(u_i) = \int_{V_e} \left(G e'_{ij}e'_{ij} + \dfrac{c}{2}e_{kk}e_{jj} - b_i u_i\right)dV - \int_{\partial V_{\sigma e}} \bar{T}_i u_i\,dS$,

where $c$ is an assumed positive constant of the order $G$ or larger. The term
associated with $c$ is the penalty function, which enforces incompressibility
when $c \to \infty$.
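How a penalty term enforces a constraint as $c$ grows can be seen in a two-variable toy problem. The sketch below is only an analogy and is not the functional of the text: it assumes a quadratic "energy" $G(x_1^2 + x_2^2) - f x_1$ with the constraint $x_1 + x_2 = 0$ standing in for $e_{kk} = 0$, and adds the penalty $(c/2)(x_1 + x_2)^2$. The names `G`, `f`, and `penalty_solution` are hypothetical.

```python
# Sketch: penalty enforcement of a constraint in a two-variable toy problem.
# Minimize G*(x1**2 + x2**2) - f*x1 subject to x1 + x2 = 0, which loosely
# mimics e_kk = 0. The toy energy, G and f are assumptions for illustration.

G, f = 1.0, 4.0


def penalty_solution(c):
    """Minimizer of G*(x1**2 + x2**2) + (c/2)*(x1 + x2)**2 - f*x1.

    Stationarity gives the linear system
        (2G + c) x1 + c x2 = f
        c x1 + (2G + c) x2 = 0
    solved here by Cramer's rule.
    """
    a11, a12, a22 = 2.0 * G + c, c, 2.0 * G + c
    det = a11 * a22 - a12 * a12
    x1 = f * a22 / det
    x2 = -f * a12 / det
    return x1, x2


for c in (1.0, 10.0, 100.0, 1.0e4):
    x1, x2 = penalty_solution(c)
    print(c, x1, x2, x1 + x2)  # the constraint violation x1 + x2 shrinks like 1/c
```

In this toy problem the constraint violation is $f/[2(G + c)]$, and the solution tends to the exactly constrained minimizer $x_1 = -x_2 = f/(4G)$ as $c \to \infty$. In practice a very large penalty constant also worsens the conditioning of the linear system, which is one reason the text suggests a value of the order of $G$ or larger rather than an arbitrarily large one.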

10.11. SAINT-VENANT’S PRINCIPLE 

In 1855, Barré de Saint-Venant enunciated the “principle of the elastic
equivalence of statically equipollent systems of loads.” According to this
principle, the strains that are produced in a body by the application, to a
small part of its surface, of a system of forces statically equivalent to zero
force and zero couple, are of negligible magnitude at distances which are
large compared with the linear dimensions of the part. When this principle
is applied to the problem of torsion of a long shaft due to a couple applied
at its ends, it states that the shear stress distribution in the shaft at a
distance from the ends large compared with the cross-sectional dimension
of the shaft will be practically independent of the exact distribution of the
surface tractions of which the couple is the resultant. Such a principle is
nearly always applied, consciously or unconsciously, when we try to simplify
or idealize a problem in mathematical physics. It is used, for example, in
devising a simple tension test for a material, when we clamp the ends of a
test specimen in the jaws of a testing machine and assume that the action on
the central part of the bar is nearly the same as if the forces were uniformly
applied at the ends.

The justification of the principle is largely empirical and, as such, its 
interpretation is not entirely clear. 

One possible way to formulate Saint-Venant’s principle with mathematical
precision is to state the principle in a certain sense of average as follows
(Zanaboni, 1937; Locatelli, 1940, 1941; see Bibliography 10.3). Consider a
body as shown in Fig. 10.11:1. A system of forces $P$ in static equilibrium
(with zero resultant force and zero resultant couple) is applied to a region
of the body enclosed in a small sphere $B$. Otherwise the body is free. Let
$S'$ and $S''$ be two arbitrary nonintersecting cross sections, both outside
of $B$, with $S''$ farther away from $B$ than $S'$. Due to the system of loads
$P$, stresses are induced in the body. If we know these stresses, we can
calculate the tractions acting on the surfaces $S'$ and $S''$. Let the body be
considered as severed into two parts at $S'$, and let the system of surface
tractions acting on the surface $S'$ be denoted by $R'$, which is surely a
system of forces in equilibrium (see Fig. 10.11:2). Then a convenient measure
of the magnitude of the tractions $R'$ is the total strain energy that would
be induced in the two parts should they be loaded by $R'$ alone. Let this
strain energy be denoted by $U_{R'}$. We have,

$U_{R'} = \int_V W(\sigma'_{ij})\,dv$,

where $W$ is the strain energy density function, and the stresses
$\sigma'_{ij}$ correspond to the loading system $R'$. Similarly, let the
magnitude of the stresses at the section $S''$ be measured by the strain
energy $U_{R''}$, which would have been induced by the tractions $R''$ acting
on the surface $S''$ over the two parts of the body severed at $S''$. Both
$U_{R'}$ and $U_{R''}$ are positive quantities and they vanish only if $R'$,
$R''$ vanish identically. Now we shall formulate Saint-Venant’s principle in
the following form (Zanaboni, 1937).

Fig. 10.11:1.    Fig. 10.11:2.

Let $S'$ and $S''$ be two nonintersecting sections both outside a sphere $B$.
If the section $S''$ lies at a greater distance than the section $S'$ from the
sphere $B$ in which a system of self-equilibrating forces $P$ acts on the
body, then

(1)  $U_{R''} < U_{R'}$.

In this form, the diminishing influence of the self-equilibrating system
of loading $P$ as the distance from $B$ increases is expressed by the
functional $U_R$, which is a special measure of the stresses induced at any
section outside $B$. The reason for the choice of $U_R$ as a measure is its
positive definite character and the simplicity with which the theorem can be
proved. Further sharpening of the principle will be discussed later.

In order to prove the Theorem (1), we first derive an auxiliary principle.
Let a self-equilibrating system of forces $P$ be applied to a limited region
$B$ at the surface of an otherwise free elastic body $C_1$ (Fig. 10.11:3). Let
$U_1$ be the strain energy produced by $P$ in $C_1$. Let us now consider an
enlarged body $C_1 + C_2$ by affixing to $C_1$ an additional body $C_2$ across
a surface $S$ which does not intersect the region $B$. When $P$ is applied to
the enlarged body $C_1 + C_2$, the strain energy induced is denoted by
$U_{1+2}$. Then the lemma states that

(2)  $U_{1+2} < U_1$.

Fig. 10.11:3.

Proof of the Lemma. To compute $U_{1+2}$, we imagine that the stresses
in the enlarged body $C_1 + C_2$ are built up by the following steps (see
Fig. 10.11:3). First the load $P$ is applied to $C_1$. The face $S_1$ of $C_1$
is deformed. Next a system of surface tractions $R$ is applied to $C_1$ and
$C_2$ on the surfaces of separation, $S_1$ and $S_2$. $R$ will be so chosen
that the deformed surfaces $S_1$ and $S_2$ fit each other exactly, so that
displacements of material points in $C_1$ and $C_2$ are continuous as well as
the stresses. Now $C_1$ and $C_2$ can be brought together and welded, and $S$
becomes merely an interface. The result is the same if $C_1$ and $C_2$ were
joined in the unloaded state and the combined body $C_1 + C_2$ is loaded
by $P$.

The strain energy $U_{1+2}$ is the sum of the work done by the forces in the
above stages. In the first stage, the work done by $P$ is $U_1$. In the second
stage, the work done by $R$ on $C_2$ is $U_{R2}$; the work done by $R$ on
$C_1$ consists of two parts, $U_{R1}$ if $C_1$ were free, and $U_{PR}$, the
work done by the system of loads $P$ due to the deformation caused by $R$.
Hence,

(3)  $U_{1+2} = U_1 + U_{R1} + U_{R2} + U_{PR}$.

Now the system of forces $R$ represents the internal normal and shear stresses
acting on the interface $S$ of the body $C_1 + C_2$. It is therefore
determined by the minimum complementary energy theorem. Consider a special
variation of stresses in which all the actual forces $R$ are varied in the
ratio $1 : (1 + \epsilon)$, where $\epsilon$ may be positive or negative. The
work $U_{R1}$ will be changed to $(1 + \epsilon)^2 U_{R1}$, because the load
and deformation will both be changed by a factor $(1 + \epsilon)$. Similarly,
$U_{R2}$ is changed to $(1 + \epsilon)^2 U_{R2}$. But $U_{PR}$ is only changed
to $(1 + \epsilon)U_{PR}$, because the load $P$ is fixed, while the
deformation is varied by a factor $(1 + \epsilon)$. Hence, $U_{1+2}$ is
changed to

(4)  $U'_{1+2} = U_1 + (1 + \epsilon)^2 U_{R1} + (1 + \epsilon)^2 U_{R2} + (1 + \epsilon)U_{PR}$.

The difference between Eqs. (4) and (3) is

$\Delta U_{1+2} = \epsilon(2U_{R1} + 2U_{R2} + U_{PR}) + \epsilon^2(U_{R1} + U_{R2})$.

For $U_{1+2}$ to be a minimum, $\Delta U_{1+2}$ must be positive regardless of
the sign of $\epsilon$. This is satisfied if

(5)  $2U_{R1} + 2U_{R2} + U_{PR} = 0$.

On substituting Eq. (5) into Eq. (3), we obtain

(6)  $U_{1+2} = U_1 - (U_{R1} + U_{R2})$.

Since $U_{R1}$ and $U_{R2}$ are positive definite, we see that Lemma (2) is
proved.
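The substitution that leads from Eqs. (3) and (5) to Eq. (6) is short enough to write out. The following worked step is added for illustration and uses only the quantities already defined in the proof:

```latex
\begin{aligned}
U_{PR} &= -2\,(U_{R1} + U_{R2}) && \text{from Eq. (5)}, \\
U_{1+2} &= U_1 + U_{R1} + U_{R2} - 2\,(U_{R1} + U_{R2})
         = U_1 - (U_{R1} + U_{R2}) && \text{from Eq. (3)}, \\
\Delta U_{1+2} &= \epsilon^2\,(U_{R1} + U_{R2}) \;\geq\; 0
         && \text{for either sign of } \epsilon .
\end{aligned}
```

The last line confirms that, once the linear term in $\epsilon$ vanishes, the remaining change is non-negative, so the condition (5) is indeed the one that makes $U_{1+2}$ a minimum with respect to this scaling of $R$.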

Proof of Zanaboni theorem. Now we shall prove Saint-Venant’s
principle embodied in Eq. (1). Consider an elastic body consisting of three
parts $C_1 + C_2 + C_3$ loaded by $P$ in $B$, as shown in Fig. 10.11:1. Let
this body be regarded first as a result of adjoining $C_2 + C_3$ to $C_1$ with
an interface force system $R'$, and then as a result of adjoining $C_3$ to
$C_1 + C_2$ with an interface force $R''$. We have, by repeated use of
Eq. (6),

$U_{1+(2+3)} = U_1 - (U_{R'1} + U_{R'(2+3)})$,

$U_{(1+2)+3} = U_{1+2} - (U_{R''(1+2)} + U_{R''3})$
$\quad = U_1 - (U_{R1} + U_{R2}) - (U_{R''(1+2)} + U_{R''3})$.

Equating these expressions, we obtain

$U_{R'1} + U_{R'(2+3)} = U_{R1} + U_{R2} + U_{R''(1+2)} + U_{R''3}$,

or, since $U_{R1}$ and $U_{R2}$ are essentially positive quantities,

(7)  $U_{R'1} + U_{R'(2+3)} > U_{R''(1+2)} + U_{R''3}$.

This is Eq. (1), on writing $U_{R'}$ for $U_{R'1} + U_{R'(2+3)}$, etc. Hence,
the principle is proved.

10.12. SAINT-VENANT’S PRINCIPLE — BOUSSINESQ-VON 
MISES-STERNBERG FORMULATION 

The Saint-Venant principle, as enunciated in terms of the strain energy
functional, does not yield any detailed information about individual stress
components at any specific point in an elastic body. However, such information
is clearly desired. To sharpen the principle, it may be stated as
follows (von Mises, 1945). “If the forces acting upon a body are restricted
to several small parts of the surface, each included in a sphere of radius $\epsilon$,
then the strains and stresses produced in the interior of the body at a finite
distance from all those parts are smaller in order of magnitude when the
forces for each single part are in equilibrium than when they are not.”

The classical demonstration of this principle is due to Boussinesq (1885),
who considered an infinite body filling the half-space $z > 0$ and subjected to
several concentrated forces, each of magnitude $F$, normal to the boundary
$z = 0$. If these normal forces are applied to points in a small circle $B$ with
diameter $\epsilon$, Boussinesq proved that the largest stress component at a point
$P$ which lies at a distance $R$ from $B$ is

(1) of order $F/R^2$ if the resultant of the forces is of order $F$,

(2) of order $(\epsilon/R)(F/R^2)$ if the resultant of the forces is zero,

(3) of order $(\epsilon/R)^2(F/R^2)$ if both the resultant force and the resultant
moment vanish.

(Cf. Secs. 8.8, 8.10.)

These relative orders of magnitude were believed to have general validity 
until von Mises (1945) showed that a modification is necessary. 
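
To get a feeling for the three Boussinesq estimates, the orders of magnitude can be tabulated directly. The sketch below is illustrative only: the function name `boussinesq_order` and the integer `case` labels are introduced here and are not part of the text, and the returned numbers are orders of magnitude, not stresses computed from an elasticity solution.

```python
def boussinesq_order(F, R, eps, case):
    """Order of magnitude of the largest stress at distance R from a
    loaded circle of diameter eps (normal loads on a half-space).

    case 1: resultant force of order F
    case 2: zero resultant force
    case 3: zero resultant force and zero resultant moment
    """
    power = {1: 0, 2: 1, 3: 2}[case]
    return (eps / R) ** power * F / R**2


# Example: loaded circle of unit diameter observed ten diameters away.
orders = [boussinesq_order(F=1.0, R=10.0, eps=1.0, case=c) for c in (1, 2, 3)]
```

Each additional vanishing resultant lowers the estimate by one factor of $\epsilon/R$, here a factor of ten.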



Fig. 10.12:1. von Mises’ examples (a)-(d) in which forces tangential to the surface of an
elastic half-space are applied in a small area.
Consider the half-space $z > 0$ again. Let forces of magnitude $F$ tangent
to the boundary $z = 0$ be applied to points in a small circle $B$ of diameter
$\epsilon$. Making use of the well-known Cerruti solution (Sec. 8.8), von Mises
obtained the following results for the four cases illustrated in Fig. 10.12:1.
The order of magnitude of the largest stress component at a point $P$ which
lies at a distance $R$ from $B$ is

(1) of order $\sigma_0 = F/R^2$ in case (a),

(2) of order $(\epsilon/R)\sigma_0$ in case (b),

(3) of order $(\epsilon/R)\sigma_0$ in case (c),

(4) of order $(\epsilon/R)^2\sigma_0$ in case (d).

The noteworthy case is (c), which is drastically different from what
one would expect from an indiscriminate generalization of Boussinesq’s
result, for in this case the forces are in static equilibrium, with zero moment
about any axis, but all one could expect is a stress magnitude of order no
greater than $(\epsilon/R)\sigma_0$, not $(\epsilon/R)^2\sigma_0$. von Mises found that in this case the
order of magnitude of the largest stress component is reduced to $(\epsilon/R)^2\sigma_0$
if and only if the external forces acting upon a small part of the surface are
such as to remain in equilibrium when all the forces are turned through an
arbitrary angle. (Such a case is called astatic equilibrium.)

von Mises examined next the stresses in a finite circular disk due to
loads on the circumference, and a similar conclusion was reached. (See
Fig. 10.12:2.)

Fig. 10.12:2. von Mises’ examples (a)-(d) of a circular disk subjected to loads on the
circumference.

These examples show that Saint-Venant’s principle, as stated in the
traditional form at the beginning of this section, does not hold true.

Accordingly, von Mises$^{10.3}$ proposed, and later Sternberg$^{10.3}$ proved
(1954), the following mathematical statement of the Saint-Venant principle:

Let a body be acted on by surface tractions which are finite and are
distributed over a number of regions all no greater than a sphere of diameter
$\epsilon$. Consider an interior point $x$ whose distance to any of these loading areas
is no less than a characteristic length which will be taken as unity; $\epsilon$ is
nondimensionalized with respect to this characteristic length. Then, as
$\epsilon \to 0$, the order of magnitude of the strain components at $x$ is as follows:

$e(x, \epsilon) = O(\epsilon^p)$,

where

(a) If the tractions have nonvanishing vector sums in at least one area,
then in general, $p \ge 2$. (Note that the surface traction is assumed
to be finite, so the resultant force $\to 0$ as $\epsilon^2$, since the area on which
the surface tractions act $\to 0$ as $\epsilon^2$.)

(b) If the resultant of the surface tractions in every loading area vanishes,
then $p \ge 3$.

(c) If, in addition, the resultant moment of the surface tractions in
every loading area also vanishes, then still we can be assured only
of $p \ge 3$.

(d) $p \ge 4$ in case of astatic equilibrium in every loading area, which
may be described by the 12 scalar conditions

$\int_{S(\epsilon)} T_i^* \, dS = 0, \qquad \int_{S(\epsilon)} T_i^* x_j \, dS = 0, \qquad (i, j = 1, 2, 3)$

for each loading area $S(\epsilon)$, where $T_i^*$ represents the specified surface
traction over $S(\epsilon)$, and $x_j$ is the coordinate of the point of
application of $T_i^*$.

If the tractions applied to $S(\epsilon)$ are parallel to each other and not
tangential to the surface, then if they are in equilibrium they are also in astatic
equilibrium, and the condition $p \ge 4$ prevails.
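
The 12 scalar conditions of astatic equilibrium are easy to test for a set of concentrated forces. The sketch below replaces the surface integrals by sums over point loads, which is an assumption made here for illustration; the function names `first_moments` and `exponent_bound` and the sample load sets are hypothetical and do not come from the text.

```python
def first_moments(points, forces):
    """Resultant force, moment about the origin, and the 3x3 array of
    sums F_i * x_j for a set of point loads (discrete stand-in for the
    integrals over a loading area)."""
    pairs = list(zip(points, forces))
    resultant = [sum(f[i] for f in forces) for i in range(3)]
    moment = [
        sum(x[1] * f[2] - x[2] * f[1] for x, f in pairs),
        sum(x[2] * f[0] - x[0] * f[2] for x, f in pairs),
        sum(x[0] * f[1] - x[1] * f[0] for x, f in pairs),
    ]
    astatic = [[sum(f[i] * x[j] for x, f in pairs) for j in range(3)]
               for i in range(3)]
    return resultant, moment, astatic


def exponent_bound(points, forces, tol=1e-12):
    """Bound on the exponent p (2, 3 or 4) for one loading area.
    A vanishing moment alone does not raise the bound above 3."""
    resultant, _moment, astatic = first_moments(points, forces)

    def is_zero(values):
        return all(abs(v) <= tol for v in values)

    if not is_zero(resultant):
        return 2
    if all(is_zero(row) for row in astatic):
        return 4
    return 3


# Two opposed collinear forces: zero force and moment, but not astatic.
bound_pair = exponent_bound([(1, 0, 0), (-1, 0, 0)],
                            [(1, 0, 0), (-1, 0, 0)])
# Three parallel forces normal to the plane z = 0, in equilibrium.
bound_parallel = exponent_bound([(1, 0, 0), (-1, 0, 0), (0, 0, 0)],
                                [(0, 0, 1), (0, 0, 1), (0, 0, -2)])
# A single unbalanced force.
bound_single = exponent_bound([(0, 0, 0)], [(0, 0, 1)])
```

The collinear pair is in equilibrium but fails the condition $\sum F_1 x_1 = 0$, so it only earns the bound 3, whereas the parallel normal loads satisfy all 12 conditions.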

If, instead of prescribing finite surface traction, we consider finite forces
being applied which remain finite as $\epsilon \to 0$, then the bounds 2, 3, 4 on the
exponent $p$ should be replaced by 0, 1, 2 in the cases named above, as was
illustrated in Figs. 10.12:1 and 10.12:2.

The theorem enunciated above does not preclude the validity of a
stronger Saint-Venant principle for special classes of bodies, such as thin
plates or shells or long rods. With respect to perturbations that occur at
the edges of a thin plate or thin shell, a significant result was obtained by
K. O. Friedrichs$^{10.3}$ (1950) in the form of a so-called boundary-layer theory.
With respect to lateral loads on shells, Naghdi$^{10.3}$ (1960) obtained similar
results.

10.13. PRACTICAL APPLICATIONS OF 
SAINT-VENANT’S PRINCIPLE 

It is well known that Saint-Venant’s principle has its analogy in
hydrodynamics and that these features are associated with the elliptic nature
of the partial differential equations. If the differential equations are
hyperbolic and two-dimensional, local disturbances may be propagated far
along the characteristics without attenuation. Then Saint-Venant’s
principle does not apply. For example, in the problem of the
response of an elastic half-space to a line load traveling at supersonic speeds
over the free surface, the governing equations are hyperbolic, and we know
that any fine structure of the surface pressure distribution is propagated all
the way to infinity.

On the other hand, one feels intuitively that the validity of Saint-Venant’s
principle is not limited to linear elastic solids or infinitesimal
displacements. One expects it to apply in the case of rubber for finite strain or
to steel even when yielding occurs. Although no precise proof is available,
Goodier$^{10.3}$ (1937) has argued on the basis of energy as follows:

Let a solid body be loaded in a small area whose linear dimensions are
of order $\epsilon$, with tractions which combine to give zero resultant force and
couple. Such a system of tractions imparts energy to the solid through the
relative displacements of the points in the small loaded area, because no
work is done by the tractions in any translation or rotation of the area as
a rigid body. Let the tractions be of order $p$. Let the slope of the
stress-strain curve of the material be of order $E$. (The stress-strain relationship
does not have to be linear.) Let one element of the loaded area be regarded
as fixed in position and orientation. Then since the strain must be of order
$p/E$, the displacements of points within the area are of order $p\epsilon/E$. The
work done by the traction acting on an element $dS$ is of order $(p\epsilon/E) \cdot p\, dS$.
The order of magnitude of the total work is, therefore, $p^2\epsilon^3/E$. Since a stress
of order $p$ implies a strain energy of order $p^2/E$ per unit volume, the region
in which the stress is of order $p$ must have a volume comparable with $\epsilon^3$.
Hence, the influence of the tractions cannot be appreciable at distances from
the loaded area which are large compared with $\epsilon$.
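
Goodier's chain of estimates can be followed step by step in a few lines. This is only a bookkeeping sketch of the orders of magnitude named in the argument; the function name `goodier_estimate` and the sample numbers are assumptions introduced here, and all numerical factors of order one are dropped, as in the text.

```python
import math


def goodier_estimate(p, E, eps):
    """Orders of magnitude in Goodier's energy argument.

    p   : order of the applied tractions
    E   : order of the slope of the stress-strain curve
    eps : linear dimension of the loaded area
    """
    strain = p / E                       # strain in the loaded region
    displacement = strain * eps          # relative displacement
    work = p * displacement * eps**2     # traction * displacement * area
    energy_density = p**2 / E            # strain energy per unit volume
    volume = work / energy_density       # region stressed to order p
    return work, volume


# Assumed sample values (any consistent units).
work, volume = goodier_estimate(p=2.0e6, E=2.0e11, eps=0.01)
```

Whatever the values of `p` and `E`, the estimated volume comes out as `eps**3`, which is the content of the argument: the highly stressed region has linear dimensions comparable with the loaded area.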
Goodier’s argument can be extended to bodies subjected to limited
plastic deformation. In fact, the argument provides an insight for practical
judgement of how local self-equilibrating tractions should influence the
strain and stress in the interior of a body.

An engineer needs to know not only the order of magnitude comparison
such as stated in the von Mises-Sternberg theorem; he needs to know also
how numerically trustworthy Saint-Venant’s principle is for his particular
problem. Hoff$^{10.3}$ (1945) has considered several interesting examples, two
of which are given in Figs. 10.13:1 and 10.13:2 and will be explained below.

In the first example, the torsion of beams with different cross-sections is
considered. One end of the beam is clamped, where cross-sectional warping
is prevented. The other end is free, where a torque is applied by means of
shear stresses distributed according to the requirements of the theory of
pure torsion. The difference between the prescribed end conditions at the
clamped end and those assumed in Saint-Venant’s torsion theory (Sec. 7.5)
may be stated in terms of a system of self-equilibrating tractions that act
at the clamped end. Timoshenko has given approximate solutions to these
problems. Hoff’s example refers to beams with dimensions as shown in
the figure and subjected to a torque of 10 in. lb. Due to the restriction
of warping, normal stresses are introduced in the bar in addition to the
shearing stresses of Saint-Venant’s torsion. For the rectangular beam, the
maximum normal stress at the fixed end is equal to 157 lb/sq in. For
the other two thin-walled channel sections, the maximum normal stresses
at the fixed end are 1230 and 10,900 lb/sq in., respectively, for the thicker
and thinner sections. In Fig. 10.13:1 curves are shown for the ratio of the
maximum normal stress $f_{max}(x)$ in any section $x$ to the maximum
normal stress $f_{max}(0)$ in the fixed-end section, plotted against the ratio
$x/L$ of the distance $x$ of the section from the fixed end of the bar to the total
length $L$ of the bar. Inspection of the curves reveals that, while in the case
of the solid rectangular section the normal stress caused by the restriction
of warping at the fixed end is highly localized, it has appreciable values
over the entire length of the channel section bars. Consequently, reliance
on Saint-Venant’s principle in the calculation of stresses caused by torsion
is entirely justified with the bar of rectangular section. In contrast, stresses
in the thin-walled section bars depend largely upon the end conditions.

Fig. 10.13:1. Hoff’s illustration of Saint-Venant’s principle.

Fig. 10.13:2. Hoff’s example illustrating the slow decay of self-equilibrating forces in a
space framework.

Hoff’s second example refers to pin-jointed space frameworks. In one
case a statically determinate framework is considered, to one end of which
a set of four self-equilibrating concentrated loads are applied, as shown in
Fig. 10.13:2. The figures written on the elements of the framework represent
the forces acting in the bars measured in the same units as the applied loads.
Negative sign indicates compression. It can be seen that the effect of the
forces at one end of the structure is still noticeable at the other end.

Hoff’s examples show that Saint-Venant’s principle works only if there 
is a possibility for it to work; in other words, only if there exist paths for 
the internal forces to follow in order to balance one another within a short 
distance of the region at which a group of self-equilibrating external forces 
is applied. This point of view is in agreement with Goodier’s reasoning. 

Hoff’s examples are not in conflict with the von Mises-Sternberg theorem,
for the latter merely asserts a certain order of magnitude comparison
for the stress and strain as the size of the region of self-equilibrating loading
shrinks to zero, and does not state how the stresses are propagated. On
the other hand, although Goodier’s reasoning does not provide a definitive
theorem, it is very suggestive in pointing out the basic reason for
Saint-Venant’s principle and can be used in estimating the practical efficiency of
the principle.

10.14. EXTREMUM PRINCIPLES FOR PLASTICITY 

In this section we shall derive the principles of minimum potential energy
and minimum complementary energy for displacement and stress increments
in elastic-plastic bodies (Hill 1950). We consider a body $V$ subjected
to prescribed body forces $b_i$ in $V$, traction $\overset{\nu}{T}_i$ on $\partial V_\sigma$, and displacements on
$\partial V_u$. Suppose that $u_i$ and $\sigma_{ij}$ are the elastic-plastic solution of the static
problem and that $du_i$ and $d\sigma_{ij}$ are the increment solution for infinitesimal
increments $db_i$ in $V$, $d\overset{\nu}{T}_i$ on $\partial V_\sigma$ and $d\bar{u}_i$ on $\partial V_u$. Then $d\sigma_{ij}$ satisfy the
equilibrium equations

(1)  $d\sigma_{ij,j} + db_i = 0$, in $V$,

the traction conditions

(2)  $n_j \, d\sigma_{ji} = d\overset{\nu}{T}_i$, on $\partial V_\sigma$,

and can be expressed in terms of displacement increments $du_i$ through the
strain increments $de_{ij}$ [Eq. (6.10:11), (6.11:18) or (6.14:17)]

$d\sigma_{ij} = D_{ijkl} \, de_{kl}$  for $f < 0$

$\qquad\;\; = D^{ep}_{ijkl} \, de_{kl}$  for $f > 0$,

where $D^{ep}_{ijkl}$ is given in Eq. (6.11:11), (6.11:18) or (6.14:17) and $f$ is the
yield function.
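Extremum Principles for Displacement Increment. Let $du_i$ be the
incremental solution over a volume $V$. Let $d\hat{u}_i$ be a kinematically admissible
displacement field such that it obeys the internal constraints, if any, and
satisfies $d\hat{u}_i = d\bar{u}_i$ on $\partial V_u$. The corresponding strain and stress increments
are $(d\hat{e}_{ij}, d\hat{\sigma}_{ij})$. Consider the following functional for displacement
increments $du_i$:

(3)  $\Pi_p(du_i) = \frac{1}{2} \int_V d\sigma_{ij}\, de_{ij}\, dV - \int_V db_i\, du_i\, dV - \int_{\partial V_\sigma} d\overset{\nu}{T}_i\, du_i\, dS$.

We can show that

$\Pi_p(d\hat{u}_i) - \Pi_p(du_i) = \frac{1}{2} \int_V (d\hat{\sigma}_{ij}\, d\hat{e}_{ij} - d\sigma_{ij}\, de_{ij})\, dV - \int_V db_i\,(d\hat{u}_i - du_i)\, dV$

$\qquad\qquad - \int_{\partial V_\sigma} d\overset{\nu}{T}_i\,(d\hat{u}_i - du_i)\, dS$.

By taking into account the relation

$\int_{\partial V_\sigma} d\overset{\nu}{T}_i\,(d\hat{u}_i - du_i)\, dS = \int_V d\sigma_{ij}\,(d\hat{e}_{ij} - de_{ij})\, dV + \int_V d\sigma_{ij,j}\,(d\hat{u}_i - du_i)\, dV$,

the difference can be written as

(4)  $\Pi_p(d\hat{u}_i) - \Pi_p(du_i) = \frac{1}{2} \int_V (d\hat{\sigma}_{ij}\, d\hat{e}_{ij} + d\sigma_{ij}\, de_{ij} - 2\, d\sigma_{ij}\, d\hat{e}_{ij})\, dV$.

With the substitution of $de_{ij} = D^{-1}_{ijkl}\, d\sigma_{kl} + de^p_{ij}$ and $d\hat{e}_{ij} = D^{-1}_{ijkl}\, d\hat{\sigma}_{kl} + d\hat{e}^p_{ij}$,
the integrand becomes

$D^{-1}_{ijkl}\,(d\hat{\sigma}_{ij} - d\sigma_{ij})(d\hat{\sigma}_{kl} - d\sigma_{kl}) + d\hat{\sigma}_{ij}\, d\hat{e}^p_{ij} + d\sigma_{ij}\, de^p_{ij} - 2\, d\sigma_{ij}\, d\hat{e}^p_{ij}$.

The first term is obviously non-negative due to the positive definiteness of
$D_{ijkl}$. For perfectly plastic materials,

$d\hat{\sigma}_{ij}\, d\hat{e}^p_{ij} = d\sigma_{ij}\, de^p_{ij} = 0$,

$d\sigma_{ij}\, d\hat{e}^p_{ij} = d\hat{\lambda}\, n_{ij}\, d\sigma_{ij} \le 0$,

[see Eqs. (6.10:3,4)] because $d\hat{\lambda} \ge 0$ and $n_{ij}\, d\sigma_{ij} \le 0$. Consequently, the
remaining terms of the integrand are never negative, so that

(5)  $\Pi_p(d\hat{u}_i) \ge \Pi_p(du_i)$.

The equality holds only if $d\hat{\sigma}_{ij} = d\sigma_{ij}$.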
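For work-hardening materials, let

(6)  $df = n_{kl}\, d\sigma_{kl}$, $\qquad d\hat{f} = n_{kl}\, d\hat{\sigma}_{kl}$,

with $df < 0$ denoting elastic unloading. From Eq. (6.11:15), the flow
equation can be written as

(7)  $K_p\, de^p_{ij} = H(df)\, n_{ij}\, df$,

$\quad\;\;\, K_p\, d\hat{e}^p_{ij} = H(d\hat{f})\, n_{ij}\, d\hat{f}$.

The sum of the remaining terms of the integrand becomes

(8)  $R = \frac{1}{K_p} \left[ (d\hat{f})^2 H(d\hat{f}) + (df)^2 H(df) - 2\,(df)(d\hat{f})\, H(d\hat{f}) \right]$.

We can show, by evaluating Eq. (8) in each case, that

(9)  $R = 0$  if $df < 0$ and $d\hat{f} < 0$,

$\quad\;\;\, R = (d\hat{f} - df)^2 / K_p \ge 0$  if $df \ge 0$ and $d\hat{f} \ge 0$,

$\quad\;\;\, R = [(d\hat{f})^2 - 2\,(df)(d\hat{f})] / K_p \ge 0$  if $df < 0$ and $d\hat{f} \ge 0$,

$\quad\;\;\, R = (df)^2 / K_p \ge 0$  if $df \ge 0$ and $d\hat{f} < 0$.

Consequently, the integrand is never negative, and it vanishes only when
$d\hat{\sigma}_{ij} = d\sigma_{ij}$, in which case $\Pi_p(d\hat{u}_i) = \Pi_p(du_i)$. In other words, Eq. (5) also
holds for work-hardening materials. The equality $d\hat{\sigma}_{ij} = d\sigma_{ij}$ implies that
$d\hat{e}_{ij} = de_{ij}$ and therefore $d\hat{u}_i = du_i$.

The extremum principle for the functional Eq. (3) is also called the
principle of minimum potential energy for the displacement increment.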
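
The sign of the remainder $R$ of Eq. (8) in the four loading and unloading cases can be spot-checked numerically. The sketch below assumes that $H$ is the Heaviside step function with $H(0) = 1$ and that $K_p > 0$; the names `df` (actual solution), `df_hat` (admissible trial field), `heaviside` and `remainder` are chosen here for illustration.

```python
def heaviside(x):
    """Heaviside step function; H(0) = 1 is an assumption made here."""
    return 1.0 if x >= 0.0 else 0.0


def remainder(df, df_hat, K_p=1.0):
    """Plastic remainder R of Eq. (8) for the actual increment df and
    the trial increment df_hat (K_p > 0 assumed)."""
    return (df_hat**2 * heaviside(df_hat)
            + df**2 * heaviside(df)
            - 2.0 * df * df_hat * heaviside(df_hat)) / K_p


# Sample both quantities over loading (>= 0) and unloading (< 0) values.
values = [k / 4.0 for k in range(-8, 9)]
min_R = min(remainder(a, b) for a in values for b in values)

# One representative of each of the four cases.
case_values = {
    "both unloading": remainder(-1.0, -0.5),
    "both loading": remainder(1.0, 0.5),
    "actual unloading, trial loading": remainder(-1.0, 0.5),
    "actual loading, trial unloading": remainder(1.0, -0.5),
}
```

Over this grid the smallest value of `remainder` is zero, consistent with the claim that the remaining terms of the integrand are never negative.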

Extremum Principle for Stress. The complementary extremum principle
concerns a statically and plastically admissible stress increment field.
A stress field is statically admissible if it satisfies the equilibrium equations
in $V$ and the traction condition on $\partial V_\sigma$. A stress field is plastically
admissible if it obeys the yield criterion everywhere in $V$. The complementary
functional for the stress increments $d\sigma_{ij}$ is

(10)  $\Pi_c(d\sigma_{ij}) = \frac{1}{2} \int_V d\sigma_{ij}\, de_{ij}\, dV - \int_{\partial V_u} n_j\, d\sigma_{ij}\, d\bar{u}_i\, dS$.

Expressing $de_{ij}$ in terms of $d\sigma_{ij}$ and $de^p_{ij}$, we can rewrite Eq. (10) as

$\Pi_c(d\sigma_{ij}) = \frac{1}{2} \int_V (D^{-1}_{ijkl}\, d\sigma_{ij}\, d\sigma_{kl} + d\sigma_{ij}\, de^p_{ij})\, dV - \int_{\partial V_u} n_j\, d\sigma_{ij}\, d\bar{u}_i\, dS$,

where

(11)  $d\sigma_{ij}\, de^p_{ij} = 0$, for perfectly plastic materials,

$\qquad\qquad\quad\;\; = \frac{1}{K_p}\,(df)^2 H(df)$, for work-hardening materials,

and $df\ (= n_{ij}\, d\sigma_{ij})$ is defined in Eq. (6). Clearly the functional is a function
of $d\sigma_{ij}$ only. Let $d\sigma_{ij}$ be the plastic solution of the increment and $d\hat{\sigma}_{ij}$ be
a statically admissible plastic stress field. Then

(12)  $\Pi_c(d\hat{\sigma}_{ij}) - \Pi_c(d\sigma_{ij}) = \frac{1}{2} \int_V (d\hat{\sigma}_{ij}\, d\hat{e}_{ij} + d\sigma_{ij}\, de_{ij} - 2\, d\hat{\sigma}_{ij}\, de_{ij})\, dV$.

In deriving Eq. (12), we have used the relation that

$\int_V (d\hat{\sigma}_{ij} - d\sigma_{ij})\, de_{ij}\, dV = \int_{\partial V_u} n_j\,(d\hat{\sigma}_{ij} - d\sigma_{ij})\, d\bar{u}_i\, dS$

for any two admissible stress fields with a continuous displacement $du_i = d\bar{u}_i$
on $\partial V_u$. The integrand is similar to that in Eq. (4) and may be shown
to be positive definite by the same method. Therefore, we have

(13)  $\Pi_c(d\hat{\sigma}_{ij}) \ge \Pi_c(d\sigma_{ij})$.

The equality holds only if $d\hat{\sigma}_{ij} = d\sigma_{ij}$.

It can be shown that

$\Pi_c(d\sigma_{ij}) = -\Pi_p(du_i)$,

which implies

(14)  $\Pi_c(d\hat{\sigma}_{ij}) \ge \Pi_c(d\sigma_{ij}) = -\Pi_p(du_i) \ge -\Pi_p(d\hat{u}_i)$.

The extremum principle for Eq. (10) is also called the principle of minimum
complementary energy for $d\sigma_{ij}$.

The stationary conditions $\delta\Pi_p = 0$ or $\delta\Pi_c = 0$ for the functionals in
Eqs. (3) and (10) are useful for constructing “weak-form” increment solutions.

Rigid-Plastic Materials. For rigid-plastic materials, the elastic
strain increment is zero, and therefore,

(15)  $de_{ij} = \frac{1}{2}\,(du_{i,j} + du_{j,i}) = de^p_{ij}$.

The principle of extremum plastic work becomes

(16)  $\sigma_{ij}\, de_{ij} = D_p(de_{ij}) \ge \hat{\sigma}_{ij}\, de_{ij}$,

where the strain increment $de_{ij}$ is normal to the yield surface at the stress
point $\sigma_{ij}$ in the stress space and $\hat{\sigma}_{ij}$ is any stress lying inside or at another
point on the yield surface. It is emphasized that $D_p(de_{ij})$ is a function of
$de_{ij}$ only.

Let us define another functional

(17)  $\Pi(du_i) = \int_V D_p(de_{ij})\, dV - \int_V b_i\, du_i\, dV - \int_{\partial V_\sigma} \overset{\nu}{T}_i\, du_i\, dS$

as a function of the velocity field solution $du_i$. Let $d\hat{u}_i$ be a kinematically
admissible field with its associated stress $\hat{\sigma}_{ij}$ satisfying the yield criterion in part
or all of $V$ [i.e., $f(\hat{\sigma}_{ij}, T, \xi_i) < 0$ or $f = 0$, $d\hat{e}_{ij} = \hat{n}_{ij}\, d\lambda$]. Then

(18)  $\Pi(d\hat{u}_i) - \Pi(du_i) = \int_V [D_p(d\hat{e}_{ij}) - D_p(de_{ij}) - \sigma_{ij}\,(d\hat{e}_{ij} - de_{ij})]\, dV$

$\qquad\qquad\qquad\qquad\;\; = \int_V [D_p(d\hat{e}_{ij}) - \sigma_{ij}\, d\hat{e}_{ij}]\, dV$.

By the maximum principle of plastic dissipation the integrand is nonnegative.
Therefore,

(19)  $\Pi(d\hat{u}_i) \ge \Pi(du_i)$.

10.15. LIMIT ANALYSIS 

We define the critical state of a body as a large increase in plastic
deformation, much larger than the elastic deformation, with little or no
increase in load. In other words, it is a state of impending plastic collapse
or incipient plastic flow in which $de^p_{ij} \ne 0$ under constant load (i.e.,
$db_i = d\overset{\nu}{T}_i = d\bar{u}_i = 0$). Therefore,

(1)  $\int_V db_i\, du_i\, dV + \int_{\partial V_\sigma} d\overset{\nu}{T}_i\, du_i\, dS = \int_V d\sigma_{ij}\, de_{ij}\, dV$

$\qquad\qquad = \int_V (D^{-1}_{ijkl}\, d\sigma_{ij}\, d\sigma_{kl} + d\sigma_{ij}\, de^p_{ij})\, dV = 0$.

The positive definiteness of $D^{-1}_{ijkl}\, d\sigma_{ij}\, d\sigma_{kl}$ combined with Drucker’s inequality
implies $d\sigma_{ij} = 0$. This in turn implies $de^e_{ij} = 0$ and $de_{ij} = de^p_{ij}$, i.e., at
impending plastic collapse the plastic flow is rigid-plastic. This result was
first noted by Drucker et al. (1951). It makes it possible to use a rigid-plastic
formulation to establish the upper and lower bounds in limit analysis.

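Lower Bound Theorem. Consider the rigid-plastic deformation with
known body force $b_i$ in $V$, traction $\overset{\nu}{T}_i$ on $\partial V_\sigma$ and increment $d\bar{u}_i = 0$ on
$\partial V_u$. Let the applied loads and prescribed displacements be respectively
written in the form $P b_i$, $P \overset{\nu}{T}_i$ and $P\, d\bar{u}_i$, where $P$ is a load parameter. We
further let ($du_i$, $de_{ij}$ and $\sigma_{ij}$) be the actual solution at plastic collapse
associated with the load parameter $P_l$. We have

$\int_V D_p(de_{ij})\, dV = \int_V \sigma_{ij}\, de_{ij}\, dV = P_l \left( \int_V b_i\, du_i\, dV + \int_{\partial V_\sigma} \overset{\nu}{T}_i\, du_i\, dS \right)$.

If a statically admissible stress $\hat{\sigma}_{ij}$ is in equilibrium with $P_{LB}\, b_i$ and $P_{LB}\, \overset{\nu}{T}_i$,
then

(2)  $\int_V \hat{\sigma}_{ij}\, de_{ij}\, dV = P_{LB} \left( \int_V b_i\, du_i\, dV + \int_{\partial V_\sigma} \overset{\nu}{T}_i\, du_i\, dS \right)$.

Using Eq. (2) and the principle of maximum plastic work for rigid-plastic
deformation, $D_p(de_{ij}) \ge \hat{\sigma}_{ij}\, de_{ij}$, we obtain a lower bound of the collapse
load

(3)  $P_{LB} = \dfrac{\int_V \hat{\sigma}_{ij}\, de_{ij}\, dV}{\int_V b_i\, du_i\, dV + \int_{\partial V_\sigma} \overset{\nu}{T}_i\, du_i\, dS} \le \dfrac{\int_V D_p(de_{ij})\, dV}{\int_V b_i\, du_i\, dV + \int_{\partial V_\sigma} \overset{\nu}{T}_i\, du_i\, dS} = P_l$.

Since $du_i$ and $de_{ij}$ are generally not known, we cannot use Eq. (2) directly
to evaluate $P_{LB}$.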

Upper Bound Theorem. For a kinematically admissible increment $d\hat{u}_i$
with $d\hat{u}_i = 0$ on $\partial V_u$, we have

(4)  $\int_V \sigma_{ij}\, d\hat{e}_{ij}\, dV = P_l \left( \int_V b_i\, d\hat{u}_i\, dV + \int_{\partial V_\sigma} \overset{\nu}{T}_i\, d\hat{u}_i\, dS \right)$,

where $\sigma_{ij}$ is the solution at impending collapse and $P_l$ is the limit load. Let
us define

(5)  $P_{UB} = \dfrac{\int_V D_p(d\hat{e}_{ij})\, dV}{\int_V b_i\, d\hat{u}_i\, dV + \int_{\partial V_\sigma} \overset{\nu}{T}_i\, d\hat{u}_i\, dS}$.

Then using Eq. (4) and $D_p(d\hat{e}_{ij}) \ge \sigma_{ij}\, d\hat{e}_{ij}$ for rigid-plastic deformation,
we conclude that

(6)  $P_{UB} = P_l\, \dfrac{\int_V D_p(d\hat{e}_{ij})\, dV}{\int_V \sigma_{ij}\, d\hat{e}_{ij}\, dV} \ge P_l$.

If one has found a kinematically admissible velocity field having its strain
rate associated with a statically admissible stress field everywhere† in the
domain, then one has a complete solution. This solution predicts the correct
collapse load using limit analysis. However, this may not be the exact
solution, because the solution may not be unique.


Lower and Upper Bound Loci for Multiple Load Parameters. The
results given in Eqs. (3) and (4) can be generalized to include several
parameters $P_I$ such that

(7)  $b_i = \sum_I P_I\,(b_i)_I$,  prescribed body force in $V$,

$\quad\;\;\, \overset{\nu}{T}_i = \sum_I P_I\,(\overset{\nu}{T}_i)_I$,  prescribed traction on $\partial V_\sigma$,

$\quad\;\;\, d\bar{u}_i = \sum_I P_I\,(d\bar{u}_i)_I$,  prescribed displacement increment on $\partial V_u$.

For a kinematically admissible velocity field $du_i$, we define the generalized
velocities

(8)  $dv_I = \int_V (b_i)_I\, du_i\, dV + \int_{\partial V_\sigma} (\overset{\nu}{T}_i)_I\, du_i\, dS$.

Let $(du_i, de_{ij}, \sigma_{ij})$ be the solution at collapse with loading parameters $(P_l)_I$,
then

$\int_V \sigma_{ij}\, de_{ij}\, dV = \sum_I (P_l)_I\, dv_I$.

†In the rigid region where $de_{ij} = 0$, the question of association does not arise.
For a statically admissible plastic stress field $\hat{\sigma}_{ij}$, we have

(9)  $\int_V \hat{\sigma}_{ij}\, de_{ij}\, dV = \sum_I (P_{LB})_I \left[ \int_V (b_i)_I\, du_i\, dV + \int_{\partial V_\sigma} (\overset{\nu}{T}_i)_I\, du_i\, dS \right]$

$\qquad\qquad\qquad\quad = \sum_I (P_{LB})_I\, dv_I$.

The extremum of plastic dissipation implies that

(10)  $\sum_I (P_l)_I\, dv_I \ge \sum_I (P_{LB})_I\, dv_I$.

The right-hand side of the inequality gives a lower bound locus of the
collapse loads.

For a kinematically admissible increment $du_i^*$ and its associated strain
increment $de_{ij}^*$ and stresses $\sigma_{ij}^*$, we have

(11)  $\int_V D_p(de_{ij}^*)\, dV = \sum_I (P_{UB})_I \left[ \int_V (b_i)_I\, du_i^*\, dV + \int_{\partial V_\sigma} (\overset{\nu}{T}_i)_I\, du_i^*\, dS \right]$

$\qquad\qquad\qquad\quad\;\; = \sum_I (P_{UB})_I\, dv_I^*$,

$\int_V \sigma_{ij}\, de_{ij}^*\, dV = \sum_I (P_l)_I\, dv_I^*$.

Then

(12)  $\sum_I (P_l)_I\, dv_I^* \le \sum_I (P_{UB})_I\, dv_I^*$

gives an upper bound locus of the collapse loads, which can be computed explicitly
from the admissible velocity field $du_i^*$. More general theorems of limit
analysis have been introduced by Salencon (1977). Interested readers are referred
to the literature.

We shall consider a multi-parameter example of a sandwich beam consisting
of two equal thin flanges of cross-sectional area $A$ and length $L$. A web of
negligible longitudinal strength separates the flanges at a distance $d$ apart.
The beam is under an axial force $P$ and a bending moment $M$, which are
treated as two independent loading parameters.

For lower bound analysis, the statically admissible stresses are

(13)  $\sigma_{u,l} = \dfrac{P}{2A} \pm \dfrac{M}{Ad}$,

where $\sigma_u$ and $\sigma_l$ are the stresses in the upper and lower flanges, respectively.
The yield criterion requires that both $|\sigma_u|$ and $|\sigma_l|$ be less than or equal to $\sigma_Y$,
that is,

(14)  $\left| \dfrac{P}{P_l} \pm \dfrac{M}{M_l} \right| \le 1$,

which is the lower bound locus with $P_l = 2A\sigma_Y$ and $M_l = \sigma_Y A d$. Since
the problem is statically determinate, the lower bound locus is the actual
limit locus with $P_l$ and $M_l$ being the limit loads for simple tension and pure
bending, respectively.

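For upper bound analysis, we use the elongation increments of the
upper and lower flanges $d\Delta u_u$ and $d\Delta u_l$ as the kinematically admissible
increment. Then the plastic dissipation is

$\int_V D_p(de_{ij}^*)\, dV = A\sigma_Y\,(|d\Delta u_u| + |d\Delta u_l|)$,

and the external work is

$\sum_I (P_{UB})_I \int_{\partial V_\sigma} (\overset{\nu}{T}_i)_I\, du_i^*\, dS = A \left[ \left( \dfrac{P}{2A} + \dfrac{M}{Ad} \right) d\Delta u_u + \left( \dfrac{P}{2A} - \dfrac{M}{Ad} \right) d\Delta u_l \right]$.

By equating the plastic dissipation and the external work, we obtain the
upper bound locus

$\left( \dfrac{P}{P_l} + \dfrac{M}{M_l} \right) d\Delta u_u + \left( \dfrac{P}{P_l} - \dfrac{M}{M_l} \right) d\Delta u_l = |d\Delta u_u| + |d\Delta u_l|$.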
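
The two loci of the sandwich-beam example can be compared numerically along a fixed load direction. The sketch below is an illustration under stated assumptions: the loads are taken as $s\,(P, M)$ with a scalar multiplier $s$, the function names `lower_bound_factor` and `upper_bound_factor` and the numerical data are introduced here, and a nonzero load direction is assumed.

```python
def lower_bound_factor(P, M, P_l, M_l):
    """Largest multiplier s for which s*(P, M) still satisfies the yield
    condition |P/P_l +/- M/M_l| <= 1 in both flanges."""
    worst = max(abs(P / P_l + M / M_l), abs(P / P_l - M / M_l))
    return 1.0 / worst


def upper_bound_factor(P, M, P_l, M_l, du_u, du_l):
    """Multiplier s obtained by equating plastic dissipation and external
    work for one trial mechanism (flange elongations du_u, du_l)."""
    external = (P / P_l + M / M_l) * du_u + (P / P_l - M / M_l) * du_l
    if external <= 0.0:
        return float("inf")  # the mechanism is not driven by this load
    return (abs(du_u) + abs(du_l)) / external


# Assumed data: A = 1, sigma_Y = 1, d = 1, so P_l = 2 and M_l = 1.
P_l, M_l = 2.0, 1.0
P, M = 1.0, 0.25  # assumed load direction

s_lower = lower_bound_factor(P, M, P_l, M_l)
mechanisms = [(1.0, 0.0), (0.0, 1.0), (1.0, 1.0), (1.0, -1.0)]
s_upper = [upper_bound_factor(P, M, P_l, M_l, a, b) for a, b in mechanisms]
```

For this load direction every trial mechanism gives a multiplier no smaller than `s_lower`, and the mechanism in which only the upper flange yields reproduces it, in line with the remark that the lower bound locus is the actual limit locus.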


PROBLEMS 

10.3. Derive the differential equation and permissible boundary conditions
for a membrane stretched over a simply connected regular region $A$ by minimizing
the functional $I$ with respect to the displacement $w$; $p(x, y)$ being a given function:

$I = \frac{1}{2} \iint_A (w_x^2 + w_y^2)\, dx\, dy - \iint_A p\, w\, dx\, dy$.

10.4. Obtain Euler’s equation for the functional

$I' = \frac{1}{2} \iint_A [w_x^2 + w_y^2 + (\nabla^2 w + p)^2 - 2pw]\, dx\, dy$.

Ans. $\nabla^4 w - \nabla^2 w + \nabla^2 p - p = 0$.

Note: Compare $I'$ with $I$ of Problem 10.3. Since $\nabla^2 w + p = 0$ for a membrane,
the solution of Problem 10.3 also satisfies the present problem. But the
integrand of $I'$ contains higher derivatives of $w$ and the equation $\delta I' = 0$ is a
sharper condition than the equation $\delta I = 0$. These examples illustrate Courant’s
method of sharpening a variational problem and accelerating the convergence of
an approximating sequence in the direct method of solution of variational problems.

10.5. Bateman’s principle in fluid mechanics states that in a flow of an ideal,
nonviscous fluid (compressible or incompressible) in the absence of external body
forces, the “pressure integral”

$\iiint p\, dx\, dy\, dz$,

which is the integral of pressure over the entire fluid domain, is an extremum.
Consider the special case of a steady, irrotational flow of an incompressible fluid,
for which Bernoulli’s equation gives

$p = \text{const.} - \dfrac{\rho V^2}{2} = \text{const.} - \dfrac{\rho}{2}\,(\phi_x^2 + \phi_y^2 + \phi_z^2)$,

where $\phi$ is the velocity potential. Derive the field equation governing $\phi$ and
the corresponding natural boundary conditions according to Bateman’s principle.
[H. Bateman, Proc. Roy. Soc. London, A 125 (1929) 598-618.]

10.6. Let

$T = \sum_{\mu,\nu} P_{\mu\nu}(t, q_1, \ldots, q_m)\, \dot{q}_\mu(t)\, \dot{q}_\nu(t)$,

$U = U(t, q_1, \ldots, q_m)$,

where $\dot{q}_\mu(t) = dq_\mu/dt$, and $q_\mu(t_0) = a_\mu$, $q_\mu(t_1) = b_\mu$, $(\mu = 1, \ldots, m)$; $a_\mu$, $b_\mu$, $t_0$, $t_1$ are
given numbers. Derive the Euler differential equations for the variational problem
connected with the functional

$\int_{t_0}^{t_1} L\, dt$.

Prove that

$\dfrac{d(T + U)}{dt} = 0$,

if $\partial U/\partial t = 0$, $\partial P_{\mu\nu}/\partial t = 0$, $\mu, \nu = 1, \ldots, m$. Hint: Use Euler’s relation for
homogeneous functions.

10.7. Find the curves in the $x$, $y$-plane such that

$\int \sqrt{2E - n^2 y^2}\; ds$, $\qquad ds^2 = dx^2 + dy^2$,

is stationary, where $E$ and $n$ are constants and the integral is taken between fixed
end points.

10.8. Let $D$ be the set of all functions $u(x)$ with the following properties:

$D$:  (1) $u(x) = a_1 \sin \pi x + a_2 \sin 2\pi x$,

$\quad\;\;$ (2) $a_1$, $a_2$ are real, arbitrary numbers,

$\quad\;\;$ (3) $a_1^2 + a_2^2 \ne 0$.

Consider the functional

$J[u] = \dfrac{\int_0^1 [u'(x)]^2\, dx}{\int_0^1 [u(x)]^2\, dx}$

for all functions $u(x)$ in $D$.

Questions:

(a) Give necessary conditions for a function $u^*(x)$ in $D$ so that $u^*(x)$ furnishes
an extremum of the functional $J$.

(b) Show that there exist exactly two functions

$u_1(x) = a_{11} \sin \pi x + a_{12} \sin 2\pi x$,

$u_2(x) = a_{21} \sin \pi x + a_{22} \sin 2\pi x$,

notwithstanding a constant factor, which satisfy the necessary conditions
mentioned in (a).

(c) Show that for one of the functions mentioned in (b), say, $u_1(x)$, the inequality

$J[u_1] \le J[u]$

holds for all functions $u(x)$ in $D$. Then show that for the other solution,
$u_2(x)$, the inequality

$J[u] \le J[u_2]$

holds for all functions $u(x)$ in $D$.

(d) Does there exist any relation between $J[u_1]$, $J[u_2]$ and the eigenvalues of the
eigenvalue problem

$-u''(x) = \lambda u(x)$, $\quad 0 < x < 1$,

with

$u(0) = u(1) = 0$?

10.9. Clapeyron’s theorem states that, if a linear elastic body is in equilibrium
under a given system of body forces $F_i$ and surface forces $T_i$, then the
strain energy of deformation is equal to one-half the work that would be done by
the external forces (of the equilibrium state) acting through the displacements $u_i$
from the unstressed state to the state of equilibrium, i.e.,

$\int_V F_i u_i\, dv + \int_S T_i u_i\, dS = 2 \int_V W\, dv$.

Demonstrate Clapeyron’s Theorem for:

(a) A simply supported beam under its own weight [Fig. P10.9(a)].

(b) A rod under its own weight [Fig. P10.9(b)].

(c) A strip of membrane of infinite length and width $b$ [Fig. P10.9(c)], under
a constant pressure $p_0$, for which

$w = w_0\,(1 - 4x^2/b^2)$.

Fig. P10.9. Panels (a), (b), (c).

10.10. The “poker chip” problem (Max Williams). To obtain a nearly triaxial
tension test environment for polymer materials, a “poker chip” test specimen (a
short circular cylinder) is glued between two circular cylinders (Fig. P10.10).
When the cylinder is subjected to simple tension, the center of the poker chip is
subjected to a triaxial tension stress field. Let the elastic constants of the media be
$E_1$, $G_1$, $\nu_1$ and $E_2$, $G_2$, $\nu_2$, as indicated
in Fig. P10.10, with E\ E 2 , — 0.5. 

Assume cylindrical symmetry, and obtain approximate expressions of the stress field
by the following methods.

(a) Use the complementary energy theorem and a stress field satisfying stress
boundary conditions and the equations of equilibrium.

(b) Use the potential energy theorem and assumed displacements satisfying
displacement boundary conditions.

(c) Experimental results suggest that the following displacement field is
reasonable:


Wo 3/5/1 2 \ raP o1>4 

T =^(1-^64 — 



w = 


wo b 


u = 



— h < z < h , 
where $g(r)$ is as yet an unknown function of $r$. Obtain the governing equation
for $g(r)$ and its appropriate solution for this problem by attempting to satisfy
the following equilibrium equations in some average sense with respect to the
$z$-direction:

$\dfrac{\partial \sigma_{rr}}{\partial r} + \dfrac{\partial \sigma_{rz}}{\partial z} + \dfrac{\sigma_{rr} - \sigma_{\theta\theta}}{r} = 0$,

$\dfrac{\partial \sigma_{rz}}{\partial r} + \dfrac{\partial \sigma_{zz}}{\partial z} + \dfrac{\sigma_{rz}}{r} = 0$.

10.11. Consider a pin-ended column of length $L$ and bending stiffness $EI$
subjected to an end thrust $P$. A spring is attached at the middle of the column
as shown in Fig. P10.11. When the column is straight the spring tension is zero.
If the column deflects by an amount $\Delta$, the spring exerts a force $R$ on the column:

$R = K\Delta - \alpha\Delta^3$, $\quad (K > 0,\ \alpha > 0)$.

Derive the equation of equilibrium of the system. It is permissible to use the
Euler-Bernoulli approximation for a beam, for which the strain energy per unit
length is

$\frac{1}{2} EI \cdot (\text{curvature})^2$

and

bending moment $= EI \cdot (\text{curvature})$.

10.12. For the column of Problem 10.11 under the axial load P, is the 
solution unique? Under what situation is the solution nonunique? What are the 
possible solutions when the uniqueness is lost? 

10.13. A linear elastic beam of bending stiffness $EI$ is supported at four equidistant points $ABCD$, Fig. P10.13. The supports at $A$ and $D$ are pin-ended. At $B$ and $C$, the beam rests on two identical nonlinear pillars. The characteristic of the pillars may be described as “hardening” and is expressed by the equation

$$K\Delta = R - \beta R^3 \qquad (K > 0,\ \beta > 0),$$

where $R$ is the reaction of the pillar, $\Delta$ is the downward deflection of the beam at the point of attachment of the beam to the pillar, and $K$ and $\beta$ are constants.

A load $P$ is applied to the beam at the midspan point. Find the reactions at the supports $B$ and $C$. One of the two minimum principles (of potential energy and of complementary energy) is easier to apply to this problem. Solve the problem by a variational method with the appropriate minimum principle.

Fig. P10.13. Fig. P10.14.


10.14. Consider a square plate loaded in the manner shown in Fig. P10.14. Derive Euler’s equation and the natural boundary conditions for $V$ to be a minimum when $w(x, y)$ is varied.



10.15. Consider a material with the elastic modulus tensor satisfying

$$D_{ijkk} = D_{kkij} = 3K\delta_{ij} = E\delta_{ij}/(1 - 2\nu).$$

The bulk modulus $K$ is infinite while $D_{ijkl} - \delta_{ij}\delta_{kl}K$ is bounded if the material is incompressible. Show that the contact transformation gives

$$D_{ijkl}\,e_{ij}e_{kl}/2 - p\,e_{kk} = D'_{ijkl}\,e'_{ij}e'_{kl}/2 - p^2/(2K),$$

where $D'_{ijkl} = D_{ijkl} - K\delta_{ij}\delta_{kl}$, $p$ is the hydrostatic pressure and $e'_{ij}$ is the deviatoric strain, and that the potential energy of the element is

$$\Pi_{Me}(u_i, p) = \int_\Omega \left(\tfrac{1}{2} D'_{ijkl}\,e'_{ij}e'_{kl} + p\,e_{kk} - \frac{p^2}{2K} - b_i u_i\right) dV - \int T_i u_i\, dS.$$

The deviatoric energy $D'_{ijkl}\,e'_{ij}e'_{kl}/2$ is bounded.

HAMILTON’S PRINCIPLE,
WAVE PROPAGATION,
APPLICATIONS OF GENERALIZED
COORDINATES


In dynamics, the counterpart of the minimum potential energy theorem 
is Hamilton’s principle. In this chapter we shall discuss this important 
principle and its applications to vibrations and wave propagations in beams. 

Toward the end of the chapter a brief discussion of computational methods of solving variational problems is given. The basic idea is to apply the concept of generalized coordinates to obtain approximate solutions for a continuous system by reducing it to one with a finite number of degrees of freedom. Several important methods, those of Euler, Rayleigh-Ritz-Galerkin, and Kantorovich, will be outlined. Much more about computational mechanics is given later in Chapters 17-21.


11.1. HAMILTON’S PRINCIPLE 


For an oscillating body, with displacements $u_i$ so small that the acceleration is given by $\partial^2 u_i/\partial t^2$ in Eulerian coordinates (Sec. 5.2), the equation of small motion is

$$\sigma_{ji,j} + F_i = \rho \frac{\partial^2 u_i}{\partial t^2}, \tag{1}$$

where $\rho$ is the density of the material and $F_i$ is the body force per unit volume. Let us again consider virtual displacements as specified in Sec. 10.7, but instead of a body in static equilibrium we now consider a vibrating body. The variations $\delta u_i$ must vanish over the boundary surface $S_u$, where values of displacements are prescribed; but are arbitrary, triply differentiable over the domain $V$; and are arbitrary also over the boundary surface $S_\sigma$, where surface tractions are prescribed. The total boundary surface of the volume $V$ is $S$, and $S = S_u + S_\sigma$.

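The virtual work done by the body and surface forces is, as in Sec. 10.7,

$$\int_V F_i\,\delta u_i\,dv + \int_S \overset{\nu}{T}_i\,\delta u_i\,dS.$$

The last integral can be transformed on introducing $\overset{\nu}{T}_i = \sigma_{ij}\nu_j$ and using Gauss’ theorem, as in Sec. 10.7:

$$\int_S \overset{\nu}{T}_i\,\delta u_i\,dS = \int_S \sigma_{ij}\nu_j\,\delta u_i\,dS = \int_V (\sigma_{ij}\,\delta u_i)_{,j}\,dv = \int_V \sigma_{ij,j}\,\delta u_i\,dv + \int_V \sigma_{ij}\,\delta u_{i,j}\,dv.$$

By Eq. (1), and by the symmetry of $\sigma_{ij}$, the right-hand side is equal to

$$\int_V \left(\rho \frac{\partial^2 u_i}{\partial t^2} - F_i\right)\delta u_i\,dv + \int_V \sigma_{ij}\,\delta e_{ij}\,dv.$$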


Therefore, we obtain the variational equation of motion

$$\int_V \sigma_{ij}\,\delta e_{ij}\,dv = \int_V \left(F_i - \rho \frac{\partial^2 u_i}{\partial t^2}\right)\delta u_i\,dv + \int_S \overset{\nu}{T}_i\,\delta u_i\,dS. \tag{2}$$

As before, this general equation can be stated more concisely if we introduce various levels of restrictions. Thus, if the body is perfectly elastic and a strain energy function $W$ exists, then the variational equation of motion can be written as

$$\delta \int_V W\,dv = \int_V \left(F_i - \rho \frac{\partial^2 u_i}{\partial t^2}\right)\delta u_i\,dv + \int_S \overset{\nu}{T}_i\,\delta u_i\,dS. \tag{3}$$

The variations $\delta u_i$ are assumed to vanish over the part of the boundary $S_u$ where surface displacements are prescribed. Hence, the limit for the surface integral can be replaced by $S_\sigma$. If the variations $\delta u_i$ were identified with the actual displacements $(\partial u_i/\partial t)\,dt$, then the result above states that, in an arbitrary time interval, the sum of the energy of deformation and the kinetic energy increases by an amount that is equal to the work done by the external forces during the same time interval.

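If the virtual displacements $\delta u_i$ are regarded as functions of time and space, not to be identified with the actual displacements, and the variational equation of motion (3) is integrated with respect to time between two arbitrary instants $t_0$ and $t_1$, an important variational principle for the moving body can be derived. We have

$$\int_{t_0}^{t_1}\!\!\int_V \delta W\,dv\,dt = \int_{t_0}^{t_1}\!\!\int_V F_i\,\delta u_i\,dv\,dt + \int_{t_0}^{t_1}\!\!\int_{S_\sigma} \overset{\nu}{T}_i\,\delta u_i\,dS\,dt - \int_{t_0}^{t_1}\!\!\int_V \rho \frac{\partial^2 u_i}{\partial t^2}\,\delta u_i\,dv\,dt. \tag{4}$$

Calling the integral in the last term $J$, inverting the order of integration, and integrating by parts, we obtain

$$J = \int_V \left[\rho \frac{\partial u_i}{\partial t}\,\delta u_i\right]_{t_0}^{t_1} dv - \int_{t_0}^{t_1}\!\!\int_V \frac{\partial u_i}{\partial t}\left(\rho \frac{\partial\,\delta u_i}{\partial t} + \frac{\partial \rho}{\partial t}\,\delta u_i\right) dv\,dt. \tag{5}$$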


The $\partial\rho/\partial t$ term can be ignored because $\partial\rho/\partial t = -\partial(\rho\,\partial u_i/\partial t)/\partial x_i$ according to the equation of continuity, and thus the term $\delta u_i\,\partial\rho/\partial t$ is an order of magnitude smaller than other terms in this equation. Let us now impose the restriction that at the time $t_0$ and $t_1$, the variations $\delta u_i$ are zero at all points of the body; i.e.,

$$\delta u_i(t_0) = \delta u_i(t_1) = 0 \quad \text{in } V. \tag{6}$$


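Then

$$J = -\int_{t_0}^{t_1}\!\!\int_V \rho \frac{\partial u_i}{\partial t}\frac{\partial\,\delta u_i}{\partial t}\,dv\,dt = -\int_{t_0}^{t_1}\!\!\int_V \delta\!\left(\frac{1}{2}\rho \frac{\partial u_i}{\partial t}\frac{\partial u_i}{\partial t}\right) dv\,dt = -\int_{t_0}^{t_1} \delta K\,dt, \tag{7}$$

where

$$K = \frac{1}{2}\int_V \rho \frac{\partial u_i}{\partial t}\frac{\partial u_i}{\partial t}\,dv \tag{8}$$

is the kinetic energy of the moving body. Therefore, under the assumption (6), Eq. (4) becomes

$$\int_{t_0}^{t_1} \delta(U - K)\,dt = \int_{t_0}^{t_1}\!\!\int_V F_i\,\delta u_i\,dv\,dt + \int_{t_0}^{t_1}\!\!\int_{S_\sigma} \overset{\nu}{T}_i\,\delta u_i\,dS\,dt, \tag{9}$$

where $U$ represents the total strain energy of the body,

$$U = \int_V W\,dv.$$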

If the external forces acting on the body are such that the sum of the integrals on the right-hand side of Eq. (9) represents the variation of a single function, the potential energy of the loading, $A$,

$$\int_V F_i\,\delta u_i\,dv + \int_{S_\sigma} \overset{\nu}{T}_i\,\delta u_i\,dS = -\delta A, \tag{10}$$

then Eq. (9) can be written as

$$\delta \int_{t_0}^{t_1} (U - K + A)\,dt = 0. \tag{11}$$

The term

$$L = U - K + A \tag{12}$$

(or sometimes $-L$) is called the Lagrangian function and Eq. (11) represents Hamilton’s principle, which states that:

The time integral of the Lagrangian function over a time interval $t_0$ to $t_1$ is an extremum for the “actual” motion with respect to all admissible virtual displacements which vanish, first, at instants of time $t_0$ and $t_1$ at all points of the body, and, second, over $S_u$, where the displacements are prescribed, throughout the entire time interval.

To formulate this principle in another way, let us call $u_i(x_1, x_2, x_3; t)$ a dynamic path. Then Hamilton’s principle states that among all dynamic paths that satisfy the boundary conditions over $S_u$ at all times and that start and end with the actual values at two arbitrary instants of time $t_0$ and $t_1$ at every point of the body, the “actual” dynamic path is distinguished by making the time integral of the Lagrangian function an extremum.

In rigid body dynamics the term U drops out, and we obtain Hamilton’s 
principle in the familiar form. The symbol A replaces the usual symbol V 
in books on dynamics because we have used V for something else. 

Note that the potential energy $A$ of the external loads exists and is a linear function of the displacements if the loads are independent of the elastic displacements, as is commonly the case. In aeroelastic problems, however, the aerodynamic loading is sensitive to the small surface displacements $u_i$; moreover, it depends on the time history of the displacements and cannot be derived from a potential. Hence, in aeroelasticity we are generally forced to use the variational form (9) of Hamilton’s principle.

In some applications of the direct method of calculation, it is even desirable to liberalize the variations $\delta u_i$ at the instants $t_0$ and $t_1$ and use Hamilton’s principle in the variational form (4), which cannot be expressed elegantly as the minimum of a well-defined functional. On the other hand, such a formulation will be accessible to the direct methods of solution. On introducing Eqs. (5), (7), and (10), we may rewrite Eq. (4) in the form:


$$\int_{t_0}^{t_1} \delta(U - K + A)\,dt = \int_{t_0}^{t_1}\!\!\int_V F_i\,\delta u_i\,dv\,dt + \int_{t_0}^{t_1}\!\!\int_{S_\sigma} \overset{\nu}{T}_i\,\delta u_i\,dS\,dt - \left[\int_V \rho \frac{\partial u_i}{\partial t}\,\delta u_i\,dv\right]_{t_0}^{t_1}. \tag{13}$$

Here $U$ is the total strain energy, $K$ is the total kinetic energy, $A$ is the potential energy for the conservative external forces, $F_i$ and $\overset{\nu}{T}_i$ are, respectively, those external body and surface forces that are not included in $A$, and $\delta u_i$ are the virtual displacements.

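Problem 11.1. Prove the converse theorem that, for a conservative system, the variational Eq. (11) leads to the equation of motion

$$\rho \frac{\partial^2 u_i}{\partial t^2} = F_i + \frac{\partial}{\partial x_j}\frac{\partial W}{\partial e_{ij}}$$

and the boundary conditions

$$\text{either}\quad \delta u_i = 0 \quad\text{or}\quad \frac{\partial W}{\partial e_{ij}}\,\nu_j = \overset{\nu}{T}_i.$$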

11.2. EXAMPLE OF APPLICATION — EQUATION 
OF VIBRATION OF A BEAM 

As an example of the application of Hamilton’s principle in the formulation of approximate theories in elasticity, let us consider the free, lateral vibration of a straight simple beam. We assume that the beam possesses principal planes and that the vibration takes place in one of the principal planes, and let $y$ denote the small deflection of the neutral axis of the beam from its initial, straight configuration. In Sec. 10.8, it is shown that the strain energy of the beam is, for small deflections,

$$U = \frac{1}{2}\int_0^l EI \left(\frac{\partial^2 y}{\partial x^2}\right)^2 dx, \tag{1}$$

where $E$ is the Young’s modulus of the beam material, $I$ is the cross-sectional moment of inertia, and $l$ is the length of the beam.

The kinetic energy of the beam is derived partly from the translation, parallel to $y$, of the elements composing it, and partly from the rotation of the same elements about an axis perpendicular to the neutral axis and the plane of vibration. The former part is

$$\frac{1}{2}\int_0^l m \left(\frac{\partial y}{\partial t}\right)^2 dx,$$

where $m$ is the mass per unit length of the beam. The latter part is, for each element $dx$, the product of moment of inertia times one-half of the square of the angular velocity. Let $I_p$ denote the mass moment of inertia about the neutral axis per unit length of the beam. The angular velocity being $\partial^2 y/\partial t\,\partial x$, the kinetic energy of the beam is

$$K = \frac{1}{2}\int_0^l m \left(\frac{\partial y}{\partial t}\right)^2 dx + \frac{1}{2}\int_0^l I_p \left(\frac{\partial^2 y}{\partial x\,\partial t}\right)^2 dx. \tag{2}$$

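If the beam is loaded by a distributed lateral load of intensity $p(x, t)$ per unit length and moment and shear $M$ and $Q$, respectively, at the ends as shown in Fig. 11.2:1, then the potential energy of the external loading is

$$A = -\int_0^l p(x,t)\,y(x)\,dx - M_l \left(\frac{\partial y}{\partial x}\right)_l + M_0 \left(\frac{\partial y}{\partial x}\right)_0 + Q_l y_l - Q_0 y_0. \tag{3}$$

The equation of motion is given by Hamilton’s principle:

$$\delta \int_{t_0}^{t_1} \left\{ \frac{1}{2}\int_0^l \left[ EI \left(\frac{\partial^2 y}{\partial x^2}\right)^2 - m \left(\frac{\partial y}{\partial t}\right)^2 - I_p \left(\frac{\partial^2 y}{\partial x\,\partial t}\right)^2 \right] dx - \int_0^l p\,y\,dx - M_l \left(\frac{\partial y}{\partial x}\right)_l + M_0 \left(\frac{\partial y}{\partial x}\right)_0 + Q_l y_l - Q_0 y_0 \right\} dt = 0. \tag{4}$$

Following the usual procedure of the calculus of variations, noting that the virtual displacement must be so specified that $\delta y = 0$ at $t_0$ and $t_1$, and, hence, $\partial(\delta y)/\partial x = \delta(\partial y/\partial x) = 0$ at $t_0$ and $t_1$, we obtain

$$\int_{t_0}^{t_1} \left\{ \int_0^l \left[ EI \frac{\partial^2 y}{\partial x^2}\frac{\partial^2 \delta y}{\partial x^2} - m \frac{\partial y}{\partial t}\frac{\partial\,\delta y}{\partial t} - I_p \frac{\partial^2 y}{\partial x\,\partial t}\frac{\partial^2 \delta y}{\partial x\,\partial t} - p\,\delta y \right] dx - M_l\,\delta\!\left(\frac{\partial y}{\partial x}\right)_l + M_0\,\delta\!\left(\frac{\partial y}{\partial x}\right)_0 + Q_l\,\delta y_l - Q_0\,\delta y_0 \right\} dt = 0. \tag{5}$$

Integrating by parts, we obtain

$$\int_{t_0}^{t_1}\!\!\int_0^l \left[ \frac{\partial^2}{\partial x^2}\left(EI \frac{\partial^2 y}{\partial x^2}\right) + m \frac{\partial^2 y}{\partial t^2} - \frac{\partial^2}{\partial x\,\partial t}\left(I_p \frac{\partial^2 y}{\partial x\,\partial t}\right) - p \right] \delta y\,dx\,dt$$
$$\quad + \int_{t_0}^{t_1} \left[ \left(EI \frac{\partial^2 y}{\partial x^2} - M\right) \delta\!\left(\frac{\partial y}{\partial x}\right) \right]_0^l dt - \int_{t_0}^{t_1} \left[ \left( \frac{\partial}{\partial x}\left(EI \frac{\partial^2 y}{\partial x^2}\right) - \frac{\partial}{\partial t}\left(I_p \frac{\partial^2 y}{\partial x\,\partial t}\right) - Q \right) \delta y \right]_0^l dt = 0. \tag{6}$$

Hence, the Euler equation of motion is

$$\frac{\partial^2}{\partial x^2}\left(EI \frac{\partial^2 y}{\partial x^2}\right) + m \frac{\partial^2 y}{\partial t^2} - \frac{\partial^2}{\partial x\,\partial t}\left(I_p \frac{\partial^2 y}{\partial x\,\partial t}\right) = p, \tag{7}$$

and a proper set of boundary conditions at each end is

$$\text{either}\quad EI \frac{\partial^2 y}{\partial x^2} = M \quad\text{or}\quad \delta\!\left(\frac{\partial y}{\partial x}\right) = 0 \tag{8a}$$

and

$$\text{either}\quad \frac{\partial}{\partial x}\left(EI \frac{\partial^2 y}{\partial x^2}\right) - \frac{\partial}{\partial t}\left(I_p \frac{\partial^2 y}{\partial x\,\partial t}\right) = Q \quad\text{or}\quad \delta y = 0. \tag{8b}$$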


These are equations governing the motion of a beam including the effect of the rotary inertia, due to Lord Rayleigh, and known as Rayleigh’s equations. If the rotary inertia is neglected and if the beam is uniform, then the governing equation is simplified into:

$$\frac{\partial^2 y}{\partial t^2} + c_0^2 R^2 \frac{\partial^4 y}{\partial x^4} = \frac{1}{m}\,p, \tag{9}$$

where

$$c_0^2 = \frac{E}{\rho}, \qquad R^2 = \frac{I}{A}. \tag{10}$$

The constant $c_0$ has the dimension of speed and can be identified as the phase velocity of longitudinal waves in a uniform bar.† $R$ is the radius of gyration of the cross section. $A$ is the cross-sectional area, so that $m = \rho A$.

† See Problem 11.2, p. 390.





In the special case of a uniform beam of infinite length free from lateral loading, $p = 0$, Eq. (9) becomes

$$\frac{\partial^2 y}{\partial t^2} + c_0^2 R^2 \frac{\partial^4 y}{\partial x^4} = 0. \tag{11}$$

It admits a solution in the form

$$y = a \sin \frac{2\pi}{\lambda}(x - ct), \tag{12}$$

which represents a progressive wave of phase velocity $c$ and wavelength $\lambda$. On substituting Eq. (12) into Eq. (11), we obtain the relation

$$c = \pm c_0 R\,\frac{2\pi}{\lambda}, \tag{13}$$

which states that the phase velocity depends on the wavelength and that it tends to infinity for very short wavelengths. Somewhat disconcerting is the fact that, according to Eq. (13), the group velocity (see Sec. 11.3) also tends to infinity as the wavelength tends to zero. Since group velocity is the velocity at which energy is transmitted, this result is physically unreasonable. If Eq. (13) were correct, then the effect of a suddenly applied concentrated load would be felt at once everywhere in the beam, as the Fourier representation for a concentrated load contains harmonic components with infinitesimal wavelength, and hence infinite wave speed. Thus, Eq. (11) cannot be very accurate in describing the effect of impact loads on a beam.
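As a numerical illustration of Eq. (13), the following minimal sketch evaluates the phase velocity $c = 2\pi c_0 R/\lambda$ and estimates the group velocity $d\omega/dk$ by a central difference, using $\omega = c_0 R k^2$ with $k = 2\pi/\lambda$ (which follows from substituting Eq. (12) into Eq. (11)). The function names and the sample values of $c_0$ and $R$ are assumptions chosen only for illustration.

```python
import math

def omega(k, c0, R):
    """Circular frequency implied by Eq. (11) for y = a sin k(x - ct): omega = c0*R*k**2."""
    return c0 * R * k * k

def phase_velocity(lam, c0, R):
    """Eq. (13), positive root: c = c0 * R * 2*pi / lambda."""
    return c0 * R * 2.0 * math.pi / lam

def group_velocity(lam, c0, R, rel_step=1e-6):
    """Central-difference estimate of d(omega)/dk at k = 2*pi/lambda."""
    k = 2.0 * math.pi / lam
    dk = rel_step * k
    return (omega(k + dk, c0, R) - omega(k - dk, c0, R)) / (2.0 * dk)

if __name__ == "__main__":
    c0, R = 5000.0, 0.01  # assumed values: bar speed in m/s, radius of gyration in m
    for lam in (1.0, 0.1, 0.01):
        c = phase_velocity(lam, c0, R)
        U = group_velocity(lam, c0, R)
        print(f"lambda = {lam:5.2f} m   c/c0 = {c / c0:8.4f}   U/c = {U / c:6.3f}")
```

In this elementary theory the group velocity comes out as twice the phase velocity at every wavelength, so both grow without bound as $\lambda \to 0$.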

This difficulty of infinite wave speed is removed by the inclusion of the rotary inertia. However, the speed versus wavelength relationship obtained from Rayleigh’s Eq. (7) for a uniform beam of circular cross section with radius $a$, as is shown in Fig. 11.2:2, still deviates appreciably from Pochhammer and Chree’s results, which were derived from the exact three-dimensional linear elasticity theory. A much better approximation is obtained by including the shear deflection of the beam, as was first shown by Timoshenko.
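To see how rotary inertia bounds the wave speed, one may substitute the progressive wave (12) into Rayleigh’s Eq. (7) for a uniform beam with $p = 0$. The short derivation below is a sketch under the assumptions $m = \rho A$ and $I_p = \rho A R^2$ (the values for a beam of uniform material), with $k = 2\pi/\lambda$ introduced here as shorthand.

```latex
% Substituting y = a \sin k(x - ct) into
%   EI y_{xxxx} + m y_{tt} - I_p y_{xxtt} = 0
EI\,k^4 - m\,k^2 c^2 - I_p\,k^4 c^2 = 0
\quad\Longrightarrow\quad
c^2 = \frac{EI\,k^2}{m + I_p k^2}
    = c_0^2\,\frac{R^2 k^2}{1 + R^2 k^2}.

% Limits:
%   R k \ll 1:  c \approx c_0 R k   (recovers Eq. (13))
%   R k \gg 1:  c \to c_0           (finite)

% Group velocity, with \omega = c k:
U = \frac{d\omega}{dk}
  = c_0\,\frac{R k\,(2 + R^2 k^2)}{(1 + R^2 k^2)^{3/2}}
  \;\to\; c_0 \quad (R k \to \infty).
```

Under these assumptions both the phase and the group velocity of the Rayleigh beam remain finite and approach $c_0$ at short wavelengths.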

To incorporate the shear deformation, we note that the slope of the deflection curve depends not only on the rotation of cross sections of the beam but also on the shear. Let $\psi$ denote the slope of the deflection curve when the shearing force is neglected and $\beta$ the angle of shear at the neutral axis in the same cross section. Then the total slope is

$$\frac{\partial y}{\partial x} = \psi + \beta. \tag{14}$$

Fig. 11.2:2. Phase velocity curves for flexural elastic waves in a circular cylinder of radius $a$.

Fig. 11.2:3. A Timoshenko beam element.


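The strain energy due to bending, Eq. (1), must be replaced by

$$\frac{1}{2}\int_0^l EI \left(\frac{\partial \psi}{\partial x}\right)^2 dx, \tag{15}$$

because the internal bending moment does no work when shear deformation takes place (see Fig. 11.2:3). The strain energy due to shearing strain $\beta$ must be a quadratic function of $\beta$ if linear elasticity is assumed. We shall write

$$\frac{1}{2}\int_0^l k \beta^2\,dx \tag{16}$$

for the strain energy for shear. The kinetic energy is

$$K = \frac{1}{2}\int_0^l m \left(\frac{\partial y}{\partial t}\right)^2 dx + \frac{1}{2}\int_0^l I_p \left(\frac{\partial \psi}{\partial t}\right)^2 dx, \tag{17}$$

because the translational velocity is $\partial y/\partial t$, but the angular velocity is $\partial \psi/\partial t$. Hence, Hamilton’s principle states that

$$\delta \int_{t_0}^{t_1} \left\{ \frac{1}{2}\int_0^l \left[ EI \left(\frac{\partial \psi}{\partial x}\right)^2 + k \left(\frac{\partial y}{\partial x} - \psi\right)^2 - m \left(\frac{\partial y}{\partial t}\right)^2 - I_p \left(\frac{\partial \psi}{\partial t}\right)^2 \right] dx + A \right\} dt = 0, \tag{18}$$

where $A$ is given by Eq. (3) except that $\partial y/\partial x$ at the ends is to be replaced by $\psi$. The virtual displacements now consist of $\delta y$ and $\delta \psi$, which must vanish at $t_0$ and $t_1$ and also where displacements are prescribed. On carrying out the calculations, the following two Euler equations are obtained:

$$\frac{\partial}{\partial x}\left(EI \frac{\partial \psi}{\partial x}\right) + k \left(\frac{\partial y}{\partial x} - \psi\right) - I_p \frac{\partial^2 \psi}{\partial t^2} = 0, \tag{19a}$$

$$m \frac{\partial^2 y}{\partial t^2} - \frac{\partial}{\partial x}\left[k \left(\frac{\partial y}{\partial x} - \psi\right)\right] - p = 0. \tag{19b}$$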


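The appropriate boundary conditions are, at each end of the beam,

$$\text{either}\quad EI \frac{\partial \psi}{\partial x} = M \quad\text{or}\quad \delta \psi = 0, \tag{20a}$$

and

$$\text{either}\quad k \left(\frac{\partial y}{\partial x} - \psi\right) = -Q \quad\text{or}\quad \delta y = 0. \tag{20b}$$

These are the differential equations and boundary conditions of the so-called Timoshenko beam theory.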

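For a uniform beam, $EI$, $k$, $m$, etc., are constants, and the function $\psi$ can be eliminated from the equations above to obtain the well-known Timoshenko equation for lateral vibration of prismatic beams,

$$EI \frac{\partial^4 y}{\partial x^4} + m \frac{\partial^2 y}{\partial t^2} - \left(I_p + \frac{EI\,m}{k}\right) \frac{\partial^4 y}{\partial x^2\,\partial t^2} + I_p \frac{m}{k} \frac{\partial^4 y}{\partial t^4} = p + \frac{I_p}{k}\frac{\partial^2 p}{\partial t^2} - \frac{EI}{k}\frac{\partial^2 p}{\partial x^2}. \tag{21}$$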


So far we have not discussed the constants $m$, $I_p$, and $k$. For a beam of uniform material, $m = \rho A$, $I_p = \rho A R^2$, where $\rho$ is the mass density of the beam material, $A$ is the cross-sectional area, and $R$ is the radius of gyration of the cross section about an axis perpendicular to the plane of motion and through the neutral axis. But $k$ depends on the distribution of shearing stress in the beam cross section. Timoshenko writes

$$k = k' A G, \tag{22}$$

where $G$ is the shear modulus of elasticity and $k'$ is a numerical factor depending on the shape of the cross section, and ascertains that according to the elementary beam theory, $k' = \frac{2}{3}$ for a rectangular cross section. The use of such a value of $k$ is, however, a subject of controversy in the literature. Mindlin$^{11.1}$ suggests that the value of $k$ can be so selected that the solution of Eq. (21) be made to agree with certain solutions of the exact three-dimensional equations of Pochhammer (1876) and Chree (1889) (see Love, 11 Elasticity, 4th edition, pp. 287-92). Indeed, $I_p$, which arises from the assumption that plane sections remain plane in bending, may also be regarded, when such an assumption is relaxed, as an empirical factor to be determined by comparison with exact solutions.

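For a uniform beam free from lateral loadings, Eq. (21) can be written as

$$\frac{\partial^4 y}{\partial x^4} - \left(\frac{1}{c_0^2} + \frac{1}{c_Q^2}\right) \frac{\partial^4 y}{\partial x^2\,\partial t^2} + \frac{1}{c_0^2 c_Q^2} \frac{\partial^4 y}{\partial t^4} + \frac{1}{c_0^2 R^2} \frac{\partial^2 y}{\partial t^2} = 0, \tag{23}$$

where

$$c_0^2 = \frac{EI}{I_p} = \frac{E}{\rho}, \qquad c_Q^2 = \frac{k' G}{\rho}, \qquad R^2 = \frac{I}{A}. \tag{24}$$

If the beam is of infinite length, a solution of the form (12) may be substituted into (23), and we see that the wave speed $c$ must satisfy the equation

$$1 - \left(\frac{c^2}{c_0^2} + \frac{c^2}{c_Q^2}\right) + \frac{c^4}{c_0^2 c_Q^2} - \frac{c^2}{c_0^2 R^2}\left(\frac{\lambda}{2\pi}\right)^2 = 0. \tag{25}$$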


The solution of this equation for $c/c_0$ versus $\lambda$ yields two branches, corresponding to two modes of motion (two different shear-to-bending deflection ratios for the same wavelength). They are plotted in Fig. 11.2:4 for the special case of a beam of circular cross section with radius $a$. The results of the exact solution of Pochhammer and Chree for Poisson’s ratio $\nu = 0.29$ are also plotted there for comparison. It is seen that the Timoshenko theory agrees reasonably well with the exact theory in the first mode, but wide discrepancy occurs in the second mode. The approximate theory gives no information about higher modes, an infinite number of which exist in the exact theory.
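The two branches can be sketched numerically. Writing $x = (c/c_0)^2$, $s = (2\pi R/\lambda)^2$ and $\alpha = c_0^2/c_Q^2 = E/(k'G)$, the wave-speed equation of the Timoshenko theory for $p = 0$ reduces to the quadratic $\alpha x^2 - (1 + \alpha + 1/s)\,x + 1 = 0$. The function name, the value of $k'$, and the sample wavelengths below are assumptions for illustration only; they are not the values behind Fig. 11.2:4.

```python
import math

def timoshenko_phase_velocities(lam_over_R, alpha):
    """Return (c/c0) on the lower and upper branch of the Timoshenko theory.

    With x = (c/c0)**2 and s = (2*pi*R/lambda)**2, the wave-speed equation is
        alpha*x**2 - (1 + alpha + 1/s)*x + 1 = 0,   alpha = (c0/cQ)**2.
    """
    s = (2.0 * math.pi / lam_over_R) ** 2
    b = 1.0 + alpha + 1.0 / s
    disc = math.sqrt(b * b - 4.0 * alpha)  # always real: b**2 - 4*alpha >= (1 - alpha)**2
    x_high = (b + disc) / (2.0 * alpha)
    x_low = 1.0 / (alpha * x_high)         # product of the roots is 1/alpha
    return math.sqrt(x_low), math.sqrt(x_high)

if __name__ == "__main__":
    nu, k_prime = 0.29, 0.85               # k_prime is an assumed shear factor
    alpha = 2.0 * (1.0 + nu) / k_prime     # E/(k'G) with E = 2(1 + nu)G
    for a_over_lam in (0.05, 0.2, 0.5, 1.0):
        lam_over_R = 2.0 / a_over_lam      # solid circular section: R = a/2
        lo, hi = timoshenko_phase_velocities(lam_over_R, alpha)
        print(f"a/lambda = {a_over_lam:4.2f}   c/c0 = {lo:6.4f} (first), {hi:7.4f} (second)")
```

For long waves the lower root approaches the elementary result $c = 2\pi c_0 R/\lambda$; for very short waves the two roots approach $c_Q$ and $c_0$.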

The equations derived above are, of course, appropriate for the determination of the free-vibration modes and frequencies of a beam. The effects of rotary inertia and shear are unimportant if the wavelength of the vibration mode is large compared with the cross-sectional dimensions of the beam; but these effects become more and more important with a decrease of wavelength, i.e., with an increase in the frequency of vibration. In the example of a uniform beam with rectangular cross section and simply supported at both ends, with $E = \frac{8}{3}G$ and $k' = \frac{2}{3}$, we find that the shear deflection and rotary inertia reduce the natural frequencies. If the wavelength is ten times larger than the depth of the beam, the correction on the frequency due to rotary inertia alone is about 0.4 per cent, and the correction due to rotary inertia and shear together will be about 2 per cent. For a survey of literature, see Abramson, Plass, and Ripperger.$^{11.1}$

Fig. 11.2:4. Phase velocity curves for flexural elastic waves in a solid circular cylinder of radius $a$. (From Abramson,$^{11.1}$ J. Acoust. Soc. Am., 1957.)
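The quoted frequency corrections for the simply supported rectangular beam can be checked with a short calculation. The sketch below rests on two assumptions that should be read as such: $E/(k'G) = 4$ (corresponding to $E = \frac{8}{3}G$ and $k' = \frac{2}{3}$), and the "wavelength" is taken as the distance between adjacent nodes of the mode (the span divided by the mode number), which is the reading that reproduces the figures of about 0.4 and 2 per cent. The function name is hypothetical.

```python
import math

def frequency_ratio(node_spacing_over_depth, E_over_kG, include_shear=True):
    """Corrected frequency divided by the elementary (Euler-Bernoulli) frequency.

    Rectangular section of depth h, so R**2 = h**2 / 12; the mode shape is
    sin(k x) with k = pi / (node spacing).  x below stands for (omega/(c0*k))**2.
    """
    s = (math.pi / node_spacing_over_depth) ** 2 / 12.0   # (R*k)**2
    if include_shear:
        alpha = E_over_kG
        b = 1.0 + alpha + 1.0 / s
        x_high = (b + math.sqrt(b * b - 4.0 * alpha)) / (2.0 * alpha)
        x = 1.0 / (alpha * x_high)       # lower root of alpha*x**2 - b*x + 1 = 0
    else:
        x = s / (1.0 + s)                # rotary inertia only (Rayleigh beam)
    return math.sqrt(x / s)              # elementary theory gives x = s

if __name__ == "__main__":
    rotary = 1.0 - frequency_ratio(10.0, 4.0, include_shear=False)
    both = 1.0 - frequency_ratio(10.0, 4.0, include_shear=True)
    print(f"rotary inertia alone:     {100.0 * rotary:.2f} per cent reduction")
    print(f"rotary inertia and shear: {100.0 * both:.2f} per cent reduction")
```

If the full wavelength (twice the node spacing) were meant instead, the same formulas would give noticeably smaller corrections, so the interpretation of "wavelength" matters here.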


PROBLEMS 


11.2. Consider the free longitudinal vibration of a rod of uniform cross section and length $L$, as shown in Fig. P11.2. Let us assume that plane cross sections remain plane, that only axial stresses are present, being uniformly distributed over the cross section, and that radial displacements are negligible (i.e., the displacements consist of only one nonvanishing component $u$ in the $x$-direction). Derive expressions for the potential and kinetic energies and show that the equation of motion is

$$\frac{\partial^2 u}{\partial x^2} = \frac{1}{c_0^2}\frac{\partial^2 u}{\partial t^2}, \qquad c_0^2 = \frac{E}{\rho}. \tag{26}$$

Show that the general solution is of the form

$$u = f(x - c_0 t) + F(x + c_0 t), \tag{27}$$

where $f$ and $F$ are two arbitrary functions.


Fig. P11.2. Longitudinal vibration of a rod.

11.3. Consider the same problem as above, but now incorporate approximately the transverse inertia associated with the lateral expansion or contraction connected with axial compression and extension, respectively. Let the (Love’s) assumption be made that the displacement in the radial direction $v$ is proportional to the radial coordinate $r$, measured from a centroidal axis, and to the axial strain $\partial u/\partial x$; i.e.,

$$v = -\nu r \frac{\partial u}{\partial x}, \tag{28}$$

where $\nu$ is Poisson’s ratio. Derive expressions of the kinetic and potential energies and obtain the equation of motion according to Hamilton’s principle,

$$\frac{\partial^2 u}{\partial t^2} - (\nu R)^2 \frac{\partial^4 u}{\partial x^2\,\partial t^2} = \frac{E}{\rho}\frac{\partial^2 u}{\partial x^2}, \tag{29}$$

where $R$ is the polar radius of gyration of the cross section. The natural boundary condition at the end $x = 0$, if that end is subjected to a stress $\sigma_0(t)$, is

$$E \frac{\partial u}{\partial x} + \rho \nu^2 R^2 \frac{\partial^3 u}{\partial x\,\partial t^2} = \sigma_0(t) \quad \text{at } x = 0. \tag{30}$$

Note: It is important to note that, according to the last equation, the familiar proportionality between axial stress $\sigma$ and axial strain $\partial u/\partial x$ does not exist in this theory.

A comparison of the dispersion curves obtained from the elementary theory (Problem 11.2), the Love theory, the Pochhammer-Chree “exact” theory, and another approximate theory due to Mindlin and Herrmann$^{11.1}$ is shown in Fig. P11.3. The last-mentioned theory accounts for the strain energy associated with the transverse displacement $v$, of which the most important contribution comes from the shearing strain caused by the lateral expansion of the cross section near a wave front.

Fig. P11.3. Phase velocity curves for longitudinal elastic waves in a solid circular cylinder of radius $a$ ($a$ = radius of bar, $\lambda$ = wavelength). (After Abramson et al., Adv. Applied Mech., 5, 1958.)

11.4. The method of derivation of the various forms of equations of motion of beams as presented above has the advantage of being straightforward, but it does not convey the physical concepts as clearly as in an elementary derivation. Hence, rederive the basic equations by considering the forces that act on an element of length $dx$, as shown in Fig. P11.2 and Fig. P11.4. Obtain the following equations, and then derive the wave equations by proper reductions.

Longitudinal waves, elementary theory (Fig. P11.2):

$$\frac{\partial \sigma_{xx}}{\partial x} = \rho \frac{\partial^2 u}{\partial t^2} \quad \text{(equation of motion)}, \tag{31}$$

$$\frac{\partial^2 u}{\partial x\,\partial t} = \frac{\partial \epsilon_{xx}}{\partial t} \quad \text{(equation of strain)}, \tag{32}$$

$$\sigma_{xx} = E \epsilon_{xx} \quad \text{(equation of material behavior)}, \tag{33}$$

where $\sigma_{xx}$ = axial stress, $\epsilon_{xx}$ = axial strain, $\partial u/\partial t$ = axial particle velocity, $x$ = axial coordinate, $t$ = time, $E$ = modulus of elasticity, and $\rho$ = mass density.

Flexural waves, Timoshenko theory (Fig. P11.4):

Equations of motion:

$$\frac{\partial M}{\partial x} - Q = \rho I \frac{\partial \omega}{\partial t} \quad \text{(rotational)}, \tag{34}$$

$$\frac{\partial Q}{\partial x} = \rho A \frac{\partial v}{\partial t} \quad \text{(transverse)}, \tag{35}$$

$$\frac{\partial K}{\partial t} = \frac{\partial \omega}{\partial x} \quad \text{(bending)}, \tag{36}$$

$$\frac{\partial \beta}{\partial t} = \frac{\partial v}{\partial x} + \omega \quad \text{(shear)}, \tag{37}$$

Equations of material behavior:

$$M = EIK \quad \text{(bending)}, \tag{38}$$

$$Q = G A_s \beta \quad \text{(shear)}, \tag{39}$$

where $M$ = moment, $Q$ = shear force, $K$ = axial rate of change of section angle $= -\partial \psi/\partial x$, $\beta$ = shear strain $= \partial y/\partial x - \psi$, $\omega$ = angular velocity of section $= -\partial \psi/\partial t$, $v$ = transverse velocity $= \partial y/\partial t$, $I$ = section moment of inertia, $A$ = section area, and $A_s$ = area parameter defined by $\iint \gamma(z)\,dA = \beta A_s$, where $\gamma(z)$ is the shear strain at a point $z$ in the cross section. $G$ = shear modulus.

11.3. GROUP VELOCITY 

Since we have been concerned in the preceding sections with wave propagation in beams, it seems appropriate to make a digression to explain the concept of group velocity as distinguished from the phase velocity. We have seen that for certain equations a solution of the following form exists:

$$u = a \sin(\mu x - \nu t). \tag{1}$$

If $x$ is increased by $2\pi/\mu$, or $t$ by $2\pi/\nu$, the sine function takes the same value as before, so that $\lambda = 2\pi/\mu$ is the wavelength and $T = 2\pi/\nu$ is the period of oscillation. If $\mu x - \nu t = \text{constant}$, i.e., $x = \text{const.} + \nu t/\mu$, the argument of the sine function remains constant in time, which means that the whole waveform is displaced towards the right with a velocity $c = \nu/\mu$. The quantity $c$ is called the phase velocity, in terms of which Eq. (1) may be exhibited as

$$u = a \sin \frac{2\pi}{\lambda}(x - ct). \tag{2}$$

If the phase velocity $c$ depends on the wavelength $\lambda$, the wave is said to exhibit dispersion. Our examples in the previous section show that dispersion exists in both longitudinal and flexural waves in rods and beams.

What happens when two sine waves of the same amplitude but slightly different wavelengths and frequencies are superposed? Let these two waves be characterized by two sets of slightly different values $\mu, \nu$ and $\mu', \nu'$. The resultant of the superposed waves is

$$u + u' = A[\sin(\mu x - \nu t) + \sin(\mu' x - \nu' t)].$$

Using the well-known formula

$$\sin\alpha + \sin\beta = 2 \sin\tfrac{1}{2}(\alpha + \beta)\cos\tfrac{1}{2}(\alpha - \beta),$$

we have

$$u + u' = 2A \sin\left[\tfrac{1}{2}(\mu + \mu')x - \tfrac{1}{2}(\nu + \nu')t\right] \cos\left[\tfrac{1}{2}(\mu - \mu')x - \tfrac{1}{2}(\nu - \nu')t\right]. \tag{3}$$

This expression represents the well-known phenomenon of “beats.” The sine factor represents a wave whose wave number and frequency are equal to the mean of $\mu, \mu'$ and $\nu, \nu'$, respectively. The cosine factor, which varies very slowly when $\mu - \mu'$, $\nu - \nu'$ are small, may be regarded as a varying amplitude, as shown in Fig. 11.3:1. The “wave group” ends wherever the cosine factor becomes zero. The velocity of advance of these points is called the group velocity; its value $U$ is equal to $(\nu - \nu')/(\mu - \mu')$. For long groups (or slow beats), the group velocity may be written with sufficient accuracy as

$$U = \frac{d\nu}{d\mu}. \tag{4}$$

In terms of the wavelength $\lambda\,(= 2\pi/\mu)$, we have

$$U = \frac{d(\mu c)}{d\mu} = c - \lambda \frac{dc}{d\lambda}, \tag{5}$$

where $c$ is the phase velocity.

Fig. 11.3:1. An illustration of a wave group.

From the fact that no energy can travel past the nodes, one can infer that the rate of transfer of energy is identical with the group velocity. This fact is capable of rigorous proof for single trains of waves.

The most familiar examples of propagation of wave groups are perhaps the water waves. It has often been noticed that when an isolated group of waves, of sensibly the same length, is advancing over relatively deep water, the velocity of the group as a whole is less than that of the individual waves composing it. If attention is fixed on a particular wave, it is seen to advance through the group, gradually dying out as it approaches the front, while its former place in the group is occupied in succession by other waves which have come forward from the rear. Another familiar example is the wave train set up by ships. The explanation as presented above seems to have been first given by Stokes (1876). Other derivations and interpretations of the group velocity concept can be found in Lamb, Hydrodynamics, (1879, New York: Dover Pub.) Secs. 236 and 237.
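The water-wave observation can be reproduced with the two expressions for the group velocity, $U = d\nu/d\mu$ and $U = c - \lambda\,dc/d\lambda$. The sketch below assumes the deep-water gravity-wave relation $\nu^2 = g\mu$, which is not derived in this text and is brought in only as an illustration; the function names are hypothetical and the derivatives are taken by central differences.

```python
import math

G = 9.81  # gravitational acceleration in m/s^2 (assumed value)

def phase_velocity(lam):
    """Deep-water waves (assumed relation nu**2 = g*mu): c = sqrt(g*lambda/(2*pi))."""
    return math.sqrt(G * lam / (2.0 * math.pi))

def group_velocity_from_wavelength(lam, rel_step=1e-6):
    """U = c - lambda * dc/dlambda, with dc/dlambda by central difference."""
    d = rel_step * lam
    dc = (phase_velocity(lam + d) - phase_velocity(lam - d)) / (2.0 * d)
    return phase_velocity(lam) - lam * dc

def group_velocity_from_wavenumber(lam, rel_step=1e-6):
    """U = d(nu)/d(mu), with nu = sqrt(g*mu) and mu = 2*pi/lambda."""
    mu = 2.0 * math.pi / lam
    d = rel_step * mu
    return (math.sqrt(G * (mu + d)) - math.sqrt(G * (mu - d))) / (2.0 * d)

if __name__ == "__main__":
    for lam in (1.0, 10.0, 100.0):
        c = phase_velocity(lam)
        print(f"lambda = {lam:6.1f} m   c = {c:7.3f} m/s   "
              f"U/c = {group_velocity_from_wavelength(lam) / c:.4f} "
              f"(from wavelength), {group_velocity_from_wavenumber(lam) / c:.4f} (from wave number)")
```

Both forms give a group velocity equal to half the phase velocity under this assumed relation, consistent with individual crests advancing through the group.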

If a concentrated lateral load is suddenly and impulsively applied on 
an infinitely long beam, the disturbance is propagated out along the beam 
by flexural waves. The initial loading may be regarded as composed of 
an infinite number of sine-wave components of all wavelengths but of the 
same amplitude, with proper phase relationship, so that they reinforce each 
other in the limited region where the force is applied, but cancel each other 
everywhere outside the region of load application. As time increases, these 
sine waves propagate with their own phase velocities, and the pattern of 
interference changes with time. Thus, at time t and at a point which 
is at a distance x from the initial loading, only a group of waves in the 
neighborhood of a specific wavelength can be seen, and the energy of this 
wave group is propagated at the group velocity U. 

From the dispersion curves of Fig. 11.2:4 for the phase velocity of flexural elastic waves in a circular cylinder of radius $a$, the group velocity curves of Fig. 11.3:2 for the two lowest modes can be constructed. These curves are obtained by Abramson$^{11.1}$ for the Pochhammer-Chree theory and are important in physical considerations. The number $c_1$ in Fig. 11.3:2 is the dilatational wave speed (see Sec. 7.8) given by the formula

$$c_1 = \sqrt{\frac{(1 - \nu)E}{(1 + \nu)(1 - 2\nu)\rho}}.$$

The number $c_0$ is $\sqrt{E/\rho}$. The group velocity is denoted by $c_g$. The ratio $c_1/c_0$ depends on the Poisson’s ratio $\nu$:

$$\nu = 0.25, \quad \frac{c_1}{c_0} = 1.095; \qquad \nu = 0.30, \quad \frac{c_1}{c_0} = 1.16.$$

Fig. 11.3:2. Group velocity curves for flexural elastic waves in a solid circular cylinder of radius $a$. (After Abramson, J. Acoust. Soc. Am., 1957.)

Note that the phase velocities in the higher branches exceed the dilatational wave velocity (at long wavelengths), but the group velocities do not. Similar group velocity curves can be constructed for longitudinal waves from Fig. P11.3. Note that the greatest possible velocity of energy transmission is the dilatational velocity, for both flexural and longitudinal waves.
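The quoted values of $c_1/c_0$ follow directly from the two speed formulas, since $E$ and $\rho$ cancel in the ratio. A minimal check (the function name is hypothetical):

```python
import math

def dilatational_to_bar_speed_ratio(nu):
    """c1/c0 = sqrt((1 - nu) / ((1 + nu) * (1 - 2*nu))), valid for -1 < nu < 0.5."""
    return math.sqrt((1.0 - nu) / ((1.0 + nu) * (1.0 - 2.0 * nu)))

if __name__ == "__main__":
    for nu in (0.25, 0.30):
        print(f"nu = {nu:.2f}   c1/c0 = {dilatational_to_bar_speed_ratio(nu):.3f}")
```

The ratio grows rapidly as $\nu$ approaches 0.5, where the dilatational speed of an incompressible material becomes unbounded.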

From Fig. 11.3:2 for a circular cylinder of radius $a$, it can be seen that for the first branch the group velocity reaches a maximum when $a/\lambda$ is between 0.25 and 0.30, where $\lambda$ is the wavelength. Hence, when a pulse is propagated along a beam in accordance with the first branch, Fourier components with $a/\lambda$ of about 0.25 to 0.30 (wavelengths of roughly 3.3 to 4 times the radius) will be found at the head of the pulse. A detailed study of the transmission of a pulse along the beam can be found in R. M. Davies’ paper.$^{11.2}$

We must remark that elastic substances are not inherently dispersive, 
but the dispersion effect appears in waves traveling through rods and beams 
because of wave reflections from the boundaries. Therefore, the dispersion 
in the case of an elastic medium is simply an interface phenomenon and 
not a physical property of the material. The phenomenon is analogous 
to that obtained in electrical wave guides and can be discussed in similar 
terms. In this light, we can answer the question of the existence of phase 
velocities greater than the dilatational velocity, although no disturbance can 
be propagated with a velocity greater than that. The point is that energy 
is propagated at the group velocity and that group velocities greater than 
the dilatational velocity will not occur. The phase velocities are merely 
velocities of propagation in the axial direction of the loci of constant phase 
in specific mode patterns and their values are thus not limited by physical 
considerations. 

11.4. HOPKINSON’S EXPERIMENT 

In Section 1.6, we mentioned that John Hopkinson published in 1872 an account of an interesting experiment, the explanation of which will help us understand the nature of elastic wave propagation and its significance in engineering. Hopkinson wanted to measure the strength of steel wires when they were suddenly stretched by a falling weight. A ball-shaped weight pierced by a hole was threaded on the wire and was dropped from a known height so that it struck a clamp attached to the bottom of the wire (Fig. 11.4:1). For a given weight he expected to find a critical height beyond which the falling weight would break the wire. Using different weights dropped from different heights, however, Hopkinson found that the minimum height from which a weight had to be dropped to break the wire was nearly independent of the size of the weight!

Fig. 11.4:1. A sketch of Hopkinson’s experiment.

Now, when different weights are dropped from a given height, the velocity
reached at the end is independent of the size of the weight. Hopkinson’s
result suggests that in breaking the wire it is the velocity of the loading end
that counts. Following this lead, Hopkinson explains his result on the basis
of elastic wave propagation (ignoring the effects of plasticity). Assuming
that the stress state in the wire is approximately one-dimensional, we see
from Eqs. (11.2:27), or (11.2:31)-(11.2:33), that the stress in the wire is
proportional to the particle velocity:

(1) $\sigma_{xx} = \rho c_0 \dfrac{\partial u}{\partial t},$

where $c_0 = \sqrt{E/\rho}$ is the speed of sound of longitudinal waves in
the wire. For steel, $c_0$ is about 16,000 ft/sec. The largest particle
velocity in the wire, however, is not reached at the instant of the blow,
but is reached at the top after the elastic waves propagate up and down the
wire several times. When this largest particle velocity induces a stress
equal to the ultimate stress of the wire, the wire breaks.
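
The proportionality between stress and particle velocity can be put into
numbers with a short sketch. The material constants below (a Young’s
modulus of 200 GPa and a density of 7850 kg/m^3 for steel) and the drop
height of 1 m are assumed illustrative values, not data from Hopkinson’s
test, and the function names are hypothetical.

```python
import math


def bar_wave_speed(E, rho):
    """Speed of longitudinal waves in a thin wire, c0 = sqrt(E / rho)."""
    return math.sqrt(E / rho)


def impact_velocity(height, g=9.81):
    """Velocity of a freely falling weight after dropping `height` metres.
    The mass of the weight does not enter."""
    return math.sqrt(2.0 * g * height)


def wave_stress(rho, c0, velocity):
    """Stress carried by a one-dimensional elastic wave: rho * c0 * velocity."""
    return rho * c0 * velocity


E = 200.0e9    # Pa, assumed Young's modulus of steel
rho = 7850.0   # kg/m^3, assumed density of steel
c0 = bar_wave_speed(E, rho)
v0 = impact_velocity(1.0)
sigma = wave_stress(rho, c0, v0)

print(f"c0    = {c0:.0f} m/s ({c0 / 0.3048:.0f} ft/sec)")
print(f"V0    = {v0:.2f} m/s")
print(f"sigma = {sigma / 1.0e6:.0f} MPa at the wave front")
```

Because `impact_velocity` depends only on the height, so does the stress
at the wave front, which is the point of Hopkinson’s observation.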

When the weight hits the clamp in Hopkinson’s test, the end of the wire
acquires a particle velocity $V_0$ equal to that of the weight and clamp. A
steep-fronted tension wave is generated and propagated up the wire. In the
meantime, the weight and the clamp are slowed down exponentially by the
tensile load imposed by the wire:

(2) $M \dfrac{dV}{dt} = -A\sigma_{xx} = -A\rho c_0 V,$

where $M$ is the mass of the weight and clamp, $V$ is its velocity, and $A$
is the cross-sectional area of the wire. The elastic wave, on reaching the
fixed end at the top, is reflected as a tension wave, so that the stress at
the fixed end is twice that of the incident wave. The reflected wave, going
through the tail of the incident pulse, is reflected again at the lower end
as a compression wave, and so on. If the stress at the first reflection
(equal to $2\rho c_0 V_0$) is sufficient to break the wire, then fracture
will be expected to take place near the top. In a systematic test of
gradually increasing the height of drop $h$ (i.e., gradually increasing
$V_0$), this breaking at the first reflection does not happen. In J.
Hopkinson’s experiments, the head of the stress waves was able to travel
the length of wire several times before the weight $M$ was decelerated
sufficiently, and the stress wave pattern in the wire was very complicated.
Bertram Hopkinson (1905), in repeating his father’s experiment, used
smaller weights so that the rate of decay was rapid. Nevertheless, as shown
by G. I. Taylor (1946; Bibliography 11.2), the maximum tensile stress in B.
Hopkinson’s experiment did not occur at the first reflection, when the
stress was $2\rho c_0 V_0$, but at the third reflection, i.e., the second
reflection at the top of the wire, when the tensile stress reached
$2.15\rho c_0 V_0$.
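
The role of the size of the weight can be seen from the exponential
slowing of the weight described above. The following sketch assumes that
the weight decelerates with the time constant $M/(A\rho c_0)$, which holds
only until the first reflected wave returns to the lower end; the numerical
values and function names are hypothetical.

```python
import math


def decay_time(M, A, rho, c0):
    """Time constant of the exponential slowing of the weight, M / (A rho c0)."""
    return M / (A * rho * c0)


def weight_velocity(t, V0, M, A, rho, c0):
    """Velocity of the weight before any reflected wave has returned."""
    return V0 * math.exp(-t / decay_time(M, A, rho, c0))


def transits_per_decay_time(M, A, rho, length):
    """Number of wire lengths the wave front travels in one decay time.
    This equals the ratio of the falling mass to the mass of the wire."""
    return M / (A * rho * length)


# Assumed illustrative values: a 1 kg weight on a 10 m steel wire of
# 1 mm^2 cross section.
print(transits_per_decay_time(M=1.0, A=1.0e-6, rho=7850.0, length=10.0))
```

A weight much heavier than the wire lets the wave front traverse the wire
many times before the weight slows appreciably, as in J. Hopkinson’s
tests; a lighter weight gives the rapid decay sought by B. Hopkinson.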

To work out the details, we have to determine the functions $f(x - c_0 t)$
and $F(x + c_0 t)$ of Eq. (11.2:27) to satisfy the initial conditions
$u = \partial u/\partial t = 0$ at $t = 0$, and the boundary conditions
$u = 0$ at the top; $\partial u/\partial t = V$, the velocity of the weight
and clamp, at the bottom. The velocity $V$ is determined by Eq. (2). The
full solution is given in Taylor’s paper (Bibliography 11.2).

Incidentally, Hopkinson found that the tensile strength of steel wires 
under rapid loading is much greater than that under a static load. 

11.5. GENERALIZED COORDINATES 

We shall now turn our attention to the actual construction of the
minimizing sequence in solving a variational problem. Our aim is to obtain
efficient approximate solutions. For this purpose, we shall discuss first
the idea of generalized coordinates and the definitions of the “best
approximations.”

Let us consider a function $u(x)$ of a real variable $x$, defined over an
interval $(a, b)$. If $u(x)$ can be represented by a power series

(1) $u(x) = a_0 + a_1 x + a_2 x^2 + \cdots + a_n x^n + \cdots$

or by a Fourier series

(2) $u(x) = \sum_{n=1}^{\infty} b_n \sin nx$

or, more generally, by a series of the form

(3) $u(x) = \sum_{n=1}^{\infty} c_n \phi_n(x),$

where $\{\phi_n(x)\}$ is a given set of functions, then $u(x)$ can be
specified by the sets of coefficients $\{a_n\}$, $\{b_n\}$, or $\{c_n\}$.
These coefficients may be regarded as the generalized coordinates of $u(x)$
referred to the bases $\{x^n\}$, $\{\sin nx\}$, $\{\phi_n\}$, respectively.
In Eq. (3), each term $\phi_n(x)$ represents a degree-of-freedom, and the
coefficient $c_n$ specifies the extent to which $\phi_n(x)$ participates in
the function $u(x)$. For example, if $u(x)$ represents a disturbed state of
an elastic string which is anchored at the points $x = 0$ and $x = \pi$,
then Eq. (3) states that $u(x)$ can be obtained by a superposition of
successive normal modes $\phi_n = \sin nx$, and that the $n$th mode
participates with an amplitude $c_n$.

This terminology is conventional when $c_n$ is a function of time $t$. For
example, in the dynamics of a vibrating string the displacement $u(x, t)$
may be written as

(4) $u(x, t) = \sum_{n=1}^{\infty} q_n(t) \sin nx.$

The quantity $q_n(t)$ is known as the generalized coordinate with respect to
the mode $\sin nx$. The possibility of generalizing this terminology to
cases in which $x$ and $u$ represent vectors in vector spaces of dimensions
$m$ and $k$, respectively, is obvious.
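
As an illustration of generalized coordinates, the sketch below computes
the coefficients of a string shape with respect to the modes $\sin nx$ on
$(0, \pi)$ and rebuilds the shape from a finite number of them. The shape
$u(x) = x(\pi - x)$, the midpoint-rule quadrature, and all function names
are assumptions chosen for the example.

```python
import math


def sine_coordinates(u, n_modes, n_quad=2000):
    """Generalized coordinates c_n of u(x) on (0, pi) for the basis sin(n x):
    c_n = (2 / pi) * integral of u(x) sin(n x) dx, by the midpoint rule."""
    h = math.pi / n_quad
    coords = []
    for n in range(1, n_modes + 1):
        total = 0.0
        for i in range(n_quad):
            x = (i + 0.5) * h
            total += u(x) * math.sin(n * x)
        coords.append(2.0 / math.pi * total * h)
    return coords


def reconstruct(coords, x):
    """Superpose the modes sin(n x) with amplitudes coords[n - 1]."""
    return sum(c * math.sin((n + 1) * x) for n, c in enumerate(coords))


def string_shape(x):
    """Assumed disturbed shape of a string anchored at x = 0 and x = pi."""
    return x * (math.pi - x)


c = sine_coordinates(string_shape, 9)
print([round(value, 4) for value in c])
print(reconstruct(c, math.pi / 2), string_shape(math.pi / 2))
```

For this symmetric shape only the odd modes participate, and nine
coordinates already reproduce the midpoint displacement closely.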

The fundamental idea of approximate methods of solution is to reduce
a continuum problem (with an infinite number of degrees-of-freedom) into
one of a finite number of degrees-of-freedom. The reduced problem is solved
by usual methods. Then a limiting process is used to extend the restricted
solution to the solution of the original problem. In the following
discussion we shall describe the ideas about the first step, reducing a
given problem to one of finite degrees-of-freedom. The second step, the
proof of the validity of the limiting process, is the main mathematical
problem. Examples of the mathematical theories which deal with the second
step are the theories of Taylor’s series, Fourier series, eigenfunction
expansions, etc. The theories are beyond the scope of our discussion, and
the reader is referred to Bibliography 11.3.

11.6. APPROXIMATE REPRESENTATION OF FUNCTIONS 

Let a function $f(x_1, x_2, \ldots, x_n)$ of $n$ variables $x_1, \ldots, x_n$
be defined in a domain $\mathcal{R}(x_1, x_2, \ldots, x_n)$; or, in brief
notation, $f(x_j)$ in $\mathcal{R}(x_j)$. To represent $f(x_j)$ “as well as
possible” by a linear combination of given functions $\phi_n(x_j)$,
$n = 1, 2, \ldots, N$, say, $Q_N(x_j)$, where

(1) $Q_N(x_j) = \sum_{n=1}^{N} a_n \phi_n(x_j),$

the question arises as to the meaning of the term “as well as possible.”
As soon as the “best approximation” is defined, a definite procedure for
calculating $a_n$ can be devised.

The usual definitions of the “best approximation” may be classified
into two classes.† In the first class the concept of norm is introduced. For
any function $g(x_j)$ in the domain $\mathcal{R}(x_j)$, a real nonnegative
number, called the norm of $g(x_j)$, and denoted by $\|g(x_j)\|$, is
defined, with the stipulation that $\|g(x_j)\| = 0$ only for $g(x_j) = 0$
in $\mathcal{R}(x_j)$. Then the best approximation is taken to mean that

(2) $\left\| f(x_j) - \sum_{n=1}^{N} a_n \phi_n(x_j) \right\| = \min.$


In the second class, a set of linear homogeneous expressions $L_k$,
$k = 1, 2, \ldots, N$, defined for functions in the domain
$\mathcal{R}(x_j)$, is selected, and the best approximation is taken to
mean that

(3) $L_k(\epsilon_N) = 0, \qquad k = 1, 2, \ldots, N,$

where $\epsilon_N$ is the “error”

(4) $\epsilon_N = \epsilon_N(x_j) = f(x_j) - \sum_{n=1}^{N} a_n \phi_n(x_j).$

Examples of the first class are

(A) Absolute error method: In this method the norm for any function
$g(x_j)$ in $\mathcal{R}(x_j)$ is defined to be the maximum absolute value
of $g(x_j)$ in $\mathcal{R}(x_j)$:

(5) $\|g(x_j)\| = \max_{x_j \in \mathcal{R}} |g(x_j)|.$

Hence, the best approximation is defined by requiring that

(6) $\|\epsilon_N\| = \max_{\mathcal{R}} |\epsilon_N(x_j)| = \min.$

This method was used in the early nineteenth century, but is not in favor
now because it is difficult to apply.

(B) Least squares method: In this method we take a positive integrable
function $W(x_j)$ and define the norm

(7) $\|g(x_j)\|^2 = \int_{\mathcal{R}} W g^2 \, dv,$

where $W(x_j)$ is called the weighting function. The volume integral, which
may be defined in the Riemann or Lebesgue sense, is assumed to exist, $dv$
being a volume element. In this method, the constants $a_n$ are to be
determined from the equations

(8) $\dfrac{1}{2} \dfrac{\partial \|\epsilon_N\|^2}{\partial a_k} = -\int_{\mathcal{R}} \epsilon_N W \phi_k \, dv = 0, \qquad k = 1, 2, \ldots, N.$

†We follow Collatz (1955; Bibliography 11.3) in the subsequent exposition.
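
The least squares conditions amount to a set of linear equations for the
coefficients. A minimal sketch follows, assuming a one-dimensional domain,
midpoint-rule quadrature, and a unit weighting function by default; the
target function $x^2$, the basis $\{1, x\}$, and the helper names are
hypothetical.

```python
def integrate(g, a, b, n=2000):
    """Midpoint-rule approximation of the integral of g over (a, b)."""
    h = (b - a) / n
    return h * sum(g(a + (i + 0.5) * h) for i in range(n))


def solve_linear(A, b):
    """Gaussian elimination with partial pivoting for a small dense system."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        if abs(M[pivot][col]) < 1e-14:
            raise ValueError("singular system")
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x


def least_squares_coefficients(f, basis, a, b, weight=lambda x: 1.0):
    """Solve sum_n a_n * int(W phi_n phi_k) = int(W f phi_k), k = 1..N,
    which states that the error f - sum a_n phi_n is orthogonal to W phi_k."""
    N = len(basis)
    gram = [[integrate(lambda x: weight(x) * basis[n](x) * basis[k](x), a, b)
             for n in range(N)] for k in range(N)]
    rhs = [integrate(lambda x: weight(x) * f(x) * basis[k](x), a, b)
           for k in range(N)]
    return solve_linear(gram, rhs)


basis = [lambda x: 1.0, lambda x: x]
coeffs = least_squares_coefficients(lambda x: x * x, basis, 0.0, 1.0)
print(coeffs)  # close to [-1/6, 1]
```

For this example the best straight line in the least squares sense on
$(0, 1)$ is $x - 1/6$.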

Examples of the second class are

(C) Collocation method: We take $N$ points $P_1, \ldots, P_N$ in the domain
$\mathcal{R}(x_j)$ and let

(9) $L_k(\epsilon_N) = \epsilon_N(P_k).$

Hence Eq. (3) implies that the constants $a_n$ are so chosen that $f(x_j)$
is represented exactly at $N$ points. A substitution leads to the equations

(10) $\sum_{n=1}^{N} a_n \phi_n(P_k) = f(P_k), \qquad k = 1, 2, \ldots, N.$

If the determinant $\Delta$ of coefficients of $a_n$ does not vanish,

$\Delta = \begin{vmatrix} \phi_1(P_1) & \cdots & \phi_1(P_N) \\ \vdots & & \vdots \\ \phi_N(P_1) & \cdots & \phi_N(P_N) \end{vmatrix} \neq 0,$

the constants $a_n$ can be uniquely determined.
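
The collocation equations can be assembled and solved directly. The sketch
below is a minimal one-dimensional illustration; the target function, the
basis, the collocation points, and the helper names are assumptions, and a
vanishing determinant is reported as an error.

```python
def solve_linear(A, b):
    """Gaussian elimination with partial pivoting for a small dense system."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        if abs(M[pivot][col]) < 1e-14:
            raise ValueError("singular system: the determinant vanishes")
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x


def collocation_coefficients(f, basis, points):
    """Solve sum_n a_n phi_n(P_k) = f(P_k) at the collocation points P_k."""
    if len(points) != len(basis):
        raise ValueError("need as many points as basis functions")
    matrix = [[phi(p) for phi in basis] for p in points]
    rhs = [f(p) for p in points]
    return solve_linear(matrix, rhs)


basis = [lambda x: 1.0, lambda x: x]
a = collocation_coefficients(lambda x: x * x, basis, [0.25, 0.75])
print(a)
```

With these points the approximation $a_0 + a_1 x$ agrees with $x^2$ at
$x = 0.25$ and $x = 0.75$; choosing two coincident points would make the
determinant vanish.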


(D) Orthogonality method: We choose a set of $N$ linearly independent
functions $g_k(x_j)$ and let

(11) $L_k(\epsilon_N) = \int_{\mathcal{R}} \epsilon_N g_k \, dv, \qquad k = 1, 2, \ldots, N.$

If we choose $g_k = W\phi_k$, then we get the conditions

(12) $\int_{\mathcal{R}} \epsilon_N W \phi_k \, dv = 0,$

identical with the least squares method.


(E) Subregion method: As a generalization of the collocation method, we
subdivide the region $\mathcal{R}(x_j)$ into $N$ subregions
$\mathcal{R}_1, \mathcal{R}_2, \ldots, \mathcal{R}_N$ and define

(13) $L_k(\epsilon) = \int_{\mathcal{R}_k} \epsilon \, dv, \qquad k = 1, \ldots, N,$

then follow the steps of the collocation method.







11.7. APPROXIMATE SOLUTION OF DIFFERENTIAL 
EQUATIONS 

If we try to approximate the solution $u(x_j)$ of a differential equation
defined over a domain $\mathcal{R}(x_1, \ldots, x_n)$ with certain boundary
conditions by an expression

(1) $Q_N(x_j) = \sum_{n=1}^{N} a_n \phi_n(x_j),$

we may take one of the following approaches: (A) each $\phi_n(x_j)$
satisfies the boundary conditions, (B) each $\phi_n(x_j)$ satisfies the
differential equation. In either case an error function can be defined, and
it is required to be “as small as possible.”

For example, consider the case (A). Let the differential equation be

(2) $\mathcal{L}\{u(x_j)\} = 0.$

A substitution of Eq. (1) into Eq. (2) gives the error

(3) $\epsilon_N(x_j) = \mathcal{L}\left\{ \sum_{n=1}^{N} a_n \phi_n(x_j) \right\},$

which must be minimized. The definition of the “best approximation” can
be varied, but all methods of the previous section can be used here.

A possible generalization of Eq. (1) to the form

$Q_N(x_j) = w(x_1, \ldots, x_n; a_1, \ldots, a_N),$

which depends on the $N$ parameters $a_n$, seems evident, although a
mathematical theory of such a generalization is by no means simple.
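
Approach (A) can be illustrated on a small boundary-value problem. The
sketch below assumes the equation $u'' + u + x = 0$ with
$u(0) = u(1) = 0$, a single trial function $\phi = x(1 - x)$ that
satisfies the boundary conditions, and collocation at the midpoint as the
definition of the best approximation. The problem and the function names
are hypothetical choices made for the example.

```python
import math


def residual(a, x):
    """Error eps(x) obtained by substituting u = a * x * (1 - x) into
    the assumed equation u'' + u + x = 0."""
    phi = x * (1.0 - x)
    phi_xx = -2.0
    return a * phi_xx + a * phi + x


def collocation_amplitude(x_c):
    """Amplitude a for which the residual vanishes at the point x_c.
    The residual is linear in a: eps = a * (phi'' + phi) + x."""
    phi = x_c * (1.0 - x_c)
    return -x_c / (-2.0 + phi)


def exact_solution(x):
    """Exact solution of the assumed problem, for comparison."""
    return math.sin(x) / math.sin(1.0) - x


a = collocation_amplitude(0.5)
approx_mid = a * 0.5 * (1.0 - 0.5)
print(a, approx_mid, exact_solution(0.5))
```

Even one term gives the midpoint value within a few percent of the exact
solution of this assumed problem.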

11.8. DIRECT METHODS OF VARIATIONAL CALCULUS 

Ideas similar to those presented in the preceding sections can be applied
to variational problems. Let $V$ be the class of admissible functions
$u(x_j)$ of $n$ variables $x_1, x_2, \ldots, x_n$, or $x_j$ for short,
defined over a domain $\mathcal{R}(x_j)$ and satisfying all the “rigid”
boundary conditions (cf. Sec. 10.8). Let $y(x_j)$, belonging to $V$,
minimize the functional $I[u]$. The following approximate representations
of $y(x_j)$ are suggested:

Method of Finite Differences. We choose a set of points $P_k$,
$k = 1, 2, \ldots, N$, in $\mathcal{R}$ and let

(1) $y(P_k) = y_k, \qquad P_k \text{ in } \mathcal{R}, \quad k = 1, 2, \ldots, N.$

The derivatives of $y(x_j)$ are then replaced by expressions involving
successive finite differences of $y_1, y_2, \ldots, y_N$, according to the
calculus of finite differences, and the integrations are replaced by a
finite summation. Then the functional $I[y]$ becomes a function
$\Psi(y_1, y_2, \ldots, y_N)$. We choose the values $y_1, \ldots, y_N$ so
that $\Psi(y_j)$ has an extremum; i.e.,

(2) $\dfrac{\partial \Psi}{\partial y_i} = 0, \qquad i = 1, \ldots, N.$

This method was used by Euler.
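
A minimal sketch of the finite-difference method follows. It assumes the
functional $I[y] = \int_0^1 (\tfrac{1}{2} y'^2 - y)\,dx$ with
$y(0) = y(1) = 0$ on a uniform grid of spacing $h$; setting the partial
derivatives of the discretized functional to zero then gives
$2y_i - y_{i-1} - y_{i+1} = h^2$. The functional and the function names
are hypothetical choices for illustration.

```python
def solve_tridiagonal(lower, diag, upper, rhs):
    """Thomas algorithm. lower[0] and upper[-1] are not used."""
    n = len(diag)
    c = [0.0] * n
    d = [0.0] * n
    c[0] = upper[0] / diag[0]
    d[0] = rhs[0] / diag[0]
    for i in range(1, n):
        denom = diag[i] - lower[i] * c[i - 1]
        c[i] = upper[i] / denom
        d[i] = (rhs[i] - lower[i] * d[i - 1]) / denom
    x = [0.0] * n
    x[-1] = d[-1]
    for i in range(n - 2, -1, -1):
        x[i] = d[i] - c[i] * x[i + 1]
    return x


def euler_finite_difference(n_interior):
    """Stationary values y_1..y_N of the discretized functional
    Psi = sum (y[i+1] - y[i])**2 / (2 h) - h * sum y[i], with y = 0 at both ends."""
    h = 1.0 / (n_interior + 1)
    lower = [-1.0] * n_interior
    diag = [2.0] * n_interior
    upper = [-1.0] * n_interior
    rhs = [h * h] * n_interior
    return h, solve_tridiagonal(lower, diag, upper, rhs)


h, y = euler_finite_difference(9)
for i, value in enumerate(y):
    x = (i + 1) * h
    print(f"x = {x:.1f}   y = {value:.5f}   exact = {x * (1.0 - x) / 2.0:.5f}")
```

For this particular functional the nodal values coincide with the exact
minimizer $x(1 - x)/2$, because the second difference of a quadratic is
exact.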

Rayleigh-Ritz Method. We set

(3) $y_N(x_j) = \sum_{n=1}^{N} a_n \phi_n(x_j),$

where $\phi_n(x_j)$ are known functions which are so chosen that $y_N(x_j)$
are admissible, i.e., they belong to the class $V$. When Eq. (3) is
substituted into $I[u]$, the latter becomes a function
$\Psi(a_1, a_2, \ldots, a_N)$ of the coefficients $a_j$. These coefficients
$a_1, \ldots, a_N$ are then so chosen that $\Psi(a_j)$ has an extremum;
i.e.,

(4) $\dfrac{\partial \Psi}{\partial a_j} = 0, \qquad j = 1, \ldots, N.$
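
The Rayleigh-Ritz conditions can be sketched for a simple functional. The
example assumes $I[y] = \int_0^1 (\tfrac{1}{2} y'^2 - y)\,dx$ with
$y(0) = y(1) = 0$ and admissible functions $\phi_n = x^n(1 - x)$, for which
the extremum conditions become the linear system
$\sum_k K_{jk} a_k = b_j$ with $K_{jk} = \int \phi_j' \phi_k' \,dx$ and
$b_j = \int \phi_j \,dx$. The functional, the basis, and the helper names
are hypothetical.

```python
def integrate(g, a, b, n=2000):
    """Midpoint-rule approximation of the integral of g over (a, b)."""
    h = (b - a) / n
    return h * sum(g(a + (i + 0.5) * h) for i in range(n))


def solve_linear(A, b):
    """Gaussian elimination with partial pivoting for a small dense system."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        if abs(M[pivot][col]) < 1e-14:
            raise ValueError("singular system")
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x


def ritz_coefficients(n_terms):
    """Coefficients a_1..a_N that make Psi(a) stationary for the assumed
    functional, using phi_n = x**n * (1 - x), n = 1..n_terms."""
    def phi(n, x):
        return x ** n * (1.0 - x)

    def dphi(n, x):
        return n * x ** (n - 1) - (n + 1) * x ** n

    idx = range(1, n_terms + 1)
    stiffness = [[integrate(lambda x: dphi(j, x) * dphi(k, x), 0.0, 1.0)
                  for k in idx] for j in idx]
    load = [integrate(lambda x: phi(j, x), 0.0, 1.0) for j in idx]
    return solve_linear(stiffness, load)


a = ritz_coefficients(2)
print(a)  # close to [0.5, 0.0]
```

Here the exact minimizer $x(1 - x)/2$ lies in the span of the chosen
functions, so the first coefficient comes out near 1/2 and the second
near zero.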

Kantorovich’s Method. We choose a set of coordinate functions

$\phi_1(x_1, x_2, \ldots, x_n), \ \phi_2(x_1, x_2, \ldots, x_n), \ \ldots, \ \phi_N(x_1, x_2, \ldots, x_n)$

and try an approximate solution in the form

(5) $y_N = \sum_{k=1}^{N} a_k(x_1) \phi_k(x_1, x_2, \ldots, x_n),$

where the coefficients $a_k(x_1)$ are no longer constants, but are unknown
functions of one of the independent variables. The functional $I[u]$ is
reduced to a new functional $\Psi[a_1(x_1), a_2(x_1), \ldots, a_N(x_1)]$,
which depends on $N$ functions of one independent variable,

(6) $a_1(x_1), a_2(x_1), \ldots, a_N(x_1).$

The functions (6) must be chosen to minimize the functional $\Psi$
(Sec. 10.3).

It should be recognized that Kantorovich’s method is a generalization
of a method commonly used in the classical treatment of small oscillations
of an elastic body, in which the displacements are represented in the form

(7) $u(x, t) = \sum_{k=1}^{N} q_k(t) \phi_k(x),$

where $\phi_k(x)$, depending solely on the spatial coordinates, are the
bases of the generalized coordinates. The Euler equations of the energy
integral, according to Hamilton’s principle, are the Lagrangian equations
of motion.

Galerkin’s Method. Galerkin’s idea of minimization of errors by
orthogonalizing with respect to a set of given functions is best
illustrated by an example. Consider the simple beam discussed in Sec. 10.8.
The minimization of the functional $V$, Eq. (10.8:3), leads to the
variational equation

(8) $\int_0^l \left[ \dfrac{d^2}{dx^2}\left( EI \dfrac{d^2 y}{dx^2} \right) - p \right] \delta y \, dx + \left[ \left( EI \dfrac{d^2 y}{dx^2} - M \right) \delta\!\left( \dfrac{dy}{dx} \right) - \left( \dfrac{d}{dx}\left( EI \dfrac{d^2 y}{dx^2} \right) - Q \right) \delta y \right]_0^l = 0.$

Here $EI$ and $p$ are known functions of $x$; $M$, $Q$ are unspecified if
an end is “rigid” [where $\delta y = \delta(dy/dx) = 0$]; or are given
numbers if an end is “natural” [where $\delta y$ and $\delta(dy/dx)$ are
arbitrary].

Let us consider a clamped-clamped beam (both ends “rigid”). If we
assume

(9) $y_N = \sum_{k=1}^{N} a_k \phi_k(x),$

where

(10) $\phi_k = \dfrac{d\phi_k}{dx} = 0$ at $x = 0$ and $x = l$, $\qquad k = 1, \ldots, N,$

then, in general, Eq. (8) cannot be satisfied. The quantity

(11) $\epsilon_N(x) = \dfrac{d^2}{dx^2}\left( EI \dfrac{d^2 y_N}{dx^2} \right) - p$

may be called an error. Now we demand that the coefficients $a_k$ be so
chosen that Eq. (8) be satisfied when $\delta y$ is identified with any of
the functions $\phi_1(x), \phi_2(x), \ldots, \phi_N(x)$:

(12) $\int_0^l \epsilon_N(x) \phi_k(x) \, dx = 0, \qquad k = 1, \ldots, N.$

In other words, we demand that the error be orthogonal to $\phi_k(x)$.

If some of the end conditions are “natural,” we still represent the
approximate solution in the form (9) and require $y_N(x)$ to satisfy only
the “rigid” boundary conditions wherever they apply. In this case, Eq. (8)
leads to the following in place of Eq. (12).

(13) $\int_0^l \epsilon_N(x) \phi_k(x) \, dx + \left[ \left( EI \dfrac{d^2 y_N}{dx^2} - M \right) \dfrac{d\phi_k}{dx} - \left( \dfrac{d}{dx}\left( EI \dfrac{d^2 y_N}{dx^2} \right) - Q \right) \phi_k \right]_0^l = 0, \qquad k = 1, 2, \ldots, N.$

Equations (12) or (13) are sets of $N$ linear equations to solve for the $N$
unknown constants $a_1, \ldots, a_N$.
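
A one-term Galerkin calculation for the clamped-clamped beam can be
sketched as follows. It assumes a constant bending rigidity $EI$, a uniform
load $p$, and the single function $\phi = x^2 (l - x)^2$, which satisfies
the rigid end conditions; the error is then $EI\,a\,\phi'''' - p$ with
$\phi'''' = 24$. The numerical values and function names are hypothetical.

```python
def integrate(g, a, b, n=2000):
    """Midpoint-rule approximation of the integral of g over (a, b)."""
    h = (b - a) / n
    return h * sum(g(a + (i + 0.5) * h) for i in range(n))


def galerkin_clamped_beam(EI, p, length):
    """One-term Galerkin solution y = a * phi for a clamped-clamped beam
    with constant EI and uniform load p, using phi = x**2 * (length - x)**2."""
    def phi(x):
        return x * x * (length - x) ** 2

    phi_xxxx = 24.0  # fourth derivative of phi
    # Orthogonality of the error to phi:
    #   integral of (EI * a * phi'''' - p) * phi dx = 0
    numerator = integrate(lambda x: p * phi(x), 0.0, length)
    denominator = integrate(lambda x: EI * phi_xxxx * phi(x), 0.0, length)
    a = numerator / denominator
    return a, (lambda x: a * phi(x))


a, deflection = galerkin_clamped_beam(EI=2.0, p=3.0, length=1.5)
print(a, deflection(0.75))
```

With these assumptions the chosen function has the shape of the exact
deflection, so the single coefficient is $p/(24\,EI)$ and the midspan
deflection equals the classical value $p\,l^4/(384\,EI)$.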


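Trefftz’ Method. Consider again Eq. (8). In the Trefftz method, the
approximate solution (9) is so chosen that $\phi_k(x)$ satisfies the Euler
equation

$\dfrac{d^2}{dx^2}\left( EI \dfrac{d^2 y}{dx^2} \right) - p = 0$

but not necessarily the boundary conditions. We now select the coefficients
$a_k$ in such a way that Eq. (8) is satisfied if the variations $\delta y$
were limited to the set of functions $\phi_k$:

(14) $\left[ \left( EI \dfrac{d^2 y_N}{dx^2} - M \right) \dfrac{d\phi_k}{dx} - \left( \dfrac{d}{dx}\left( EI \dfrac{d^2 y_N}{dx^2} \right) - Q \right) \phi_k \right]_0^l = 0, \qquad k = 1, \ldots, N.$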



PROBLEMS 

11.5. Saint-Venant’s problem of torsion of a shaft with a cross section
occupying a region $R$ on the $x$, $y$ plane bounded by a curve (or curves)
$C$, is discussed in Sec. 7.5. Show that Saint-Venant’s theory is
equivalent to finding a function $\psi(x, y)$ (Prandtl’s stress function,
p. 167) which minimizes the functional



under the boundary condition that $\partial\psi/\partial s = 0$ on $C$. If
the region $R$ is simply connected, we may take $\psi = 0$ on $C$. If $R$
is multiply connected, we take $\psi$ equal to a different constant on each
boundary. The torque is then given by Eq. (7.5:24).

Consider a shaft of rectangular cross section. Obtain approximate solutions
to the torsion problem by two different direct methods outlined in Sec. 11.8.
Compare the values of the torsional rigidity of the shaft obtained in each
method. (Reference, Sokolnikoff (Bibliography 1.2), 2nd edition, Chapter 7,
pp. 400, 416, 423, 430, 437, 442.)

11.6. Consider the pin-ended column described in Problem 10.11, p. 313.
Assume that the column and the spring are massless but that a lumped mass
$M$ is situated on the column at the point $A$. At the instant of time
$t = 0$, the column is plucked so that the deflection at $A$ is
$\Delta_0 \neq 0$. By Hamilton’s principle, or otherwise, find the equation
of motion of the mass. Integrate the equation of motion to determine the
motion of the mass in the case in which the lateral spring is not there
($K = 0$, $a = 0$).

11.7. Let the load $P$ of Problems 10.11 and 11.6 be an oscillatory load,

$P = P_0 + P_1 \cos \omega t.$

Could the column become unstable (i.e., with the amplitude of motion
unbounded as $t \to \infty$)? In particular, if $P_0 = P_{cr}$, the
buckling load, is there a chance that by properly selecting $P_1$ and
$\omega$, the column remains stable? Discuss the case of no lateral spring
($K = 0$, $a = 0$) in greater detail.

11.8. Consider longitudinal wave propagation in a slender rod which in
various segments is made of different materials. It is desired to transmit
the wave through the rod with as little distortion as possible. How should
the material constants be matched? (This is a practical problem which often
occurs in instrumentation, such as piezoelectric sensing, etc.) Hint:
Consider transmission of harmonic waves without reflection at the
interfaces between segments. If quantities pertaining to the $n$th segment
are indicated by a subscript $n$, show that we must have
$\rho_n c_n = \text{const.}$ for all $n$ or, in another form,
$E_n \rho_n = \text{const.}$

11.9. Consider a beam on an elastic foundation. A traveling load moves at 
a constant speed over the beam. Determine the steady-state elastic responses of 
the beam. 

11.10. Consider the longitudinal vibrations of a slender elastic rod of
variable cross-sectional area $A(x)$. Determine the speed of propagation of
longitudinal waves, assuming that the rod is so slender that the variation
of the elastic displacement over the cross section is negligible. If one
end of the beam is subjected to a harmonic longitudinal oscillation with
the boundary condition $(u)_{x=0} = a e^{i\omega t}$, whereas the other end
is free, determine the beam response for the following cases:

(1) $A(x) = \text{const.}$

(2) $A(x) = A_0 x^2$, $\quad (0 < a \le x \le b)$,

(3) $A(x) = A_0 e^{2x/L}$, $\quad (0 \le x \le b)$.

ELASTICITY AND THERMODYNAMICS


In this chapter we shall consider what restrictions classical
thermodynamics imposes on the theory of solids and what information
concerning solid continua can be deduced from thermodynamics. We shall
discuss also the question of existence and positive definiteness of the
strain energy function which is of central importance in the theory of
elasticity.

12.1. THE LAWS OF THERMODYNAMICS 

A brief summary of the basic structure of the classical theory of
thermodynamics is given in this section. We call a particular collection of
matter which is being studied a system. Only closed systems, i.e., systems
which do not exchange matter with their surroundings, are considered here.
Occasionally, a further restriction is made that no interactions between
the system and its surroundings occur; the system is then said to be
isolated.

Consider a given system. When all the information required for a complete
characterization of the system for a purpose at hand is available, it
will be said that the state of the system is known. For example, for a
certain homogeneous elastic body at rest, a complete description of its
thermodynamic state requires a specification of its material content, i.e.,
the quantity of each chemical substance contained; its geometry in the
natural or unstrained state; its deviation from the natural state, or
strain field; its stress field; and, if some physical properties depend on
whether the body is hot or cold, one extra independent quantity which fixes
the degree of hotness or coldness. These quantities are called state
variables. If a certain state variable can be expressed as a single-valued
function of a set of other state variables, then the functional
relationship is said to be an equation of state, and the variable so
described is called a state function. The selection of a particular set of
independent state variables is important in each problem, but the choice is
to a certain extent arbitrary.

If, for a given system, the values of the state variables are independent
of time, the system is said to be in thermodynamic equilibrium. If the state
variables vary with time, then the system is said to undergo a process. The
number of state variables required to describe a process may be larger than
that required to describe the system at thermodynamic equilibrium. For
example, in describing the flow of a fluid we may need to know the viscosity.

The boundary or wall separating two systems is said to be insulating if it 
has the following property: if any system in complete internal equilibrium 
is completely surrounded by an insulating wall, then no change can be 
produced in the system by an external agency except by movement of the 
wall or by long-range forces such as gravitation. A system surrounded by 
an insulating wall is said to be thermally insulated, and any process taking 
place in the system is called adiabatic. 

A system is said to be homogeneous if the state variables do not depend
on space coordinates. The classical thermodynamics is concerned
with the conditions of equilibrium within a homogeneous system or in a
heterogeneous system which is composed of homogeneous parts (“phases”).
A generalization to nonhomogeneous systems requires certain additional
hypotheses which will be considered in Sec. 13.1. In this chapter we shall
restrict our attention to homogeneous systems.

The first step in the formulation of thermodynamics is to introduce the
concept of temperature. It is postulated that if two systems are each in
thermal equilibrium with a third system, they are in thermal equilibrium
with each other. From this it can be shown to follow that the condition
of thermal equilibrium between several systems is the equality of a certain
single-valued function of the thermodynamic states of the systems, which
may be called the temperature $\mathcal{T}$, any one of the systems being
used as a “thermometer” reading the temperature $\mathcal{T}$ on a suitable
scale. The temperature whose existence is thus postulated is measured on a
scale which is determined by the arbitrary choice of the thermometer and is
called the empirical temperature.

The first law of thermodynamics can be formulated as follows. If a
thermally insulated system can be taken from a state I to a state II by
alternative paths, the work done on the system has the same value for every
such (adiabatic) path. From this we can deduce that there exists a
single-valued function of the state of a system, called its energy, such
that for an adiabatic process the increase of the energy is equal to the
work done on the system. Thus,

(1) $\Delta$ energy = work done (adiabatic process).

It is to be noticed that for this definition of energy it is necessary and
sufficient that it be possible by an adiabatic process to change the system
either from state I to state II or from state II to state I.

We now define the heat $Q$ absorbed by a system as the increase in energy
of the system, less the work done on the system. Thus,

(2) $Q = \Delta$ energy $-$ work done (all processes),

or

(3) $\Delta$ energy $= Q +$ work done.

If this is regarded as a statement of the conservation of energy and is
compared with (1), we observe that the energy of a system can be increased
either by work done on it or by absorption of heat.

It is customary to identify several types of energy which make up the 
total: the kinetic energy K, the gravitational energy G, and the internal 
energy E. Thus, the equation 

(4) energy = K + G + E 

may be regarded as the definition of the internal energy. 

We now formulate the second law of thermodynamics for a homogeneous
system as follows.

There exist two single-valued functions of state, $T$, called the absolute
temperature, and $S$, called the entropy, with the following properties:

I. $T$ is a positive number which is a function of the empirical
temperature $\mathcal{T}$ only.

II. The entropy of a system is equal to the sum of entropies of its parts.

III. The entropy of a system can change in two distinct ways, namely,
by interaction with the surroundings and by changes taking place
inside the system. Symbolically, we may write

(5) $dS = d_e S + d_i S,$

where $dS$ denotes the increase of entropy of the system, $d_e S$ denotes
the part of this increase due to interaction with the surroundings,
and $d_i S$ denotes the part of this increase due to changes taking place
inside the system. Then, if $dQ$ denotes the heat absorbed by the
system from its surroundings, we have

(6) $d_e S = \dfrac{dQ}{T}.$

The change $d_i S$ is never negative. If $d_i S$ is zero the process is
said to be reversible. If $d_i S$ is positive the process is said to be
irreversible:

$d_i S > 0$ (irreversible process),

$d_i S = 0$ (reversible process).

The remaining case, $d_i S < 0$, never occurs in nature.
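
A simple bookkeeping example may help fix the signs. The sketch below
considers two bodies, each at a uniform temperature, exchanging a quantity
of heat while isolated from everything else; it assumes that each body’s
entropy changes by the heat it absorbs divided by its own absolute
temperature, so that the total change is the internal production for the
composite system. The function name and the numbers are hypothetical.

```python
def entropy_changes(Q, T_hot, T_cold):
    """Entropy changes when heat Q > 0 leaves a body at absolute temperature
    T_hot and enters a body at T_cold. Returns (dS_hot, dS_cold, dS_total)."""
    dS_hot = -Q / T_hot    # the hot body absorbs -Q
    dS_cold = Q / T_cold   # the cold body absorbs +Q
    return dS_hot, dS_cold, dS_hot + dS_cold


dS_hot, dS_cold, dS_total = entropy_changes(Q=100.0, T_hot=400.0, T_cold=300.0)
print(dS_hot, dS_cold, dS_total)
```

The total is positive whenever heat flows from the hotter to the colder
body and vanishes only when the two temperatures are equal, in agreement
with the statement that the internal production is never negative.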

The absolute temperature $T$ and the entropy $S$ are two fundamental
quantities. We will not attempt to define them in terms of other quantities
regarded as simpler. They are defined merely by their properties expressed
in the second law. Lord Kelvin has shown how it is possible to calibrate
any thermometer into absolute temperature scale. (See Problem 12.3.) The
scale of the absolute temperature is fixed by defining the temperature of
the triple point of water (the equilibrium among liquid water, ice, and
water vapor) as 273.16°K, i.e., 273.16 degrees on Kelvin’s scale;
equilibrium between liquid water and ice at a pressure of 1 atm then occurs
at about 273.15°K.

These postulates form the basis of the classical thermodynamics. The
justification of these postulates is the empirical fact that all conclusions
derived from these assumptions are without exception in agreement with the
experimentally observed behavior of systems in nature at the macroscopic
scale. The form in which these principles are enunciated above is essentially
that used by Max Born (1921; Bibliography 12.1) and I. Prigogine (1947;
Bibliography 13.1).

It is important to familiarize ourselves with the concept that entropy is
an attribute of a material body, just as its mass or its electric charge is.
A pound of oxygen has a definite amount of entropy, which can be changed 
by changing the temperature and specific volume of the gas. A pound of 
steel at a given temperature and in a given state of strain has a definite 
amount of entropy. It is instructive to consider Problems 12.5 and 12.6 to 
obtain some intuitive feeling about entropy. 

PROBLEMS 

12.1. Calculate the work performed by 10 g of hydrogen expanding
isothermally at 20°C from 1 to 0.5 atm of pressure. (Note: The equation of
state for a system composed of $m$ grams of a gas whose molecular weight is
$M$ is given approximately by $pv = (m/M)RT$, where $R$ is a universal
constant (same for all gases); $R = 8.314 \times 10^7$ ergs/degree, or
1.986 cal/degree.)

12.2. Calculate the energy variation of a system which performs 8 x 10 s ergs 
of work and absorbs 60 cal of heat. (Note: 1 cal = $4.185 \times 10^7$ ergs.)

12.3. To calibrate an empirical temperature scale against the absolute
temperature scale, let us consider a heat engine. A working substance is
assumed which undergoes a cycle of operations involving changes of
temperature, volume, and pressure and which is eventually brought back to
its original condition. In a Carnot cycle, an ideal gas is first brought to
contact with a large heat source which is maintained at a constant
empirical temperature $\mathcal{T}_1$. The gas at the equilibrium
temperature $\mathcal{T}_1$ is allowed to expand (isothermally) and absorbs
an amount of heat $Q_1$ from the heat source. Then it is removed and
insulated from the heat source and is allowed to expand adiabatically to
perform work, lowering its empirical temperature to a value
$\mathcal{T}_2$. Next it is put in contact with a heat sink of temperature
$\mathcal{T}_2$ and is compressed isothermally to release an amount of heat
$Q_2$. Finally, it is recompressed adiabatically back to its original
state. If every process in the Carnot cycle is reversible, the engine is
said to be reversible, otherwise it is irreversible.

From the second law of thermodynamics prove that: 


(a) When the temperatures of the source and sink are given, a reversible
engine is the most efficient type of engine, i.e., it yields the maximum
amount of mechanical work for a given abstraction of heat from the
source. Note: Efficiency = $(Q_1 - Q_2)/Q_1$.

(b) The efficiency of a reversible engine is independent of the construction 
or material of the engine and depends only on the temperatures of the 
source and sink. 

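(c) In a reversible Carnot engine, we must have

$\dfrac{Q_2}{Q_1} = f(\mathcal{T}_1, \mathcal{T}_2),$

where $f(\mathcal{T}_1, \mathcal{T}_2)$ is a universal function of the two
temperatures $\mathcal{T}_1$ and $\mathcal{T}_2$. Show that the function
$f(\mathcal{T}_1, \mathcal{T}_2)$ has the property

$f(\mathcal{T}_1, \mathcal{T}_2) = \dfrac{f(\mathcal{T}_0, \mathcal{T}_2)}{f(\mathcal{T}_0, \mathcal{T}_1)},$

where $\mathcal{T}_0$, $\mathcal{T}_1$, and $\mathcal{T}_2$ are three
arbitrary temperatures. If $\mathcal{T}_0$ is kept constant, we may
consider $f(\mathcal{T}_0, \mathcal{T})$ as being a function of the
temperature $\mathcal{T}$ only. Hence, we may write

$K f(\mathcal{T}_0, \mathcal{T}) = \theta(\mathcal{T}),$

where $K$ is a constant. Thus,

$\dfrac{Q_1}{\theta(\mathcal{T}_1)} = \dfrac{Q_2}{\theta(\mathcal{T}_2)}.$

Lord Kelvin perceived (1848) that the numbers $\theta(\mathcal{T}_1)$,
$\theta(\mathcal{T}_2)$ may be taken to define an absolute temperature
scale. Identify this with the $T$ introduced above.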

(d) It is suggested that the reader review the theory of the Joule-Thomson 
experiment, which is used in practice to calibrate a gas thermometer to 
absolute temperature scale. 


12.4. By considering an ideal gas as the working fluid of a Carnot cycle, 
show that the gas thermometer reads exactly the absolute temperature. 

12.5. Derive the following formula for the entropy of one mole of an ideal gas.

$S = C_v \log T + R \log V + a,$

where $C_v$ is the specific heat at constant volume, $R$ is the universal
gas constant, and $a$ is a constant of integration.

12.6. Prove the following analogy. The work that one can derive from a 
waterfall is proportional to the product of the height of the waterfall and the 
quantity of the water flow. The work that can be derived from a heat engine 
performing a Carnot cycle is proportional to the product of the temperature 
difference between the heat source and the heat sink, and the quantity of entropy 
flow. 

12.2. ENERGY EQUATION 

We shall now express the equation of balance of energy, Eqs. (12.1:3) 
and (12.1:4), in a form more convenient for continuum mechanics. As an 
exception in the present chapter, this section is not limited to homogeneous 
systems. 

For a body of particles in a configuration occupying a region $V$ bounded
by a surface $S$, the kinetic energy is

(1)  $K = \int_V \tfrac{1}{2} \rho v_i v_i \, dv$,

where $v_i$ are the components of the velocity vector of a particle occupying
an element of volume $dv$, and $\rho$ is the density of the material. The internal
energy is written in the form

(2)  $E = \int_V \rho \mathscr{E} \, dv$,

where $\mathscr{E}$ is the internal energy per unit mass.† The heat input into the
body must be imparted through the boundary. A new vector, the heat flux
$\mathbf{h}$, with components $h_1, h_2, h_3$, is defined as follows. Let $dS$ be a surface
element in the body, with unit outer normal $\nu_i$. Then the rate at which
heat is transmitted across the surface $dS$ in the direction of $\nu_i$ is assumed
to be representable as $h_i \nu_i \, dS$. To be specific, in defining the heat flux
for a moving medium we insist that the surface element $dS$ be
composed of the same particles. The rate of heat input is therefore

(3)  $\dot{Q} = -\int_S h_i \nu_i \, dS = -\int_V h_{j,j} \, dv$.

† It is interesting to verify that if one chooses to use the internal energy per unit
volume $\mathscr{E}_v$, instead of $\mathscr{E}$ per unit mass, so that $E = \int \mathscr{E}_v \, dv$, then the energy
equation corresponding to Eq. (7) infra will be somewhat more complicated.

The rate at which work is done on the body by the body force per unit
volume $F_i$ in $V$ and the surface traction $\overset{\nu}{T}_i$ on $S$ is the power

(4)  $P = \int_V F_i v_i \, dv + \int_S \overset{\nu}{T}_i v_i \, dS$
       $= \int_V F_i v_i \, dv + \int_S \sigma_{ij} \nu_j v_i \, dS$
       $= \int_V F_i v_i \, dv + \int_V (\sigma_{ij} v_i)_{,j} \, dv$.

The first law states that

(5)  $\dot{K} + \dot{E} = \dot{Q} + P$,

where the dot denotes the material derivative $D/Dt$.

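A formula for computing the material derivative of an integral has been
given in Eq. (5.3:4). A little calculation yields

(6)  $\frac{1}{2}\rho \frac{Dv^2}{Dt} + \frac{v^2}{2} \frac{D\rho}{Dt} + \frac{v^2}{2} \rho \,\mathrm{div}\, \mathbf{v} + \rho \frac{D\mathscr{E}}{Dt} + \mathscr{E} \frac{D\rho}{Dt} + \mathscr{E} \rho \,\mathrm{div}\, \mathbf{v}$
       $= -h_{j,j} + F_i v_i + \sigma_{ij,j} v_i + \sigma_{ij} v_{i,j}$.

On substituting the equation of continuity and the equations of motion

$\frac{D\rho}{Dt} + \rho \,\mathrm{div}\, \mathbf{v} = 0, \qquad \rho \frac{Dv_i}{Dt} = F_i + \sigma_{ij,j}$,

respectively, into Eq. (6), we obtain

(7) ▲  $\rho \frac{D\mathscr{E}}{Dt} = -\frac{\partial h_j}{\partial x_j} + \sigma_{ij} V_{ij}$,

where

(8)  $V_{ij} = \frac{1}{2}(v_{i,j} + v_{j,i})$

is the symmetric part of the tensor $v_{i,j}$, called the rate of deformation
tensor. The antisymmetric part of $v_{i,j}$ contributes nothing to the sum
$\sigma_{ij} v_{i,j}$ since $\sigma_{ij}$ is symmetric.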
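
The last remark can be checked numerically. The following is a minimal sketch, not part of the original derivation: the stress and velocity-gradient values are hypothetical sample numbers, and the helper names (`contract`, `symmetric_part`, `antisymmetric_part`) are introduced here only for illustration.

```python
# Sketch: for a symmetric stress tensor, only the symmetric part of the
# velocity gradient contributes to the stress power sigma_ij * v_{i,j}.

def contract(a, b):
    """Double contraction a_ij * b_ij of two 3x3 nested lists."""
    return sum(a[i][j] * b[i][j] for i in range(3) for j in range(3))

def symmetric_part(g):
    """Rate of deformation V_ij = (g_ij + g_ji) / 2."""
    return [[0.5 * (g[i][j] + g[j][i]) for j in range(3)] for i in range(3)]

def antisymmetric_part(g):
    """Spin part (g_ij - g_ji) / 2."""
    return [[0.5 * (g[i][j] - g[j][i]) for j in range(3)] for i in range(3)]

# Hypothetical sample values (sigma is symmetric by construction).
sigma = [[2.0, 0.5, -1.0],
         [0.5, 1.0, 0.3],
         [-1.0, 0.3, -4.0]]
grad_v = [[0.1, 0.4, -0.2],
          [0.0, -0.3, 0.7],
          [0.5, 0.2, 0.6]]

stress_power_full = contract(sigma, grad_v)
stress_power_sym = contract(sigma, symmetric_part(grad_v))
stress_power_antisym = contract(sigma, antisymmetric_part(grad_v))
```

In this sketch `stress_power_full` and `stress_power_sym` agree to rounding error, while `stress_power_antisym` vanishes, which is the content of the statement above.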

In classical thermodynamics we are concerned with the small neighborhood
of thermodynamic equilibrium. It suffices in the present chapter to
consider infinitesimal strain, which is imposed very slowly. In this case we
may write Eq. (7) in a form commonly used in thermodynamics books:

(9)  $\rho \, d\mathscr{E} = dQ + \sigma_{ij} \, de_{ij}$.

Our Eq. (7) gives precise meaning to Eq. (9). If there is no internal entropy
production in the process, so that $d_i\mathscr{S} = 0$, then the second law gives
$dQ = T \rho \, d\mathscr{S}$, where $\mathscr{S}$ denotes the specific entropy, or the entropy per unit
mass. Hence,

(10) ▲  $d\mathscr{E} = T \, d\mathscr{S} + \frac{1}{\rho} \sigma_{ij} \, de_{ij}$.

Problem 12.7. In the case of a fluid under hydrostatic pressure, so that
$\sigma_{11} = \sigma_{22} = \sigma_{33} = -p$, $\sigma_{12} = \sigma_{23} = \sigma_{31} = 0$,

show that

(11)  $d\mathscr{E} = T \, d\mathscr{S} - p \, d\mathscr{V}$,

where $\mathscr{V} = 1/\rho$ is the specific volume, i.e., the volume per unit mass of the
fluid.

12.3. THE STRAIN ENERGY FUNCTION 

In the theory of elasticity, it is often desired to define a function
$W(e_{11}, e_{12}, \ldots)$ of the strain components $e_{ij}$, with the property that†

(1)  $\frac{\partial W}{\partial e_{ij}} = \sigma_{ij} \qquad (i, j = 1, 2, 3)$.

Such a function $W$ is called the strain energy function. We shall now
identify $W$ with the internal energy in an isentropic process and the free
energy in an isothermal process.

By definition, the state of stress in an elastic body is a unique function of
the strain, and vice versa. Hence, it is sufficient to choose one of these two
tensors, $e_{ij}$ and $\sigma_{ij}$, as an independent state variable. Now, Eq. (12.2:10),

(2a)  $d\mathscr{E} = \frac{1}{\rho} \sigma_{ij} \, de_{ij} + T \, d\mathscr{S}$,

shows that $\mathscr{E}$ is a state function, $\mathscr{E}(e_{ij}, \mathscr{S})$, of the strain $e_{ij}$ and the entropy
per unit mass $\mathscr{S}$. By ordinary rules of differentiation, we have

(2b)  $d\mathscr{E} = \left(\frac{\partial \mathscr{E}}{\partial e_{ij}}\right)_{\mathscr{S}} de_{ij} + \left(\frac{\partial \mathscr{E}}{\partial \mathscr{S}}\right)_{e_{ij}} d\mathscr{S}$.

† We limit our attention to infinitesimal deformations in this chapter. Then $e_{ij}$ is the
infinitesimal strain tensor. For finite deformations, see Chapter 16, especially Section 16.11.

On comparing (2a) and (2b), we see that

(3)  $\left(\frac{\partial \mathscr{E}}{\partial e_{ij}}\right)_{\mathscr{S}} = \frac{\sigma_{ij}}{\rho}, \qquad \left(\frac{\partial \mathscr{E}}{\partial \mathscr{S}}\right)_{e_{ij}} = T$.

Since in infinitesimal deformations the density $\rho$ remains constant, Eq. (3)
states that in a reversible adiabatic process there exists a scalar function
$\rho\mathscr{E}$ whose partial derivative with respect to a strain component gives the
corresponding stress component.†

On the other hand, if the process is isothermal ($T$ = const.), we introduce
the Helmholtz free energy function per unit mass (also called Gibbs'
work function) $\mathscr{F}$:

(4)  $\mathscr{F} = \mathscr{E} - T\mathscr{S}$.

Then, from Eqs. (4) and (2a), we obtain

(5)  $d\mathscr{F} = -\mathscr{S} \, dT + \frac{1}{\rho} \sigma_{ij} \, de_{ij}$,

whence

(6)  $\left(\frac{\partial \mathscr{F}}{\partial e_{ij}}\right)_T = \frac{\sigma_{ij}}{\rho}, \qquad \left(\frac{\partial \mathscr{F}}{\partial T}\right)_{e_{ij}} = -\mathscr{S}$.

Hence, in an isothermal process there also exists a scalar function, $\rho\mathscr{F}$,
whose partial derivative with respect to a strain component gives the
corresponding stress component in an elastic body.

Comparing Eq. (1) with Eq. (3) or (6), we see that $W$ can be identified
with $\rho\mathscr{E}$ or $\rho\mathscr{F}$, depending on whether an isentropic or an isothermal
condition is being considered.

A counterpart to Eq. (6), namely,

(7)  $\left(\frac{\partial \Phi}{\partial \sigma_{ij}}\right)_T = -\frac{e_{ij}}{\rho}$,

is obtained from the Gibbs thermodynamic potential $\Phi$, which is defined as

(8)  $\Phi = \mathscr{E} - T\mathscr{S} - \frac{1}{\rho} \sigma_{ij} e_{ij} = \mathscr{F} - \frac{1}{\rho} \sigma_{ij} e_{ij}$.

For, on differentiating Eq. (8) and using Eq. (5), we have

(9)  $d\Phi = -\mathscr{S} \, dT - \frac{1}{\rho} e_{ij} \, d\sigma_{ij}$,

† In the corresponding Eq. (16.7:1) for finite deformation, the density $\rho_0$ is the uniform
density at the natural state of the material.

from which Eq. (7) follows. Thus, $-\rho\Phi$ may be identified with the
complementary energy function $W_c$, which has the property that

(10)  $\frac{\partial W_c}{\partial \sigma_{ij}} = e_{ij}$.

We have identified the strain energy function for two well-known
thermodynamic processes, adiabatic and isothermal. For these two processes,
no explicit display of temperature in the function $W$ is necessary. In other
words, the stress-strain relation can be written without reference to
temperature. In other thermomechanical processes the situation is not so simple;
explicit display of temperature is required. In general, $W$ is a function of
$e_{ij}$ and $T$. A simple example is discussed in Sec. 12.7.


12.4. CONDITIONS OF THERMODYNAMIC 
EQUILIBRIUM 

The second law of thermodynamics connects the theory with nature in
the statement that the case $d_i\mathscr{S} < 0$ never occurs in nature. It states a
direction of motion of events in the world. According to the proverb, “Time 
is an arrow;” the second law endows the arrow to time. 

Thus, the second law indicates a trend in nature. What is the end 
point of this trend? It is thermodynamic equilibrium. In Sec. 12.1, we 
have defined thermodynamic equilibrium as a situation in which no state 
variable varies with time. For a system to be in thermodynamic equilibrium 
no change in boundary conditions is permissible and no spontaneous process 
which is consistent with the boundary conditions will occur in it. 

To derive the necessary and sufficient conditions for thermodynamic
equilibrium, we compare the system in equilibrium with those neighboring
systems whose state variables are slightly different from those of the
equilibrium state, but all subjected to the same boundary conditions. This
comparison is to be made in the sense of variational calculus. We shall
write $\delta T$, $\delta\mathscr{S}$, etc., to denote the first-order infinitesimals, and $\Delta T$, $\Delta\mathscr{S}$,
etc., to denote the variations including the first-, second-, and higher-order
infinitesimal terms. Thus, if we consider a homogeneous system A and a
neighboring state B, we have the following variables:

    System A (in equilibrium)        System B (neighboring to A)

    $\mathscr{E}$, $e_{ij}$, $\sigma_{ij}$, $\mathscr{S}$, $T$                $\mathscr{E} + \Delta\mathscr{E}$, $e_{ij} + \Delta e_{ij}$, $\sigma_{ij} + \Delta\sigma_{ij}$,
                                     $\mathscr{S} + \Delta\mathscr{S}$, $T + \Delta T$.

We shall require that the first law be satisfied so that all variations must
be subjected to the restriction

(1)  $\Delta\mathscr{E} = \Delta Q + \frac{1}{\rho}(\sigma_{ij} + \Delta\sigma_{ij}) \Delta e_{ij}, \qquad \delta\mathscr{E} = \delta Q + \frac{1}{\rho} \sigma_{ij} \, \delta e_{ij}$.

Let us consider the following particular case. We require the system B
to have the same energy as A and that the boundaries are rigid so that no
work can be done; thus $\Delta\mathscr{E} = 0$, $\Delta e_{ij} = 0$. It follows from (1) that $\Delta Q = 0$
too. Now system B has the entropy $\mathscr{S} + \Delta\mathscr{S}$. If any spontaneous changes
can occur in B at all, they would occur in the direction of increasing entropy.
Therefore, if $\Delta\mathscr{S} < 0$, we must say that the second law would permit
system B to change itself into the state of A. If $\Delta\mathscr{S} > 0$, no such change
is permitted. The second law does not say that if $\Delta\mathscr{S} < 0$, the change
must occur. But if we make an additional assumption (as an addition to
the second law) that which is permitted by the second law will actually
occur in nature, then $\Delta\mathscr{S} < 0$ necessarily implies that B will change itself
into A. On the other hand, if $\Delta\mathscr{S} < 0$, the second law prohibits A from
changing into B. Therefore, if energy variation is prohibited the necessary
and sufficient condition for the system A to be in the state of thermodynamic
equilibrium is

(2) ▲  $(\Delta\mathscr{S})_{\mathscr{E}} < 0$.

In other words, the entropy $\mathscr{S}$ of the system A is the maximum with
respect to all the neighboring states which have the same energy. Hence,
we have the variational equation

(3) ▲  $\mathscr{S} = \max. \qquad (\mathscr{E} = \text{const.}, \; e_{ij} = \text{const.})$

for thermodynamic equilibrium. If we restrict ourselves to the first-order
infinitesimals, the condition for thermodynamic equilibrium may be written

(4) ▲  $(\delta\mathscr{S})_{\mathscr{E}} \le 0$.

This is the celebrated condition of equilibrium of Willard Gibbs. Gibbs
gave an alternative statement, which is more directly applicable to
continuum mechanics: for the equilibrium of any isolated system it is necessary
and sufficient that in all possible variations in the state of the system which
do not alter its entropy, the variation of its energy shall either vanish or be
positive:

(5) ▲  $(\Delta\mathscr{E})_{\mathscr{S}} > 0, \qquad (\delta\mathscr{E})_{\mathscr{S}} \ge 0$.

In other words, the internal energy shall be a minimum with respect to all
neighboring states which have the same entropy.

Gibbs proves the equivalence of conditions (2) and (5) from the
consideration that it is always possible to increase both the energy and the
entropy of a system, or to decrease both together, viz., by imparting heat
to any part of the system or by taking it away. Now, if the condition (2) is
not satisfied, there must be some neighboring systems for which

$\Delta\mathscr{S} > 0$ and $\Delta\mathscr{E} = 0$.

Then, by diminishing both the energy and the entropy of the system in this
varied state, we shall obtain a state for which

$\Delta\mathscr{S} = 0$ and $\Delta\mathscr{E} < 0$;

therefore, (5) is not satisfied. Conversely, if condition (5) is not satisfied,
there must be a variation in the state of the system for which

$\Delta\mathscr{E} < 0$ and $\Delta\mathscr{S} = 0$.

Hence, there must also be one for which

$\Delta\mathscr{E} = 0$ and $\Delta\mathscr{S} > 0$.

Therefore, condition (2) is not satisfied. Taken together, the equivalence of
Eqs. (2) and (5) is established.

The derivation of Gibbs' conditions for thermodynamic equilibrium requires
a little more than the first and second laws of thermodynamics. The
conditions also guarantee more than equilibrium in the ordinary sense:
they guarantee stable equilibrium, stable in the sense that a neighboring
disturbed state will actually tend to return to the equilibrium state.

12.5. THE POSITIVE DEFINITENESS OF THE STRAIN 
ENERGY FUNCTION 

A piece of steel can exist for a long time without any spontaneous
changes taking place in it. Thus, it is in a stable thermodynamic equilibrium
condition, and is said to be in a zero-stress state. When we strain a
piece of steel with an application of forces, the strained body is in a state
disturbed from the zero-stress one. If the metal remains elastic, the strained
body will return to its zero-stress state upon removal of the applied forces.
This tendency to return to the zero-stress state endows a property to a
strain energy function which we shall now investigate.

Since the zero-stress state of a solid body is a stable equilibrium state,
it follows from Eq. (12.4:5) that at constant entropy the strain energy must
be a minimum. Consider now a strained solid. Let the internal energy of
a strained body be $\mathscr{E}$ and that of an unstrained body be $\mathscr{E}_0$. According to
Gibbs' theorem, therefore, the stability of the unstrained state implies that
the difference $\Delta\mathscr{E} = \mathscr{E} - \mathscr{E}_0$ is positive, and it vanishes only in the trivial
case of the zero-stress state itself. Therefore, in isentropic processes

(1)  $\Delta\mathscr{E} = \mathscr{E} - \mathscr{E}_0 \ge 0$

in the neighborhood of the zero-stress state; the equality sign holds only in
the zero-stress state. In other words, $\mathscr{E} - \mathscr{E}_0$ is positive definite.

In the case of an isothermal process, it can be deduced from the
Helmholtz free energy function that $\mathscr{F}$ attains its minimum, $\mathscr{F}_0$, at
thermodynamic equilibrium. A similar consideration as above shows that
$\Delta\mathscr{F} = \mathscr{F} - \mathscr{F}_0$ is positive definite in the neighborhood of the zero-stress
state in the isothermal condition.

In either the adiabatic or the isothermal condition, the strain energy
function $W$ can be identified as $\rho(\mathscr{E} - \mathscr{E}_0)$ or $\rho(\mathscr{F} - \mathscr{F}_0)$, respectively.
Therefore, since $\rho > 0$, the strain energy function $W$ is positive definite in
the neighborhood of the zero-stress state.

From the positive definiteness of the strain energy function, the following 
important theorems are deduced. 

1. The uniqueness of the solution in elastostatics and dynamics. (Sec. 7.4.)

2. The minimum potential energy theorem. (Sec. 10.7.)

3. The minimum complementary energy theorem. (Sec. 10.9.)

4. Saint-Venant's principle (in a certain sense). (Sec. 10.11.)


12.6. THERMODYNAMIC RESTRICTIONS ON THE 
STRESS-STRAIN LAW OF AN ISOTROPIC 
ELASTIC MATERIAL 

We shall consider those restrictions imposed by thermodynamics on the 
stress-strain law proposed in Sec. 6.2, which is supposed to hold in isentropic 
or isothermal conditions. 

By definition of the strain energy function, $dW = \sigma_{ij} \, de_{ij}$. If we
substitute the stress-strain law Eq. (6.2:7) into this equation, we have

(1)  $dW = (\lambda e \delta_{ij} + 2G e_{ij}) \, de_{ij} = \lambda e \, de + 2G e_{ij} \, de_{ij}, \qquad e = e_{\alpha\alpha}$.

The elastic constants $\lambda$ and $G$ differ slightly in adiabatic and isothermal
conditions (Sec. 14.3), but this difference is of no concern here. On
integrating Eq. (1) from the zero-stress to the strained state, we have

(2)  $W = \int_0^{e} \lambda e \, de + \int_0^{e_{ij}} 2G e_{ij} \, de_{ij}$;

i.e.,

(3)  $W = \frac{\lambda e^2}{2} + G e_{ij} e_{ij}$.

This function is required to be positive definite from thermodynamic
considerations. Since all terms appear as squares, an obvious set of sufficient
conditions is

(4)  $\lambda > 0, \qquad G > 0$.

But this condition is not necessary because the term $e_{ij} e_{ij}$ is not
independent of $e$. Therefore, let us introduce the strain deviation
$e'_{ij} = e_{ij} - \frac{1}{3} e \delta_{ij}$, whose first invariant vanishes, and rewrite Eq. (3) in the form

(5)  $W = \frac{1}{2}\left(\lambda + \frac{2}{3} G\right) e^2 + G \left(e_{ij} - \frac{1}{3} e \delta_{ij}\right)\left(e_{ij} - \frac{1}{3} e \delta_{ij}\right)$.

On recognizing that $\lambda + \frac{2}{3} G$ is the bulk modulus $K$ and
$\frac{1}{2}\left(e_{ij} - \frac{1}{3} e \delta_{ij}\right)\left(e_{ij} - \frac{1}{3} e \delta_{ij}\right)$ is the second invariant of the strain deviation $e'_{ij}$,

(6)  $J_2 = \frac{1}{2} e'_{ij} e'_{ij}$,

we see that Eq. (5) may be written as

(7)  $W = \frac{1}{2} K e^2 + 2 G J_2$.

The two groups of square terms are now independent of each other. Hence,
the necessary and sufficient condition for $W$ to be positive definite is

(8)  $K = \lambda + \frac{2}{3} G > 0, \qquad G > 0$.

Since $K$ and $G$ are related to Young's modulus $E$ and Poisson's ratio $\nu$ by

(9)  $K = \frac{E}{3(1 - 2\nu)}, \qquad G = \frac{E}{2(1 + \nu)}$,

an equivalent set of conditions is

(10)  $E > 0, \qquad -1 < \nu < \frac{1}{2}$.

This result is obtained under the assumption that the stress-strain
relationship is linear and isotropic, that the strain energy function is positive
definite, and, tacitly, that the material is homogeneous and without
hierarchical internal structure. The values of $E$ and $\nu$ of some common materials
are given in Sec. 6.2, Table 6.2:1. Common steels have $\nu$ about 0.25, aluminium
alloys have $\nu$ about 0.32; but the value of $\nu$ for beryllium is near
zero (0.01 - 0.06), and for lead is near 0.45. For some polymer foams $\nu$
can be greater than 1 or smaller than $-1$; see papers by Lakes and Lee and
Lakes listed in Bibliography 12.1.
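
The relations of this section lend themselves to a quick numerical check. The sketch below is illustrative only: the function names are introduced here, the value E = 200 GPa and the strain components are assumed sample numbers, and the formula for the Lamé constant is obtained from K = λ + 2G/3. It evaluates W both from Eq. (3) and from Eq. (7), and shows a case (ν = −0.5) in which λ is negative while W remains positive, so that λ > 0 is sufficient but not necessary.

```python
import math

def moduli_from_E_nu(E, nu):
    """Return (lam, G, K) from Young's modulus E and Poisson's ratio nu, Eq. (9)."""
    G = E / (2.0 * (1.0 + nu))
    K = E / (3.0 * (1.0 - 2.0 * nu))
    lam = K - 2.0 * G / 3.0  # rearranged from K = lam + 2G/3
    return lam, G, K

def satisfies_eq_10(E, nu):
    """Condition (10): E > 0 and -1 < nu < 1/2."""
    return E > 0.0 and -1.0 < nu < 0.5

def trace(e):
    return e[0][0] + e[1][1] + e[2][2]

def strain_energy_lame(e, lam, G):
    """W from Eq. (3): lam * e^2 / 2 + G * e_ij * e_ij."""
    ee = sum(e[i][j] ** 2 for i in range(3) for j in range(3))
    return 0.5 * lam * trace(e) ** 2 + G * ee

def strain_energy_KG(e, K, G):
    """W from Eq. (7): K * e^2 / 2 + 2 * G * J2, J2 from the strain deviation."""
    tr = trace(e)
    dev = [[e[i][j] - (tr / 3.0 if i == j else 0.0) for j in range(3)]
           for i in range(3)]
    J2 = 0.5 * sum(dev[i][j] ** 2 for i in range(3) for j in range(3))
    return 0.5 * K * tr ** 2 + 2.0 * G * J2

# Assumed sample strain state (symmetric, infinitesimal).
strain = [[1.0e-3, 2.0e-4, 0.0],
          [2.0e-4, -5.0e-4, 1.0e-4],
          [0.0, 1.0e-4, 3.0e-4]]

# nu = 0.25 is the value quoted above for common steels; E is assumed.
lam, G, K = moduli_from_E_nu(200.0e9, 0.25)
W_lame = strain_energy_lame(strain, lam, G)
W_KG = strain_energy_KG(strain, K, G)

# A negative Poisson's ratio inside the admissible range of Eq. (10).
lam_neg, G_neg, K_neg = moduli_from_E_nu(200.0e9, -0.5)
W_neg = strain_energy_lame(strain, lam_neg, G_neg)
```

Here `W_lame` and `W_KG` agree to rounding error, and for ν = −0.5 the constants satisfy K > 0 and G > 0 although λ < 0.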

Smart materials have fine ultrastructures whose states influence the 
overall mechanical properties of the materials. Hence their state variables 
are not limited to stress, strain, and temperature. Therefore, they are 
not bounded by the Eq. (10) which is derived under the restricted set of 
state variables. This raises the question: under what condition can the 
ultrastructure be ignored? There is no simple answer, especially in a field 
like biomechanics, in which organs, tissues, cells, subcellular units, and 
molecules form successive hierarchies. In fact, investigations of questions 
at the borderlines of neighboring hierarchies could be most rewarding. 


12.7. GENERALIZED HOOKE’S LAW, INCLUDING THE 
EFFECT OF THERMAL EXPANSION 

We shall now consider the stress-strain law in a general thermal
environment, not limited to isothermal or isentropic conditions. Let us assume
that the strain energy function $W$ can be expanded in a power series

(1)  $2W = C_0 + C_{ij} e_{ij} + C_{ijkl} e_{ij} e_{kl} + \cdots$,

where the coefficients $C_0$, $C_{ij}$, etc., are functions of the entropy $\mathscr{S}$ and the
temperature $T$. Since $W = 0$ when $e_{ij} = 0$ by definition, the constant $C_0$
must vanish. As to $C_{ij}$, we note that since

$\sigma_{ij} = \frac{\partial W}{\partial e_{ij}} = \frac{C_{ij}}{2} + O(e_{ij})$,

$C_{ij}$ means twice the value of the stress component $\sigma_{ij}$ at zero strain. Such
stresses at zero strain can be brought about by a linear expansion if there
are temperature changes from a standard state. The simplest assumption
is to let $C_{ij}$ be linearly proportional to the temperature change:

$C_{ij} = -2\beta_{ij}(T - T_0)$,

where $\beta_{ij}$ are numerical constants. Hence, we have, when $e_{ij}$ are small and
terms higher than the second order are neglected,

(2) ▲  $W = -\beta_{ij}(T - T_0) e_{ij} + \frac{1}{2} C_{ijkl} e_{ij} e_{kl}$.

The stress components are, therefore,

(3)  $\sigma_{ij} = \frac{\partial W}{\partial e_{ij}} = \frac{1}{2}(C_{ijkl} + C_{klij}) e_{kl} - \beta_{ij}(T - T_0)$.

This is the general Duhamel-Neumann form of Hooke's law. If the
quadratic form in Eq. (2) is symmetrized in advance, we can write Eq. (3)
in the form

(4) ▲  $\sigma_{ij} = C_{ijkl} e_{kl} - \beta_{ij}(T - T_0)$,

where

(5)  $C_{ijkl} = C_{klij}$

is a tensor of rank 4. Since $e_{ij}$ and $\sigma_{ij}$ are symmetric tensors, we also have

(6)  $C_{ijkl} = C_{jikl}, \qquad C_{ijkl} = C_{ijlk}, \qquad \beta_{ij} = \beta_{ji}$.

Thus, for an anisotropic medium there are 21 independent elastic constants
$C_{ijkl}$, and six independent constants $\beta_{ij}$. This number is reduced to two
$C$'s and one $\beta$ if the medium is isotropic. The relationship for an isotropic
material is

(7) ▲  $\sigma_{ij} = \lambda e \delta_{ij} + 2G e_{ij} - \beta(T - T_0) \delta_{ij}$,

or

(8) ▲  $e_{ij} = -\frac{\nu}{E} s \delta_{ij} + \frac{1 + \nu}{E} \sigma_{ij} + \alpha(T - T_0) \delta_{ij}$,
where $s = \sigma_{kk}$ and $\beta = E\alpha/(1 - 2\nu)$. The complementary energy
function is

$W_c = \frac{1}{4G}\left(\sigma_{ij} \sigma_{ij} - \frac{\nu}{1 + \nu} s^2\right) + \alpha(T - T_0) s$.

Here $\alpha$ is the linear coefficient of thermal expansion, $T_0$ is the temperature
of a reference state of the body, and $T - T_0$ is the rise of temperature above
the reference state.
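
As an illustration of how Eqs. (7) and (8) fit together, the following sketch maps a strain state to stress and back for an isotropic material. It is a minimal example under stated assumptions: the relation β = Eα/(1 − 2ν) is the one obtained by inverting Eq. (7); the formula for λ in terms of E and ν is the standard one; the numerical values (E, α, the temperature rise, the strain components) are assumed sample data, with ν = 0.32 taken from the value quoted for aluminium alloys in Sec. 12.6; and the function names are introduced here for illustration.

```python
def trace(a):
    return a[0][0] + a[1][1] + a[2][2]

def stress_from_strain(e, dT, lam, G, beta):
    """Eq. (7): sigma_ij = lam*e*delta_ij + 2*G*e_ij - beta*dT*delta_ij."""
    tr = trace(e)
    return [[(lam * tr - beta * dT if i == j else 0.0) + 2.0 * G * e[i][j]
             for j in range(3)] for i in range(3)]

def strain_from_stress(sig, dT, E, nu, alpha):
    """Eq. (8): e_ij = -(nu/E)*s*delta_ij + ((1+nu)/E)*sigma_ij + alpha*dT*delta_ij."""
    s = trace(sig)
    return [[(1.0 + nu) / E * sig[i][j]
             + ((-nu / E) * s + alpha * dT if i == j else 0.0)
             for j in range(3)] for i in range(3)]

# Assumed illustrative constants (roughly an aluminium alloy).
E, nu, alpha = 70.0e9, 0.32, 23.0e-6
G = E / (2.0 * (1.0 + nu))
lam = E * nu / ((1.0 + nu) * (1.0 - 2.0 * nu))
beta = E * alpha / (1.0 - 2.0 * nu)
dT = 50.0  # temperature rise T - T0, assumed

strain = [[1.0e-3, 2.0e-4, 0.0],
          [2.0e-4, -5.0e-4, 1.0e-4],
          [0.0, 1.0e-4, 3.0e-4]]

stress = stress_from_strain(strain, dT, lam, G, beta)
strain_back = strain_from_stress(stress, dT, E, nu, alpha)

# Fully constrained heating: zero strain gives a hydrostatic stress -beta*dT.
zero_strain = [[0.0, 0.0, 0.0] for _ in range(3)]
clamped_stress = stress_from_strain(zero_strain, dT, lam, G, beta)
```

The round trip recovers the original strain to rounding error, and the clamped case reproduces the interpretation of $\beta_{ij}(T - T_0)$ as the stress at zero strain.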

The strain energy function must be a relative minimum at the zero-stress
state (see Sec. 12.4), which implies that, when $T = T_0$, the quadratic
form

(12)  $W = \frac{1}{2} C_{ijkl} e_{ij} e_{kl}$

must be positive definite.


PROBLEMS 


12.8. State the conditions which must be satisfied by the constants $C_{ijkl}$ in
order that the quadratic form (12) be positive definite.

12.9. Let $T = T_0$, and derive Clapeyron's formula for a linear elastic
material: $W = \frac{1}{2} \sigma_{ij} e_{ij}$.

12.10. In a conventional structural analysis, honeycomb sandwich material
is often treated as a homogeneous orthotropic material, provided that the
dimensions of the structure are much larger than the dimensions of the individual
honeycomb cell. Suggest a stress-strain law for an orthotropic material.
Determine explicitly the limitations imposed on the elastic constants that appear in
the stress-strain law by the first and second laws of thermodynamics.

12.8. THERMODYNAMIC FUNCTIONS FOR ISOTROPIC 
HOOKEAN MATERIALS 

It will be interesting to record the thermodynamic functions for an
isotropic elastic material. Restricted to infinitesimal strains, the derivation
of the formulas is quite simple. Consider a perfectly elastic, isotropic
body which obeys the Duhamel-Neumann generalization of Hooke's law,
Eq. (12.7:7). In a reversible, infinitesimal deformation, we have

(1)  $d\mathscr{F} = -\mathscr{S} \, dT + \frac{1}{\rho} \sigma_{ij} \, de_{ij}$,

where $T$, $\mathscr{E}$, $\mathscr{S}$, and $\mathscr{F} = \mathscr{E} - T\mathscr{S}$ are, respectively, the absolute
temperature, internal energy, entropy, and Helmholtz free energy per unit mass. If
we substitute $\sigma_{ij}$ from Eq. (12.7:7) into Eq. (1), we obtain

(2)  $\rho \, d\mathscr{F} = -\rho\mathscr{S} \, dT + \lambda e \, de - \beta(T - T_0) \, de + 2G e_{ij} \, de_{ij}, \qquad e = e_{\alpha\alpha}$,

which can be integrated with respect to the strain to yield

(3)  $\rho\mathscr{F} = \frac{\lambda}{2} e^2 - \beta(T - T_0) e + G e_{ij} e_{ij} + C_1(T)$,

where $C_1(T)$ is a function of $T$. To evaluate $C_1$, we first note that Eq. (1)
implies that $(\partial\mathscr{F}/\partial T)_{e_{ij}} = -\mathscr{S}$ [see Eq. (12.3:6)]. Hence, from Eq. (3),


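(4)  $\rho\mathscr{S} = -\frac{e^2}{2} \frac{d\lambda}{dT} - e_{ij} e_{ij} \frac{dG}{dT} + e \frac{d}{dT}\left[\beta(T - T_0)\right] - \frac{dC_1}{dT}$.

Now, the function $C_1$ can be expressed in terms of the heat capacity per
unit mass at constant strain, $C_v$. For, on the one hand, since $\mathscr{S}$ is a function
of $T$ and $e_{ij}$, we have that, if $de_{ij} = 0$,

(5)  $dQ = T\rho \, d\mathscr{S} = T\rho \left(\frac{\partial \mathscr{S}}{\partial T}\right)_{e_{ij}} dT$.

On the other hand, we have, by definition, that at zero strain

(6)  $dQ = \rho C_v \, dT$.

Hence

(7)  $C_v = T \left(\frac{\partial \mathscr{S}}{\partial T}\right)_{e_{ij}}$,

which relates $C_v$ with $C_1$, when $\mathscr{S}$ is substituted from Eq. (4). Simple
relationships are obtained if $C_v$ is evaluated at zero strain. Then

(8)  $(C_v)_{e_{ij}=0} = -\frac{T}{\rho} \frac{d^2 C_1}{dT^2}$,

or, if we assume that $\mathscr{S}$ and $\mathscr{F}$ vanish for zero strain at $T = T_0$,

(9)  $C_1 = -\rho \int_{T_0}^{T} dT' \int_{T_0}^{T'} (C_v)_{e_{ij}=0} \frac{dT''}{T''}$.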
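Now, the internal energy $\rho\mathscr{E}$ per unit volume can be obtained. By
Eqs. (3), (4) and (12.6:3)-(12.6:9), we have

(10)  $\rho\mathscr{E} = \rho(\mathscr{F} + T\mathscr{S})$
        $= \frac{1}{2} e^2 \left(K - T \frac{dK}{dT}\right) + 2 J_2 \left(G - T \frac{dG}{dT}\right) + e \left[\beta T_0 + T(T - T_0) \frac{d\beta}{dT}\right] + C_1 - T \frac{dC_1}{dT}$.

If $\lambda$, $G$, $\beta$, and $C_v$ are independent of temperature, Eqs. (9) and (10) are
simplified to

(11)  $C_1 = -\rho C_v T_0 \left(\frac{T}{T_0} \log \frac{T}{T_0} - \frac{T}{T_0} + 1\right)$,

(12)  $\rho\mathscr{E} = \frac{1}{2} K e^2 + 2 G J_2 + \beta T_0 e + \rho (C_v)_{e_{ij}=0} (T - T_0)$,

where $J_2$ is the second invariant of the strain deviation:

(13)  $J_2 = \frac{1}{2}\left(e_{ij} - \frac{1}{3} e \delta_{ij}\right)\left(e_{ij} - \frac{1}{3} e \delta_{ij}\right)$.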
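
The constant-coefficient results can be checked numerically. The sketch below assumes a temperature-independent $C_v$ and uses the closed form $C_1 = -\rho C_v (T \log(T/T_0) - T + T_0)$, which follows from integrating Eq. (8) twice with $\mathscr{S}$ and $\mathscr{F}$ vanishing at zero strain and $T = T_0$. The material values are assumed sample numbers and the function names are introduced here for illustration. It recovers $C_v$ from the second derivative of $C_1$ as in Eq. (8), and confirms that the thermal part of the internal energy, $C_1 - T \, dC_1/dT$, equals $\rho C_v (T - T_0)$ as in Eq. (12).

```python
import math

def C1(T, T0, rho, Cv):
    """Closed form of C_1(T) for temperature-independent C_v."""
    return -rho * Cv * (T * math.log(T / T0) - T + T0)

def dC1_dT(T, T0, rho, Cv):
    """Analytic first derivative of C_1 with respect to T."""
    return -rho * Cv * math.log(T / T0)

def second_derivative(f, x, h=1.0e-2):
    """Central finite-difference estimate of f''(x)."""
    return (f(x + h) - 2.0 * f(x) + f(x - h)) / (h * h)

# Assumed illustrative values: density, heat capacity, reference and current T.
rho, Cv, T0, T = 7800.0, 460.0, 300.0, 350.0

# Eq. (8): C_v at zero strain equals -(T / rho) * d^2 C_1 / dT^2.
Cv_recovered = -(T / rho) * second_derivative(lambda t: C1(t, T0, rho, Cv), T)

# Thermal part of Eq. (10) at zero strain, compared with the last term of Eq. (12).
thermal_energy = C1(T, T0, rho, Cv) - T * dC1_dT(T, T0, rho, Cv)
expected_thermal_energy = rho * Cv * (T - T0)
```

The finite-difference step `h` is a convenience of this sketch; an analytic second derivative would serve equally well.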

Problem 12.11. Starting with the equation

(14)  $d\Phi = -\mathscr{S} \, dT - \frac{1}{\rho} e_{ij} \, d\sigma_{ij}$,

where $\Phi$ is Gibbs' thermodynamic potential per unit mass, show that

(15)  $\rho\Phi = \frac{\nu}{2E} s^2 - \frac{1 + \nu}{2E} \sigma_{ij} \sigma_{ij} - \alpha(T - T_0) s + C_2(T)$,

where $s = \sigma_{jj}$ and $C_p$ is the heat capacity per unit mass at constant stresses.

12.9. EQUATIONS CONNECTING THERMAL AND 
MECHANICAL PROPERTIES OF A SOLID 

It is possible to derive relations which connect the specific heats, the 
moduli of elasticity, the latent heats of change of strain or stress at constant 
temperature, etc. These results were given by Lord Kelvin and by Gibbs. 

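The specific heat at constant strain and the various latent heats of
change of strain are defined by the following equation.

(1)  $dQ = C_v \, dT + l_{e_{ij}} \, de_{ij}$,

where $dQ$ is the heat required to change the temperature of a unit mass by
$dT$ and the strain by $de_{ij}$, $C_v$ is the specific heat per unit mass measured in
the state of constant strain, and $l_{e_{ij}}$ are the six latent heats per unit mass
due to a change in a component of strain (in each case the temperature
and the remaining five strain components are unchanged). Now, under
reversible conditions, the change of entropy is

(2)  $d\mathscr{S} = \frac{dQ}{T} = \frac{C_v}{T} dT + \frac{l_{e_{ij}}}{T} de_{ij}$.

Hence,

(3)  $\frac{\partial \mathscr{S}}{\partial T} = \frac{C_v}{T}, \qquad \frac{\partial \mathscr{S}}{\partial e_{ij}} = \frac{l_{e_{ij}}}{T}$.

But from Eq. (12.3:6) we have, by further differentiations,

$\frac{\partial \mathscr{S}}{\partial e_{ij}} = -\frac{\partial^2 \mathscr{F}}{\partial T \, \partial e_{ij}} = -\frac{1}{\rho} \frac{\partial \sigma_{ij}}{\partial T}, \qquad \frac{\partial^2 \mathscr{S}}{\partial e_{ij} \, \partial T} = -\frac{1}{\rho} \frac{\partial^2 \sigma_{ij}}{\partial T^2}$.

It follows that

(4)  $l_{e_{ij}} = -\frac{T}{\rho} \frac{\partial \sigma_{ij}}{\partial T}, \qquad \frac{\partial C_v}{\partial e_{ij}} = -\frac{T}{\rho} \frac{\partial^2 \sigma_{ij}}{\partial T^2}$.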

In a similar manner we may choose the temperature and the six components
of stress as thermodynamic variables and define the specific heat
per unit mass at constant stress and the latent heats per unit mass due to
change of stress by an equation analogous to Eq. (2),

(5)  $d\mathscr{S} = \frac{C_p}{T} dT + \frac{L_{\sigma_{ij}}}{T} d\sigma_{ij}$.

The specific heat $C_p(T, \sigma)$ is more practically obtainable under usual
conditions of measurement where the external forces on the solid are
unchanged. The latent heats $L_{\sigma_{ij}}$ are defined each at a constant
temperature and with five of the stress components unchanged.

A similar analysis with Gibbs' potential $\Phi$ leads to the results

(6)  $L_{\sigma_{ij}} = \frac{T}{\rho} \frac{\partial e_{ij}}{\partial T}, \qquad \frac{\partial C_p}{\partial \sigma_{ij}} = \frac{T}{\rho} \frac{\partial^2 e_{ij}}{\partial T^2}$.

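The difference between the two specific heats can be expressed as follows.
Let an infinitesimal change take place at constant stress. The change of
entropy can be expressed in two ways. By Eq. (2), it is

$d\mathscr{S} = \frac{C_v}{T} dT + \frac{l_{e_{ij}}}{T} \left(\frac{\partial e_{ij}}{\partial T}\right)_{\sigma} dT$;

by Eq. (5), it is

$d\mathscr{S} = \frac{C_p}{T} dT$.

Equating these two expressions, we obtain

(7)  $C_p - C_v = l_{e_{ij}} \left(\frac{\partial e_{ij}}{\partial T}\right)_{\sigma}$.

By Eq. (4), this becomes

(8)  $C_p - C_v = -\frac{T}{\rho} \frac{\partial e_{ij}(T, \sigma)}{\partial T} \frac{\partial \sigma_{ij}(T, e)}{\partial T}$.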

Problem 12.12. Prove the following reciprocal relations.

$\frac{\partial \sigma_{ij}}{\partial e_{kl}} = \frac{\partial \sigma_{kl}}{\partial e_{ij}}, \qquad \frac{\partial e_{ij}}{\partial \sigma_{kl}} = \frac{\partial e_{kl}}{\partial \sigma_{ij}}$.


IRREVERSIBLE THERMODYNAMICS
AND VISCOELASTICITY


Problems involving heat conduction, viscosity, and plasticity belong to the
realm of irreversible thermodynamics. In this chapter we shall outline the
methods of irreversible thermodynamics, limiting ourselves to linear
processes. Biot's treatment of relaxation modes and hidden variables will
be presented and applied to the stress-strain relationship of viscoelastic
materials.

13.1. BASIC ASSUMPTIONS 

The classical thermodynamics of Chapter 12 deals essentially with
equilibrium conditions of a uniform system (or a heterogeneous system with
uniform phases). As far as irreversible processes are concerned, it contains
little more than a statement about the trend toward thermodynamic
equilibrium. To bring the irreversibility to a sharper focus, we would like to
be able to write down definite “equations of evolution” (or equations of
“motion”), which describe in precise terms the way an irreversible process
evolves. To make the theory useful to continuum mechanics, we must
extend the formulation to include nonuniform systems. These two items are
the main objectives of the theory of irreversible thermodynamics. Since
these objectives are beyond the scope of classical thermodynamics, some
new hypotheses must be introduced, whose justification can only be sought
by comparing any theoretical deductions with experiments.

The first assumption is that the entropy is a function of state in
irreversible processes as well as in reversible processes. In Chapter 12, we
considered reversible processes and showed that the value of the entropy of
a system can be computed when the values of the state variables are known
(except for an integration constant which is, in general, of no importance).
(Near zero absolute temperature, the additive constant of entropy does
become important, but in that case its value can be fixed according to
Nernst's theorem, the third law of thermodynamics.) We now assume
that this same function of state defines entropy even if the system is not in
equilibrium. The motivation for this assumption is analogous to our
attitude toward the mass distribution of any mechanical system: once the mass
of a particle in a continuum is determined, we assume that it remains the
same no matter how fast the particle moves.

Just as the constancy of mass must be subjected to relativistic
restrictions, the validity of the assumption of entropy as a state function must
be restricted to relatively mild processes. Prigogine has investigated this
question by comparing our assumption with results of statistical mechanics
for some particular models, such as the Chapman-Enskog theory of
nonuniform gases. He shows that the domain of validity of this assumption
extends throughout the domain of validity of linear phenomenological
laws (Fourier's law of heat conduction, Fick's law of mass diffusion, etc.).
In the case of chemical reactions, he shows that the reaction rates must
be sufficiently slow so as not to perturb the Maxwell-Boltzmann equilibrium
distribution of velocities of each component to an appreciable extent. This
means, for example, that the temperature changes over the length of one
mean free path must be much smaller than the absolute temperature itself;
and the like for other properties.

The second assumption consists of an extension of the second law of
thermodynamics locally to every portion of a continuum, whether it is
uniform or nonuniform, and may be explained as follows. We notice first that
entropy is an extensive quantity, so it must be subjected to a conservation
law: for a given set of particles occupying a domain $V$ the total change
of entropy must be equal to the total amount of entropy transferred to
these particles through the boundary, plus the entropy produced inside
this domain. Let the boundary of this domain be denoted by $B$. Then we
may write

(1)  $\frac{D}{Dt} \int_V \rho \mathscr{S} \, dV = -\int_B \dot{\boldsymbol{\phi}} \cdot d\mathbf{A} + \frac{D_i}{Dt} \int_V \rho \mathscr{S} \, dV$,

where $\mathscr{S}$ is the specific entropy per unit mass, $D_i\mathscr{S}/Dt$ is the entropy source
(internal entropy production per unit mass), $\dot{\boldsymbol{\phi}}$ is the entropy flow vector
on the boundary, and $d\mathbf{A}$ is a surface element whose vector direction coincides
with the outer normal, so that $\dot{\boldsymbol{\phi}} \cdot d\mathbf{A}$ is a scalar product representing
the outflow, which is equal to the normal component of $\dot{\boldsymbol{\phi}}$ times the area
$dA$. The material derivative $D/Dt$ is taken with respect to a given set of
particles. On transforming the second term into a volume integral and using
Eq. (5.3:4) to reduce the material derivative of an integral, and realizing
that the domain $V$ is arbitrary, we obtain

$\rho \frac{D\mathscr{S}}{Dt} + \mathscr{S} \left(\frac{D\rho}{Dt} + \rho \,\mathrm{div}\, \mathbf{v}\right) = -\mathrm{div}\, \dot{\boldsymbol{\phi}} + \rho \frac{D_i\mathscr{S}}{Dt} + \mathscr{S} \left(\frac{D\rho}{Dt} + \rho \,\mathrm{div}\, \mathbf{v}\right)$,

where $\mathbf{v}$ is the particle velocity vector of the flow field. The sum in the
parentheses vanishes by the equation of continuity, Eq. (5.4:4). Hence, we
have

(2) ▲  $\rho \frac{D\mathscr{S}}{Dt} = -\mathrm{div}\, \dot{\boldsymbol{\phi}} + \rho \frac{D_i\mathscr{S}}{Dt}$.

We can now state the second hypothesis (an extension of the second law of
thermodynamics) as follows.

(3) ▲  $\dot{\boldsymbol{\phi}} = \frac{\mathbf{h}}{T}$,  $\mathbf{h}$ = heat flux vector,

(4) ▲  $\frac{D_i\mathscr{S}}{Dt} = 0$,  reversible processes,

(5) ▲  $\frac{D_i\mathscr{S}}{Dt} > 0$,  irreversible processes,

(6)  $\frac{D_i\mathscr{S}}{Dt} < 0$,  never occurs in nature.

In Eq. (3), $\mathbf{h}$ is the heat flux vector representing the heat flow per unit area
and $T$ is the local absolute temperature. In a moving medium, $\mathbf{h}$ is defined
convectively, i.e., with respect to elements of surfaces that are composed
always of the same sets of particles.

When Eqs. (3)-(5) are compared with Eqs. (12.1:6) and (12.1:7), we see
that the entropy production is now required to be nonnegative everywhere
in the system. Such a formulation may be called a local formulation of
the second law in contrast to the global formulation of classical
thermodynamics.

Corresponding to the entropy flow $\dot{\boldsymbol{\phi}}$ we have a vector field $\boldsymbol{\phi}$, which is the
entropy displacement field introduced by Biot^{13.1} (1956). If we require that

(7)  $\boldsymbol{\phi} = 0$ at $t = t_0$,

then the points of the body at time $t > t_0$ are assigned an entropy
displacement vector $\boldsymbol{\phi}$, just as they are assigned a material displacement vector
$\mathbf{u}$. The two vector fields $\mathbf{u}$ and $\boldsymbol{\phi}$ connect the state of the body at time
$t > t_0$ with the natural state at $t_0$. The material displacement field defines
the reversible change of mass distribution by $-\mathrm{div}(\rho\mathbf{u})$; the entropy
displacement field $\boldsymbol{\phi}$ defines a reversible change of entropy by $-\mathrm{div}\, \boldsymbol{\phi}$. The
material displacement field can be analyzed through the displacement
gradient tensor $u_{i,j}$; the entropy displacement field can be analyzed through
the second order tensor $\phi_{i,j}$.

If we introduce a notation analogous to that of Eq. (12.1:5), we may
write the rate of reversible change of entropy as

(8)  $\rho \frac{D_e\mathscr{S}}{Dt} = -\mathrm{div}\, \dot{\boldsymbol{\phi}}$.

Further development of irreversible thermodynamics requires a detailed
description of the entropy production $D_i\mathscr{S}/Dt$. To this end we shall first
consider a simple example in the following section.

13.2. ONE-DIMENSIONAL HEAT CONDUCTION 

Consider the heat transfer in a slender solid bar with a continuous
temperature gradient in the direction of the lengthwise axis of the bar, x;
Fig. 13.2:1. The temperature is assumed to be uniform in each cross
section of the bar, and the walls (except the ends) are thermally insulated.
It will be assumed that the heat flow takes place in the direction of the
temperature gradient, so that the problem is mathematically one-dimensional.
Furthermore, we assume that the bar is free of stresses and that the cross
section is of unit area.



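Fig. 13.2:1. Heat conduction. (Figure labels: $T(x)$, $\mathscr{E}(x)$, $\mathscr{S}(x)$; heat fluxes $h$ and $h + \frac{\partial h}{\partial x}dx$; axis $x$.)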


Let $T(x)$, $\mathscr{E}(x)$, $\mathscr{S}(x)$ denote, respectively, the temperature, the energy
per unit mass, and the entropy per unit mass of the solid. Let h denote the
heat flux per unit area per unit time in the x-direction; i.e., if Q represents
the heat transported to the right across a unit cross-sectional area, then

(1)  $$h = \frac{dQ}{dt}.$$

Now consider the changes occurring in a small element of length dx in a
small time interval dt. The net increment of heat in this element is evidently

$$dQ = h\,dt - \left(h + \frac{\partial h}{\partial x}\,dx\right)dt = -\frac{\partial h}{\partial x}\,dx\,dt.$$


The increase in energy in this element is $(\rho\,D\mathscr{E}/Dt)\,dt\,dx$. The stresses being
zero, the first law states that the quantities above must be equal. Cancelling
the factor dx dt, we obtain

(2)  $$\rho\,\frac{D\mathscr{E}}{Dt} = -\frac{\partial h}{\partial x}.$$


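The change of entropy in this element is caused by the addition of heat dQ
alone. Hence,

(3)  $$\rho\,\frac{D\mathscr{S}}{Dt}\,dx\,dt = \frac{dQ}{T} = -\frac{1}{T}\frac{\partial h}{\partial x}\,dx\,dt.$$

Hence,

(4)  $$\rho\,\frac{D\mathscr{S}}{Dt} = -\frac{1}{T}\frac{\partial h}{\partial x} = -\frac{\partial}{\partial x}\left(\frac{h}{T}\right) - \frac{h}{T^2}\frac{\partial T}{\partial x}.$$

This equation is of the form of Eq. (13.1:2). The entropy flow $\phi$ may
be identified with $h/T$. The entropy production per unit time per unit
volume is

$$\rho\,\frac{D_i\mathscr{S}}{Dt} = -\frac{h}{T^2}\frac{\partial T}{\partial x}.$$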


The postulate that $D_i\mathscr{S}/Dt$ must be positive implies that h has a sign
opposite to that of $\partial T/\partial x$; i.e., the heat flows in a direction against the
temperature gradient.
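The sign argument above can be checked numerically. The following Python sketch evaluates the entropy production $-(h/T^2)\,\partial T/\partial x$ on a grid. It is an illustration only: it assumes Fourier's law $h = -k\,\partial T/\partial x$ with a constant conductivity k, which the text has not yet introduced at this point, and the function name, temperature profile, and value of k are hypothetical.

```python
import numpy as np

def entropy_production_1d(T, x, k=1.0):
    """Return rho * D_i S / Dt = -(h / T**2) * dT/dx on the grid x.

    Assumes Fourier's law h = -k dT/dx with constant k > 0. This law is an
    assumption of the sketch, not something derived in Sec. 13.2.
    """
    dTdx = np.gradient(T, x)   # finite-difference temperature gradient
    h = -k * dTdx              # heat flux directed against the gradient
    return -h * dTdx / T**2    # equals k (dT/dx)^2 / T^2, never negative

x = np.linspace(0.0, 1.0, 101)
T = 300.0 + 50.0 * np.sin(np.pi * x)   # hypothetical temperature profile (kelvin)
sigma = entropy_production_1d(T, x, k=2.0)
```

With any flux of the form $h = -k\,\partial T/\partial x$, $k > 0$, the computed production is nonnegative at every grid point and vanishes only where the temperature gradient vanishes.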


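Problem 13.1. Generalize the results above to the three-dimensional case:
the entropy production per unit volume is

$$\rho\,\frac{D_i\mathscr{S}}{Dt} = -\left(\frac{h_x}{T^2}\frac{\partial T}{\partial x} + \frac{h_y}{T^2}\frac{\partial T}{\partial y} + \frac{h_z}{T^2}\frac{\partial T}{\partial z}\right) = -\frac{\mathbf{h}}{T}\cdot\frac{\operatorname{grad} T}{T},$$

where $h_x$, $h_y$, $h_z$ are the three components of the heat flux vector $\mathbf{h}$; i.e., $h_x$ is
the heat flux per unit area across a surface element normal to the x-axis, etc.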


13.3. PHENOMENOLOGICAL RELATIONS — ONSAGER 
PRINCIPLE 

In the preceding section it was shown that in heat conduction the
entropy production can be written as the product of a “generalized force”
$-(\operatorname{grad} T)/T$ and the corresponding flux $\mathbf{h}/T$. This is true also for other
types of irreversible processes, such as diffusion and chemical reactions. In
chemical reactions the “forces” and “fluxes” are often termed affinities and
reaction rates. In general, then, we may write, for an irreversible process,

(1)  $$\frac{D_i\mathscr{S}}{Dt} = \sum_k J_k X_k > 0,$$

where $J_k$ denotes the kth flux and $X_k$ the corresponding generalized force.
For example, in heat conduction

(2)  $$J_k = \frac{h_k}{T}, \qquad X_k = -\frac{1}{T}\frac{\partial T}{\partial x_k}, \qquad k = 1, 2, 3.$$

At thermodynamic equilibrium all processes stop and we have
simultaneously for all irreversible processes

(3)  $$J_k = 0, \qquad X_k = 0, \qquad \text{at equilibrium}.$$


It is quite natural to assume, at least in the neighborhood of an
equilibrium condition, that the relations between generalized forces and
generalized fluxes are linear. Fourier’s law for heat conduction is such an
example. Linear laws of this kind are called phenomenological relations. For
such phenomenological relations, an important symmetry is known among the
coefficients that appear in these relations. This symmetry is embodied in
the so-called Onsager principle.

Let us consider heat conduction again. It has been known for a long
time that in anisotropic crystals a symmetry exists in the heat conduction
coefficient matrix, which could not be explained by crystallographic
symmetry properties. It was found that when we write

(4)  $$\begin{aligned}
h_x &= L_{11}\frac{\partial T}{\partial x} + L_{12}\frac{\partial T}{\partial y} + L_{13}\frac{\partial T}{\partial z},\\
h_y &= L_{21}\frac{\partial T}{\partial x} + L_{22}\frac{\partial T}{\partial y} + L_{23}\frac{\partial T}{\partial z},\\
h_z &= L_{31}\frac{\partial T}{\partial x} + L_{32}\frac{\partial T}{\partial y} + L_{33}\frac{\partial T}{\partial z},
\end{aligned}$$

the matrix $L_{kl}$ is always symmetric:

(5)  $$L_{kl} = L_{lk}, \qquad l, k = 1, 2, 3,$$

for arbitrary orientations of the coordinate axes, irrespective of the
crystallographic axes. Considerable effort has been spent on accurate
experiments to detect any deviation from relation (5). The most ingenious
experiments are those of Soret^{13.1} and Voigt.^{13.1} The symmetry relation is
always verified with great accuracy.

A similar symmetry property occurs in other phenomenological relations 
describing interferences between several simultaneous irreversible processes. 
For example, consider the case of one-dimensional thermodiffusion for which
two phenomenological relations may be written as

(6)  $$\begin{aligned}
J_1 &= L_{11}X_1 + L_{12}X_2,\\
J_2 &= L_{21}X_1 + L_{22}X_2,
\end{aligned}$$

where $J_1$ represents heat flux, $J_2$ represents the mass flux of a particular
component in a mixture, $X_1$ represents the temperature gradient, and $X_2$
represents the concentration gradient of that particular component. Then
$L_{11}$ is the heat conductivity, $L_{22}$ is the diffusion coefficient, and $L_{12}$ and
$L_{21}$ are coefficients describing the interference of the two irreversible
processes of heat conduction and diffusion. The coefficient $L_{21}$ is connected
with the appearance of a concentration gradient when a temperature gradient
is imposed, called the Soret effect. The coefficient $L_{12}$ is connected with
the inverse phenomenon, the Dufour effect, which describes the appearance
of a temperature difference when a concentration gradient exists. The first
equation in Eq. (6) is a modification of Fourier’s law of heat conduction to
include the Dufour effect. The second equation in Eq. (6) is a modification
of Fick’s law of diffusion to include the Soret effect. In this case also,
we have

$$L_{12} = L_{21}.$$

Other linear phenomenological laws verified by experiments include 
Ohm’s law between electrical current and potential gradient, Newton’s law 
between shearing force and velocity gradient, the chemical reaction law
between reaction rate and chemical potentials, etc. The reciprocal phenomena
of thermoelectricity arising from the interference of heat conduction and 
electrical conduction include the Peltier effect (discovered in 1834), which 
refers to the evolution or absorption of heat at junctions of different metals 
resulting from the flow of an electric current, the Seebeck effect (discovered 
in 1822), which relates to the electromotive force developed in a circuit 
made up of different conducting elements, when not all of the junctions 
are at the same temperature, and the Thomson effect (discovered by Lord 
Kelvin in 1856), which refers to the reversible heat absorption which occurs 
when an electric current flows in a homogeneous conductor in which there 
is a temperature gradient. 

The simultaneous action of several irreversible processes may cause
interferences as typified by the examples named above. As a general form
of phenomenological laws, let $J_k$ ($k = 1, 2, \ldots, m$) represent the
generalized fluxes (heat flow, electric current, chemical reaction rate, etc.),
and let $X_k$ ($k = 1, 2, \ldots, m$) represent the generalized forces (temperature
gradient, electric potential gradient, chemical affinity, etc.). Then a
linear phenomenological law is

(7)  $$J_k = \sum_{l=1}^{m} L_{kl} X_l, \qquad k = 1, \ldots, m.$$

To this general form, the Onsager principle now states a symmetry property of
the coefficients $L_{kl}$ as follows.

When a proper choice of the fluxes $J_k$ and the forces $X_l$ is made, so
that the entropy production per unit time may be written as

(8)  $$\frac{D_i\mathscr{S}}{Dt} = \sum_{k=1}^{m} J_k X_k,$$

and if the fluxes and forces are related by linear phenomenological relations,

$$J_k = \sum_{l=1}^{m} L_{kl} X_l,$$

then the matrix of the coefficients $L_{kl}$ is symmetric; i.e.,

(9)  $$L_{kl} = L_{lk}, \qquad k, l = 1, 2, \ldots, m.$$

The identities (9) are called the Onsager reciprocal relations.

On substituting the linear relations (7) into Eq. (8), we obtain a quadratic form

(10)  $$\frac{D_i\mathscr{S}}{Dt} = \sum_{k=1}^{m}\sum_{l=1}^{m} L_{kl} X_k X_l.$$

This quadratic form has to be positive for all positive or negative values of
the variables $X_k$ except when $X_1 = X_2 = \cdots = 0$, in which case the entropy
production vanishes. The necessary and sufficient conditions which must
be satisfied by the coefficients $L_{kl}$ in order that the quadratic form (10) be
positive definite are well known in algebra and matrix theory.
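The positive-definiteness requirement on the quadratic form can be tested numerically. The sketch below uses the standard fact that a real symmetric matrix is positive definite exactly when all of its eigenvalues are positive. The coefficient values and the function names are hypothetical and serve only to illustrate the test for a 2 × 2 thermodiffusion-type matrix that already satisfies the Onsager relation $L_{12} = L_{21}$.

```python
import numpy as np

def entropy_production(L, X):
    """Quadratic form sum_k sum_l L_kl X_k X_l for a vector of forces X."""
    return float(X @ L @ X)

def is_positive_definite(L):
    """True if the real symmetric matrix L has only positive eigenvalues."""
    return bool(np.all(np.linalg.eigvalsh(L) > 0.0))

# Hypothetical coefficient matrix, symmetric by the Onsager relations.
L = np.array([[2.0, 0.5],
              [0.5, 1.0]])
X = np.array([-0.3, 0.8])            # arbitrary generalized forces
rate = entropy_production(L, X)      # positive for any nonzero X if L is positive definite

# A symmetric matrix whose coupling terms are too large fails the test.
L_bad = np.array([[1.0, 2.0],
                  [2.0, 1.0]])
```

An equivalent hand check is Sylvester's criterion: all leading principal minors of $L_{kl}$ must be positive, which for the 2 × 2 case reads $L_{11} > 0$ and $L_{11}L_{22} - L_{12}^2 > 0$.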

Onsager derived (1931) this principle from statistical mechanics
considerations under the assumption of microscopic reversibility; i.e., the
symmetry of all mechanical equations of motion of individual particles with
respect to time. It furnishes an explanation of the observed relation (5) of
the heat conduction for anisotropic crystals. The statistical derivation may
be found in Onsager’s papers, or in Prigogine’s^{13.1} and De Groot’s^{13.1} books.

Casimir^{13.1} and Tellegen classified irreversible processes into two classes
according to whether

(11)  $$J_k(t) = J_k(-t), \qquad J_k \text{ an even function of time},$$

(12)  $$J_k(t) = -J_k(-t), \qquad J_k \text{ an odd function of time}.$$
Heat flow, mass flow, and so on, defined by expressions such as $\delta Q/\delta t$,
are odd functions of time. From the point of view of the kinetic theory of
matter, J is odd or even according to whether J does or does not change sign
as the particle velocities are reversed. So far, we have considered cases in
which the generalized flows are so chosen that they are all odd. In the more
general case, Onsager’s principle should be generalized to state: if the
irreversible processes i and k are both even or both odd, then

(13)  $$L_{ik} = L_{ki}.$$

If one is even and the other is odd, then

(14)  $$L_{ik} = -L_{ki}.$$

A further modification is necessary if there exist forces depending explicitly
on the velocity, such as Lorentz forces, Coriolis forces, and so on. For
example, in the presence of a magnetic field, $\mathbf{H}$, two odd or two even
processes are related by

(15)  $$L_{ik}(\mathbf{H}) = L_{ki}(-\mathbf{H}).$$

The generalized theorem is sometimes called the Onsager-Casimir theorem.

For simultaneous action of several irreversible processes a general
phenomenological law may be written in the form discussed in Sec. 13.3. The
Onsager-Casimir theorem simplifies these laws. A further simplification
exists in the form of certain symmetry requirements which may dictate
whether certain interference coefficients should vanish or not. This is
Curie’s symmetry principle [P. Curie, Oeuvres (Paris: Société Française de
Physique, 1908)], which states that macroscopic causes always have fewer
elements of symmetry than the effects they produce. An interference of the
irreversible processes is possible only if this general principle is satisfied.
For example, chemical affinity (a scalar) cannot produce a directed heat
flow (a vector), and the interference coefficient between the heat flow and
chemical affinity must vanish.

13.4. BASIC EQUATIONS OF THERMOMECHANICS 

Thermomechanical systems are subjected to the same general conservation
laws with regard to mass and momentum as were discussed in Chapter 5.
However, the law of conservation of energy contains both mechanical
and thermal energy. The change of thermal energy is related to the change
of entropy. Thus, a complete description of the evolution of a system
requires a knowledge of the entropy production. Phenomenological relations,
together with the Onsager principle, provide sufficient details about the
entropy production. These laws, taken together, determine the evolution
of the system.

To demonstrate the procedure let us consider a solid body occupying a
regular domain V with boundary B. A set of rectangular Cartesian
coordinates will be used for reference; the instantaneous position of a particle
will be denoted by $(x_1, x_2, x_3)$. Equations will be written with respect to
the instantaneous configuration of the body, i.e., in Eulerian coordinates.

The conservation of mass is expressed by the equation of continuity
(5.4:3)

(1)  $$\frac{\partial \rho}{\partial t} + \frac{\partial (\rho v_i)}{\partial x_i} = 0.$$

The conservation of momentum is expressed by the Eulerian equation of
motion (5.5:7), and Cauchy’s formula (3.3:2)

(2)  $$\rho\,\frac{Dv_i}{Dt} = \sigma_{ij,j} + \rho F_i \qquad \text{in } V,$$

(2a)  $$\overset{\nu}{T}_i = \sigma_{ji}\,\nu_j \qquad \text{on } B,$$

(2b)  $$\sigma_{ij} = \sigma_{ji} \qquad \text{in } V + B.$$

The conservation of energy is given by Eq. (12.2:7)

(3)  $$\rho\,\frac{D\mathscr{E}}{Dt} = \sigma_{ij}V_{ij} - h_{i,i}.$$

In these equations, $\rho$ is the mass density, $(v_1, v_2, v_3)$ is the velocity vector
of a particle, $D/Dt$ is the material derivative

(4)  $$\frac{D}{Dt} = \frac{\partial}{\partial t} + v_j\frac{\partial}{\partial x_j},$$

$F_i$ is the body force per unit mass, $\overset{\nu}{T}_i$ is the surface traction per unit area,
$\mathscr{E}$ is the internal energy per unit mass, $\sigma_{ij}$ is the stress tensor, $h_i$ is the
heat flux vector, and a comma followed by i indicates partial differentiation
with respect to $x_i$. The indices i, j range over 1, 2, 3; and the summation
convention is used.

Next we need the entropy balance equation. According to the first
assumption named in Sec. 13.1, we shall assume, for a solid body, that the
specific entropy $\mathscr{S}$ per unit mass is a function of the internal energy $\mathscr{E}$ and
the strain tensor $e_{ij}$,

(5)  $$\mathscr{S} = \mathscr{S}(\mathscr{E}, e_{ij}).$$

This is also expressed by the fact that, in equilibrium, the total differential
of $\mathscr{S}$ is given by Gibbs’ relation (12.2:10)

(6)  $$\rho T\,d\mathscr{S} = \rho\,d\mathscr{E} - \sigma_{ij}\,de_{ij}.$$

Since our hypothesis asserts that $\mathscr{S}$ is related to $\mathscr{E}$ and $e_{ij}$ in the same way
even if the system is not in equilibrium, it follows that along the path of
motion

(7)  $$\rho T\,\frac{D\mathscr{S}}{Dt} = \rho\,\frac{D\mathscr{E}}{Dt} - \sigma_{ij}V_{ij},$$

where $V_{ij} = \tfrac{1}{2}(v_{i,j} + v_{j,i})$ is the symmetric part of $v_{i,j}$, called the rate of
deformation tensor. On substituting Eq. (3) into Eq. (7), we obtain


(8)  $$\rho T\,\frac{D\mathscr{S}}{Dt} = -h_{i,i},$$

or

(9)  $$\rho\,\frac{D\mathscr{S}}{Dt} = -\frac{h_{i,i}}{T} = -\left(\frac{h_i}{T}\right)_{,i} - h_i\,\frac{T_{,i}}{T^2}.$$

The first term on the right-hand side is the divergence of the entropy flow;
the second term is the entropy production, which must be positive. This
result is in agreement with what was discussed in Sec. 13.2.

The entropy production $-h_i T_{,i}/T^2$ is a product of the flux $h_i/T$ and
the “force” $-T_{,i}/T$. A phenomenological law relates the flux to the force
in the form

(10)  $$h_i = -k_{ij}\,T_{,j},$$

where

(11)  $$k_{ij} = k_{ji}.$$

This is all we can get from the general considerations of mass,
momentum, energy, and entropy. Equations (1) through (11), taken together,
still do not define the strain field uniquely. To complete the formulation, a
constitutive law must be added, which relates the stress tensor $\sigma_{ij}$ to the
strain tensor $e_{ij}$. It can be verified that with the addition of such a
constitutive law, a sufficient number of differential equations are obtained
for which boundary value problems can be formulated.

As another illustration let us consider a body of fluid instead of a solid.
A fluid is distinguished from a solid by the fact that it cannot sustain shear
stress without motion. In an equilibrium condition, the internal energy
must depend only on the volumetric change. Hence, instead of Eq. (5), we
now make the assumption

(12)  $$\mathscr{S} = \mathscr{S}(\mathscr{E}, e),$$

where

(13)  $$e = e_{ii} = u_{i,i}$$

is the first invariant of the strain tensor. Correspondingly, the stress tensor
is separated into a pressure and a stress deviator:

(14)  $$\sigma_{ij} = -p\,\delta_{ij} + \sigma'_{ij},$$

(15)  $$p = -\tfrac{1}{3}\sigma_{kk}.$$



The Gibbs’ relation in thermostatics (12.2:11) is now generalized to read,
according to Eq. (12) and our basic assumptions listed in Sec. 13.1,

(16)  $$\rho T\,\frac{D\mathscr{S}}{Dt} = \rho\,\frac{D\mathscr{E}}{Dt} + p\,\frac{De}{Dt}.$$

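A substitution of Eq. (3) yields

(17)  $$\rho T\,\frac{D\mathscr{S}}{Dt} = -h_{i,i} + \sigma'_{ij}v_{i,j} - p\,v_{i,i} + p\,v_{i,i}.$$

Hence,

(18)  $$\rho\,\frac{D\mathscr{S}}{Dt} = -\left(\frac{h_i}{T}\right)_{,i} - h_i\,\frac{T_{,i}}{T^2} + \frac{1}{T}\,\sigma'_{ij}v_{i,j}.$$

The first term on the right-hand side of (18) is the divergence of the entropy
flow. The last two terms have the significance of entropy production,

(19)  $$\rho\,\frac{D_i\mathscr{S}}{Dt} = -h_i\,\frac{T_{,i}}{T^2} + \frac{1}{T}\,\sigma'_{ij}v_{i,j}.$$

Now $v_{i,j}$ can be split into a symmetric part $V_{ij}$ and an antisymmetric
part $\omega_{ij}$, called the rate of deformation tensor and the vorticity tensor
respectively,

(20)  $$v_{i,j} = V_{ij} + \omega_{ij},$$

(21)  $$V_{ij} = V_{ji}, \qquad \omega_{ij} = -\omega_{ji},$$

(22)  $$V_{ij} = \tfrac{1}{2}(v_{i,j} + v_{j,i}), \qquad \omega_{ij} = \tfrac{1}{2}(v_{i,j} - v_{j,i}).$$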

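The contraction of the symmetric stress tensor $\sigma'_{ij}$ with the antisymmetric
tensor $\omega_{ij}$ vanishes. Hence, $\sigma'_{ij}v_{i,j} = \sigma'_{ij}V_{ij}$ and Eq. (19) becomes

(23)  $$\rho\,\frac{D_i\mathscr{S}}{Dt} = -\frac{h_i}{T^2}\frac{\partial T}{\partial x_i} + \frac{1}{T}\,\sigma'_{ij}V_{ij}.$$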
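The step from Eq. (19) to Eq. (23) rests on the fact that the double contraction of a symmetric tensor with an antisymmetric tensor vanishes. The following Python sketch checks this for randomly generated tensors. The arrays stand in for $v_{i,j}$ and $\sigma'_{ij}$ and carry no physical meaning; all names are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

grad_v = rng.normal(size=(3, 3))      # stand-in for the velocity gradient v_{i,j}
V = 0.5 * (grad_v + grad_v.T)         # rate of deformation tensor, Eq. (22)
omega = 0.5 * (grad_v - grad_v.T)     # vorticity tensor, Eq. (22)

s = rng.normal(size=(3, 3))
sigma_dev = 0.5 * (s + s.T)           # any symmetric tensor, standing in for sigma'_{ij}

# Double contractions a_ij b_ij.
coupling = np.einsum("ij,ij->", sigma_dev, omega)    # sigma'_ij omega_ij
power_full = np.einsum("ij,ij->", sigma_dev, grad_v) # sigma'_ij v_{i,j}
power_sym = np.einsum("ij,ij->", sigma_dev, V)       # sigma'_ij V_ij
```

Up to rounding error, `coupling` is zero and `power_full` equals `power_sym`, which is the identity used to pass from Eq. (19) to Eq. (23).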
The linear phenomenological laws assume the form


(24) 

(25) 




Ste* - 


r {3) 1 8T (4) v 

a ij ^ijk rp Q Xk + ^ijkl V kL i 


where, according to Onsager’s principle, the symmetry relation 
(26) C$ = C$ 


prevails. Equation (25) is a law for viscous flow. The coupling terms
involving $C^{(2)}_{ijk}$, $C^{(3)}_{ijk}$ express the possible interference between viscous
flow and heat conduction.†

In the case of a fluid, Eqs. (1)-(3) and (12)-(26) provide the right
number of field equations for the determination of the flow field.


13.5. EQUATIONS OF EVOLUTION FOR A LINEAR 
HEREDITARY MATERIAL 

The remainder of this chapter will be devoted to the question of the
stress-strain law for a linear hereditary material, following a treatment first
given by Maurice A. Biot^{13.1} (1954). We have in mind such materials as
polycrystalline metals, high polymers, etc. When such a material is subjected
to a variable strain, many things happen inside, which, however, are not
explicitly observed in formulating the stress-strain law. For example, when
a polycrystalline metal is uniformly strained in a macroscopic sense, the
individual anisotropic crystals are strained differently, and thermal currents
that circulate among the crystals are generated. The interstitial atoms
move in the crystals or among the crystals. These processes can be
accounted for explicitly. However, often we are not interested in them. Our
interest may be limited to the extent of their interference with the
deformation. They are the “hidden variables” in the stress-strain relationship.

We follow Biot to formulate a problem as follows. Consider a
system I with n degrees of freedom defined by n state variables $q_1, q_2, \ldots, q_n$.
These q’s may represent strain components, local temperature, piezoelectric
charges, concentrations such as induced by chemical or solubility processes,
etc. It is assumed that the system is under the action of generalized external
forces denoted by $Q_j$, the units and senses of which are such that for each
$j = 1, 2, \ldots, n$, $Q_j\,dq_j$ (j not summed) represents the energy furnished to
the system when $q_j$ is changed by an amount $dq_j$. These forces may be
externally applied forces, electromotive forces, or may result from deviations
of the Gibbs and chemical potential from the equilibrium state. Inertia
forces, if any, are considered as external forces according to D’Alembert’s
principle.

†It follows further from Curie’s principle that $C^{(2)}_{ijk} = C^{(3)}_{ijk} = 0$. See p. 436.

Let the system I be adjoined to a system II which is a large reservoir at
constant temperature $T_0$, and let the total system I + II be insulated. The
variables $q_j$ will be defined as the departure from the state of
thermodynamic equilibrium; i.e., $q_j = 0$ at the condition of equilibrium at
which the temperature is uniform and equal to $T_0$.

Let the system I be given an initial disturbance from equilibrium. The
forces $Q_j$ as functions of time are given. If all coordinates $q_j(t)$ are
determined for $t > 0$, we say that the evolution of the system is known. The
time history of $q_j(t)$ is said to be a trajectory. The differential equations
that describe the evolution of $q_j(t)$ from their initial values are called the
equations of evolution.

A little reflection will show that if we consider the body represented by
the system I to be macroscopically homogeneous and subjected to
macroscopically homogeneous stresses, we may let $q_1, q_2, \ldots, q_6$ represent the six
independent strain components $e_{11}, e_{12} = e_{21}, \ldots, e_{33}$, and $Q_1, \ldots, Q_6$ the
corresponding stress components. The relations between $Q_1, \ldots, Q_6$ and
$q_1, \ldots, q_6$ are influenced by the other q’s and Q’s. However, if all other
Q’s vanish and if we can eliminate all q’s other than the first six, then the
stress-strain relationship of the material is obtained. It will be seen that
the influence of the hidden variables is revealed in the hereditary
character of the material. In the present section we shall derive the equations
of evolution. The solution of free evolution following an initial disturbance
will be discussed in Sec. 13.6, and the forced motion and the elimination of
hidden variables will be treated in Secs. 13.7 and 13.8. Biot’s results about
viscoelastic material will be presented in Sec. 13.9.

We shall now consider what happens in a small time interval. Let the
change of the coordinate $q_j$ in a time interval dt be denoted by $dq_j$.
According to the first law of thermodynamics, the heat, $d\mathscr{H}^{(\mathrm{I})}$, required by
the system I to bring about a change of state $dq_j$ is the difference between
the change in internal energy $\mathscr{E}_{\mathrm{I}}$ and the work done. Without loss of
generality, we shall write the following equations under the assumption that
the system I is of unit mass:

(1)  $$d\mathscr{H}^{(\mathrm{I})} = d\mathscr{E}_{\mathrm{I}} - \sum_{j=1}^{n} Q_j\,dq_j.$$





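Since the total system I + II is insulated, we have

$$d\mathscr{H}^{(\mathrm{I})} + d\mathscr{H}^{(\mathrm{II})} = 0,$$

where $d\mathscr{H}^{(\mathrm{II})}$ denotes the heat received by system II. According to our basic
assumption stated in Sec. 13.1, Gibbs’ relation for change of entropy holds.
Hence, the change of entropy of system II is

$$d\mathscr{S}_{\mathrm{II}} = \frac{d\mathscr{H}^{(\mathrm{II})}}{T_0} = -\frac{d\mathscr{H}^{(\mathrm{I})}}{T_0}.$$

The change of entropy of the isolated total system I + II is

(2)  $$d_i\mathscr{S}_{\mathrm{I+II}} = d\mathscr{S}_{\mathrm{I}} + d\mathscr{S}_{\mathrm{II}} = d\mathscr{S}_{\mathrm{I}} - \frac{d\mathscr{E}_{\mathrm{I}}}{T_0} + \frac{1}{T_0}\sum_{j=1}^{n} Q_j\,dq_j.$$

This is the entropy production in the time interval dt. Since $\mathscr{S}_{\mathrm{I}}$ and $\mathscr{E}_{\mathrm{I}}$
are functions of state, the total differentials $d\mathscr{S}_{\mathrm{I}}$ and $d\mathscr{E}_{\mathrm{I}}$ are

$$d\mathscr{S}_{\mathrm{I}} = \sum_{j=1}^{n}\frac{\partial\mathscr{S}_{\mathrm{I}}}{\partial q_j}\,dq_j, \qquad d\mathscr{E}_{\mathrm{I}} = \sum_{j=1}^{n}\frac{\partial\mathscr{E}_{\mathrm{I}}}{\partial q_j}\,dq_j.$$

Hence,

(3)  $$d_i\mathscr{S}_{\mathrm{I+II}} = \frac{1}{T_0}\sum_{j=1}^{n}\left[\frac{\partial}{\partial q_j}\left(T_0\mathscr{S}_{\mathrm{I}} - \mathscr{E}_{\mathrm{I}}\right) + Q_j\right]dq_j.$$

According to the second law of thermodynamics, $d_i\mathscr{S}_{\mathrm{I+II}}$ is nonnegative. If
we write $dq_j/dt$ for the generalized flux $J_j$ of the previous section, we have
the following general expression for the entropy production:

(4) ▲  $$d_i\mathscr{S}_{\mathrm{I+II}} = \sum_{j=1}^{n} X_j\,dq_j,$$

where $X_j$ are the generalized forces conjugate to $q_j$ in the entropy
production. To distinguish $X_j$ from the “generalized external forces” $Q_j$, we
shall call $X_j$ the generalized entropy-production forces. We remark that $X_j$
may be identically zero for some index j. Hence, the number of significant
terms in the sum on the right-hand side of Eq. (3) may be less than n.

Identifying the general expression (4) with Eq. (3), we see that in our
problem

(5) ▲  $$X_j = \frac{1}{T_0}\left[\frac{\partial}{\partial q_j}\left(T_0\mathscr{S}_{\mathrm{I}} - \mathscr{E}_{\mathrm{I}}\right) + Q_j\right].$$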
Now, if the phenomenological laws connecting $X_j$ and $\dot q_k$ are linear so that

(6) ▲  $$X_j = \frac{1}{T_0}\sum_{k=1}^{n} b_{jk}\,\dot q_k,$$

then Eq. (5) becomes

(7) ▲  $$\sum_{k=1}^{n} b_{jk}\,\dot q_k - \frac{\partial}{\partial q_j}\left(T_0\mathscr{S}_{\mathrm{I}} - \mathscr{E}_{\mathrm{I}}\right) = Q_j, \qquad j = 1, 2, \ldots, n,$$

which is the equation of evolution.

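Onsager’s principle assures that the coefficients $b_{ij}$ in Eq. (6) are
symmetric:

(8) ▲  $$b_{ij} = b_{ji}.$$

Let us introduce the quadratic form

(9) ▲  $$\mathscr{D} = \frac{1}{2}\sum_{i,j} b_{ij}\,\dot q_i\,\dot q_j.$$

Then Eq. (6) may be written as

(10) ▲  $$X_i = \frac{1}{T_0}\frac{\partial\mathscr{D}}{\partial\dot q_i}.$$

The quadratic form $\mathscr{D}$ is nonnegative because it is proportional to the
entropy production, $d_i\mathscr{S}_{\mathrm{I+II}}/dt$, and because $T_0$ is positive by definition, as
can be seen from the following equation:

(11) ▲  $$\frac{d_i\mathscr{S}_{\mathrm{I+II}}}{dt} = \sum_i X_i\,\dot q_i = \frac{1}{T_0}\sum_i\frac{\partial\mathscr{D}}{\partial\dot q_i}\,\dot q_i = \frac{2\mathscr{D}}{T_0} \geq 0.$$

In addition, let us define

(12) ▲  $$\mathscr{V} = \mathscr{E}_{\mathrm{I}} - T_0\mathscr{S}_{\mathrm{I}}.$$

Then the equation of evolution (7) can be written in the familiar Lagrangian
form

(13) ▲  $$\frac{\partial\mathscr{D}}{\partial\dot q_i} + \frac{\partial\mathscr{V}}{\partial q_i} = Q_i.$$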

It is easy to see that in the neighborhood of an equilibrium state (in
which $q_i = 0$, $\dot q_i = 0$, $Q_i = 0$, $X_i = 0$), $\mathscr{V}$ is a quadratic form in $q_i$.
For, on expanding $\mathscr{V}$ into a power series in $q_i$, the constant term has no
significance and can be removed, whereas the linear terms must all vanish,
because otherwise Eq. (13) cannot be satisfied at the equilibrium condition.
Hence, if sufficiently small values of $q_i$ are considered, a quadratic form is
obtained if higher powers of $q_i$ are neglected:

(14) ▲  $$\mathscr{V} = \frac{1}{2}\sum_{i,j} a_{ij}\,q_i\,q_j.$$

Since only the sum is of interest, we may assume, without loss of generality,
that this quadratic form is symmetric,

(15) ▲  $$a_{ij} = a_{ji}.$$

Hence, Eq. (7) can be put in the form

(16) ▲  $$\sum_j b_{ij}\,\dot q_j + \sum_j a_{ij}\,q_j = Q_i.$$

Equation (16) assumes a form which occurs frequently in the theory
of vibration of discrete masses. Evidently, $\mathscr{V}$ plays the role of a potential
energy and $\mathscr{D}$ that of the dissipation function which occurs in the usual
vibration theory.
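A system of the form (16) can be integrated numerically once the matrices are known. The sketch below uses the backward Euler scheme, which is a choice made here for illustration and is not part of Biot's formulation. The matrices `a` and `b`, the force `Q`, and the step size are hypothetical values. The scheme solves $(b/\Delta t + a)\,q^{n+1} = Q(t^{n+1}) + b\,q^{n}/\Delta t$ at every step.

```python
import numpy as np

def evolve(a, b, Q_of_t, q0, dt, n_steps):
    """Integrate b dq/dt + a q = Q(t) with the backward Euler scheme.

    a, b    : symmetric coefficient matrices (hypothetical values below)
    Q_of_t  : callable returning the force vector at time t
    q0      : initial state
    Returns an array of shape (n_steps + 1, n) with the state history.
    """
    q = np.array(q0, dtype=float)
    history = [q.copy()]
    lhs = b / dt + a                       # constant system matrix
    for n in range(1, n_steps + 1):
        rhs = Q_of_t(n * dt) + b @ q / dt  # implicit step uses Q at the new time
        q = np.linalg.solve(lhs, rhs)
        history.append(q.copy())
    return np.array(history)

a = np.array([[4.0, -1.0], [-1.0, 3.0]])   # hypothetical a_ij (potential)
b = np.array([[2.0, 0.5], [0.5, 1.0]])     # hypothetical b_ij (dissipation)
Q = np.array([1.0, 0.5])                   # constant force switched on at t = 0

history = evolve(a, b, lambda t: Q, np.zeros(2), dt=0.01, n_steps=2000)
q_equilibrium = np.linalg.solve(a, Q)      # state approached as t grows
```

For a constant force and a positive definite `a`, the computed state relaxes toward the solution of $a_{ij}q_j = Q_i$, in which the dissipative term has died out.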

13.6. RELAXATION MODES 

Let us now consider the solutions of the equation of evolution

(1)  $$\sum_{j=1}^{n} a_{ij}\,q_j + \sum_{j=1}^{n} b_{ij}\,\dot q_j = Q_i, \qquad i = 1, 2, \ldots, n,$$

where the coefficients $a_{ij}$ and $b_{ij}$ are real-valued and symmetric,

$$a_{ij} = a_{ji}, \qquad b_{ij} = b_{ji}.$$

We shall ignore the inertia forces for the moment, so that $Q_i$ do not involve
the acceleration $\ddot q_j$. The nature of the solution depends very much on the
nature of the quadratic form

(2)  $$\mathscr{D} = \frac{1}{2}\sum_{i=1}^{n}\sum_{j=1}^{n} b_{ij}\,\dot q_i\,\dot q_j,$$

which has been shown above to be proportional to the entropy production
$d_i\mathscr{S}_{\mathrm{I+II}}/dt$ and is thus nonnegative. However, $\mathscr{D}$ can be identically zero
if the process is reversible. In any case, a coefficient $b_{ij}$ ($i \neq j$) will be
zero if there is no interference between the processes $q_i$ and $q_j$. For a
certain i, the diagonal coefficient $b_{ii}$ will be zero if $q_i$ does not participate
in the irreversible entropy production (i.e., if $q_i$ is a reversible variable),
in which case all coefficients $b_{i1}, \ldots, b_{in}$ vanish. Hence, in general, the
quadratic form $\mathscr{D}$ is not positive definite. However, it can be reduced into
a reduced positive definite form if all the reversible variables are eliminated
(unless $\mathscr{D} \equiv 0$, in which case there is no problem). Let us assume that this
reduction has been done, so that the variables $q_1, q_2, \ldots, q_m$ ($m \leq n$) really
participate in the irreversible entropy production. Then we can assert that
the quadratic form

(3)  $$\mathscr{D} = \frac{1}{2}\sum_{i=1}^{m}\sum_{j=1}^{m} b_{ij}\,\dot q_i\,\dot q_j$$

is positive definite in the m variables $\dot q_i$ ($i = 1, \ldots, m$).

Equations (1) can then be separated into a group of m equations
involving $\dot q_1, \ldots, \dot q_m$, and another group of $(n - m)$ equations which does not
contain time derivatives. The latter group of equations can be solved for
$q_{m+1}, \ldots, q_n$, which can be substituted in turn into the first m equations.
In this way we obtain a sharper restatement of the equations of evolution:

(4)  $$\sum_{j=1}^{m} a'_{ij}\,q_j + \sum_{j=1}^{m} b_{ij}\,\dot q_j = Q'_i, \qquad i = 1, 2, \ldots, m,$$

where the matrix $[b_{ij}]$ is real-valued, symmetric, and positive definite. It is
easy to see that the new coefficients $a'_{ij}$ are again real and symmetric.

Let us now consider the solutions of Eqs. (4) in the homogeneous case
in which $Q'_i = 0$. The equations

(5)  $$\sum_{j=1}^{m} a'_{ij}\,q_j + \sum_{j=1}^{m} b_{ij}\,\dot q_j = 0, \qquad i = 1, \ldots, m,$$

admit a solution of the form

(6)  $$q_j = \psi_j\,e^{-\lambda t}, \qquad j = 1, 2, \ldots, m,$$

where $\psi_j$ are constants. On substituting Eq. (6) into Eq. (5), we have

(7)  $$\sum_{j=1}^{m}\left(a'_{ij} - \lambda\,b_{ij}\right)\psi_j = 0, \qquad i = 1, \ldots, m.$$
To find a set of nontrivial solutions $\psi_j$ poses an eigenvalue problem. The
eigenvalues satisfy the determinantal equation

(8)  $$\det\left|a'_{ij} - \lambda\,b_{ij}\right| = 0.$$

Since the square matrices $[a'_{ij}]$, $[b_{ij}]$ are real and symmetric, and $[b_{ij}]$ is
positive definite, all eigenvalues $\lambda_1, \ldots, \lambda_m$ are real-valued. Whether they
are all positive or not depends on the nature of the quadratic form

(9)  $$\mathscr{V} = \frac{1}{2}\sum_{i=1}^{m}\sum_{j=1}^{m} a'_{ij}\,q_i\,q_j.$$

The irreversible process is considered as a disturbed motion about an
equilibrium state. If the equilibrium state is stable, then $\mathscr{V}$ is positive definite
and all roots $\lambda_1, \ldots, \lambda_m$ are positive. In that case, every disturbance tends
to zero with increasing time [see Eq. (6)]. If the equilibrium is unstable,
then $\mathscr{V}$ is indefinite and some of the roots $\lambda$ will be negative. In that
case, the disturbed motion will increase exponentially with time. If the
equilibrium is neutrally stable with respect to some coordinates, then $\mathscr{V}$ is
positive semi-definite; some of the roots $\lambda$ may be zero, while the others
are positive.

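It is known that corresponding to the m eigenvalues $\lambda_1, \ldots, \lambda_m$ (some
of them may be zero or may be multiple roots), there exist m eigenvectors
$\{\psi^{(1)}\}, \ldots, \{\psi^{(m)}\}$ which are linearly independent and orthonormal:

(10)  $$\sum_{i=1}^{m}\sum_{j=1}^{m} b_{ij}\,\psi_i^{(k)}\psi_j^{(l)} = \begin{cases} 0 & \text{if } k \neq l,\\ 1 & \text{if } k = l,\end{cases} \qquad \sum_{i=1}^{m}\sum_{j=1}^{m} a'_{ij}\,\psi_i^{(k)}\psi_j^{(l)} = \begin{cases} 0 & \text{if } k \neq l,\\ \lambda_k & \text{if } k = l.\end{cases}$$

The general solution of Eqs. (5) is

(11)  $$q_j = \sum_{k=1}^{m} C_k\,\psi_j^{(k)}\,e^{-\lambda_k t}, \qquad j = 1, 2, \ldots, m,$$

where the $C_k$ are arbitrary constants fixed by the initial disturbance.
Each solution corresponding to a $\lambda_k$ ($k = 1, \ldots, m$),

(12)  $$q_j^{(k)} = \psi_j^{(k)}\,e^{-\lambda_k t}, \qquad j = 1, 2, \ldots, m,$$

is called a relaxation mode.

Thus, for an irreversible process with m fluxes $\dot q_1, \ldots, \dot q_m$ participating,
there exist m linearly independent orthonormal relaxation modes.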
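The eigenvalue problem (7)-(8) and the normalization (10) can be reproduced numerically. The sketch below reduces the generalized problem to a standard symmetric one through a Cholesky factorization of $[b_{ij}]$. This is a common numerical route chosen here for illustration, not a procedure described in the text. The function name and the 2 × 2 matrices are hypothetical.

```python
import numpy as np

def relaxation_modes(a, b):
    """Solve (a - lam * b) psi = 0 for symmetric a and positive definite b.

    Returns (lam, Psi). The columns of Psi are the modal columns psi^(k),
    scaled so that Psi.T @ b @ Psi = I and Psi.T @ a @ Psi = diag(lam).
    """
    c = np.linalg.cholesky(b)          # b = c c^T with c lower triangular
    c_inv = np.linalg.inv(c)
    a_std = c_inv @ a @ c_inv.T        # equivalent standard symmetric problem
    lam, y = np.linalg.eigh(a_std)     # real eigenvalues, orthonormal y
    Psi = c_inv.T @ y                  # transform eigenvectors back
    return lam, Psi

a = np.array([[4.0, -1.0], [-1.0, 3.0]])   # hypothetical a'_ij, positive definite
b = np.array([[2.0, 0.5], [0.5, 1.0]])     # hypothetical b_ij, positive definite
lam, Psi = relaxation_modes(a, b)
```

Because the sample `a` is positive definite, both eigenvalues come out positive, so each mode $\psi_j^{(k)}e^{-\lambda_k t}$ decays, in line with the stability discussion above.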
13.7. NORMAL COORDINATES


Now we shall consider the nonhomogeneous case, so that we may find
the evolution of an irreversible process under a given set of forcing functions.
As in the theory of mechanical vibrations, an introduction of the relaxation
modes as the basis of generalized coordinates will decouple the equations
of motion and thus lead to simple solutions.

We introduce the linear transformation from the state variables $\{q_i\}$ to
the normal coordinates $\{\xi_k\}$ by the equations

(1)  $$q_i = \sum_{k=1}^{m}\psi_i^{(k)}\,\xi_k, \qquad i = 1, \ldots, m,$$

where $\{\psi_i^{(k)}\}$ is the modal column of the kth relaxation mode. Let the
relaxation modes be normalized as in Eq. (13.6:10). Then the potential
and dissipation functions assume the form

(2)  $$\mathscr{V} = \frac{1}{2}\sum_{k=1}^{m}\lambda_k\,\xi_k^2, \qquad \mathscr{D} = \frac{1}{2}\sum_{k=1}^{m}\dot\xi_k^2.$$

The Lagrangian equations of motion, Eq. (13.6:4), now become decoupled,

(3)  $$\frac{\partial\mathscr{V}}{\partial\xi_k} + \frac{\partial\mathscr{D}}{\partial\dot\xi_k} = \Xi_k,$$

where $\Xi_k$ is the generalized force corresponding to $\xi_k$,

(4)  $$\Xi_k = \sum_{j=1}^{m}\psi_j^{(k)}\,Q'_j.$$

From Eqs. (2) and (3) we have

(5)  $$\dot\xi_k + \lambda_k\,\xi_k = \Xi_k, \qquad k = 1, 2, \ldots, m.$$

The Laplace transform of Eq. (5) is, for zero initial condition ($\xi_k = 0$ when
$t = 0$),

(6)  $$(s + \lambda_k)\,\bar\xi_k = \bar\Xi_k, \qquad k = 1, 2, \ldots, m,$$

where

(7)  $$\bar\xi_k = \int_0^\infty e^{-st}\,\xi_k(t)\,dt, \qquad \bar\Xi_k = \int_0^\infty \Xi_k(t)\,e^{-st}\,dt.$$

The solution of Eq. (6), regarded as algebraic equations, is

(8)  $$\bar\xi_k = \frac{\bar\Xi_k}{s + \lambda_k}, \qquad k = 1, 2, \ldots, m.$$

Substituting back into Eq. (1), we obtain the Laplace transform of $q_i(t)$,

(9)  $$\bar q_i = \int_0^\infty e^{-st}\,q_i(t)\,dt = \sum_{k=1}^{m}\psi_i^{(k)}\,\frac{\bar\Xi_k}{s + \lambda_k} = \sum_{j=1}^{m}\sum_{k=1}^{m}\frac{\psi_i^{(k)}\,\psi_j^{(k)}}{s + \lambda_k}\,\bar Q'_j.$$

Equation (9) provides the solution to the response problem in Laplace
transformation language. On transforming back to the physical plane, the
time history of the evolution of an irreversible process in response to
external forcing functions is obtained:

(10) ▲  $$q_i(t) = \sum_{j=1}^{m}\sum_{k=1}^{m}\psi_i^{(k)}\,\psi_j^{(k)}\int_0^t e^{-\lambda_k\tau}\,Q'_j(t - \tau)\,d\tau,$$

or

(10a)  $$q_i(t) = \sum_{j=1}^{m}\sum_{k=1}^{m}\psi_i^{(k)}\,\psi_j^{(k)}\int_0^t e^{-\lambda_k(t - \tau)}\,Q'_j(\tau)\,d\tau.$$

This gives the complete solution if m = n. The case m < n [cf. the
discussion in Sec. 13.6 following Eq. (13.6:2)] is treated in the exercise below
(Problem 13.2).
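For a force that is switched on at t = 0 and then held constant, the convolution integral in Eq. (10a) can be evaluated in closed form, since $\int_0^t e^{-\lambda_k(t-\tau)}\,d\tau = (1 - e^{-\lambda_k t})/\lambda_k$ for $\lambda_k \neq 0$. The Python sketch below implements this special case. The matrices, the force vector, and the function names are hypothetical, and the Cholesky-based mode computation is a numerical convenience assumed here rather than something taken from the text.

```python
import numpy as np

def modes(a, b):
    """Relaxation spectrum lam and modal matrix Psi with Psi.T @ b @ Psi = I."""
    c_inv = np.linalg.inv(np.linalg.cholesky(b))
    lam, y = np.linalg.eigh(c_inv @ a @ c_inv.T)
    return lam, c_inv.T @ y

def step_response(a, b, Q, t):
    """q(t) from Eq. (10a) for a constant force Q applied from t = 0 on.

    Assumes every lam_k is nonzero (stable equilibrium).
    """
    lam, Psi = modes(a, b)
    Xi = Psi.T @ Q                               # modal forces, Eq. (4)
    xi = Xi * (1.0 - np.exp(-lam * t)) / lam     # closed-form convolution
    return Psi @ xi                              # back to q_i, Eq. (1)

a = np.array([[4.0, -1.0], [-1.0, 3.0]])   # hypothetical a'_ij
b = np.array([[2.0, 0.5], [0.5, 1.0]])     # hypothetical b_ij
Q = np.array([1.0, 0.5])                   # hypothetical constant force Q'_j

q_early = step_response(a, b, Q, 0.0)      # zero: the system starts at equilibrium
q_late = step_response(a, b, Q, 50.0)      # approaches the static solution of a q = Q
```

Each normal coordinate approaches its static value $\Xi_k/\lambda_k$ at its own rate $\lambda_k$, which is the decoupling that the normal coordinates were introduced to achieve.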

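It is seen from Eq. (17) below that, in general, if some coordinates are
reversible, $q_i$ are related to $Q_i$ by equations of the form

(11) ▲  $$\bar q_i(s) = \sum_{j=1}^{n}\left(\sum_{k=1}^{m}\frac{A_{ij}^{(k)}}{s + \lambda_k} + A_{ij}\right)\bar Q_j,$$

(12) ▲  $$q_i(t) = \sum_{j=1}^{n}\left[\sum_{k=1}^{m} A_{ij}^{(k)}\int_0^t e^{-\lambda_k(t - \tau)}\,Q_j(\tau)\,d\tau + A_{ij}\,Q_j(t)\right],$$

where

(13) ▲  $$A_{ij}^{(k)} = A_{ji}^{(k)}, \qquad A_{ij} = A_{ji}.$$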





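Problem 13.2. Write Eq. (13.6:1) in the form of partitioned matrices

(14)  $$\begin{bmatrix}\mathbf{A} & \mathbf{F}\\ \mathbf{C} & \mathbf{H}\end{bmatrix}\begin{Bmatrix}\mathbf{q}^{(1)}\\ \mathbf{q}^{(2)}\end{Bmatrix} + \begin{bmatrix}\mathbf{B} & \mathbf{0}\\ \mathbf{0} & \mathbf{0}\end{bmatrix}\begin{Bmatrix}\dot{\mathbf{q}}^{(1)}\\ \dot{\mathbf{q}}^{(2)}\end{Bmatrix} = \begin{Bmatrix}\mathbf{Q}^{(1)}\\ \mathbf{Q}^{(2)}\end{Bmatrix},$$

where

$$\mathbf{q}^{(1)} = \begin{Bmatrix} q_1\\ \vdots\\ q_m\end{Bmatrix}, \quad \mathbf{Q}^{(1)} = \begin{Bmatrix} Q_1\\ \vdots\\ Q_m\end{Bmatrix}, \quad \mathbf{q}^{(2)} = \begin{Bmatrix} q_{m+1}\\ \vdots\\ q_n\end{Bmatrix}, \quad \mathbf{Q}^{(2)} = \begin{Bmatrix} Q_{m+1}\\ \vdots\\ Q_n\end{Bmatrix},$$

$$\mathbf{A} = \begin{bmatrix} a_{11} & a_{12} & \cdots & a_{1m}\\ \vdots & & & \vdots\\ a_{m1} & a_{m2} & \cdots & a_{mm}\end{bmatrix}, \quad \mathbf{B} = \begin{bmatrix} b_{11} & b_{12} & \cdots & b_{1m}\\ \vdots & & & \vdots\\ b_{m1} & b_{m2} & \cdots & b_{mm}\end{bmatrix},$$

$$\mathbf{F} = \begin{bmatrix} a_{1,m+1} & \cdots & a_{1n}\\ \vdots & & \vdots\\ a_{m,m+1} & \cdots & a_{mn}\end{bmatrix}, \quad \mathbf{C} = \begin{bmatrix} a_{m+1,1} & \cdots & a_{m+1,m}\\ \vdots & & \vdots\\ a_{n1} & \cdots & a_{nm}\end{bmatrix}, \quad \mathbf{H} = \begin{bmatrix} a_{m+1,m+1} & \cdots & a_{m+1,n}\\ \vdots & & \vdots\\ a_{n,m+1} & \cdots & a_{nn}\end{bmatrix}.$$

Assume that $\det|\mathbf{H}| \neq 0$ so that $\mathbf{H}^{-1}$ exists and that at t = 0, $\mathbf{q}^{(1)} = 0$, $\mathbf{q}^{(2)} = 0$.
Show that if Eq. (13.6:4) is written as

$$\mathbf{A}'\cdot\mathbf{q}^{(1)} + \mathbf{B}\cdot\dot{\mathbf{q}}^{(1)} = \mathbf{Q}',$$

then

$$\mathbf{A}' = \mathbf{A} - \mathbf{F}\mathbf{H}^{-1}\mathbf{C}, \qquad \mathbf{Q}' = \mathbf{Q}^{(1)} - \mathbf{F}\mathbf{H}^{-1}\mathbf{Q}^{(2)}.$$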

Let 4' be a square matrix formed by the columns of relaxation modes, 4> T be the 
transpose of 4', and A be a diagonal matrix of the relaxation spectrum: 


(15) 
then 

(16) 
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= 


W 2) • 


, A = 

Ai 

0 

0 • 
A 2 ■ 

• 0 ' 

• 0 


A" 

■ 

• \ m) . 


0 

0 ■ 

■ A m 


Then the Laplace transforms of q^, q^, Q (1) , Q 1 ' 2 * are related by 
q (1) (s) = ^(A + sI) _1 ’4' T (Q (1) - FH _1 Q< 2) ) , 

(17) 

q (2) (s) = H 1 Q (2) - H _1 C*(A + 5l)- 1 4' T (Q (1) - FH _1 Q (2) ), 


where 


(A + sI)- 1 = 


s + Aj 


O'- 0 


0 •" 0 


s + A m -I 


13.8. HIDDEN VARIABLES AND THE 

FORCE-DISPLACEMENT RELATIONSHIP 

In many physical problems a great many variables are “hidden,” that
is, we do not observe them. For example, we may have a system with
$n$ variables in which only $k$ internal forces $Q_1, \ldots, Q_k$, corresponding to
the displacements $q_1, \ldots, q_k$, are examined, while the remaining coordinates
constitute an “internal system.” An example of this occurs in the
stress-strain relationship of a polycrystalline metal with stresses $Q_1 = \sigma_{11}$,
$Q_2 = \sigma_{12} = \sigma_{21}, \ldots, Q_6 = \sigma_{33}$, strains $q_1 = e_{11}$, $q_2 = e_{12} = e_{21}, \ldots, q_6 = e_{33}$;
whereas the intercrystalline thermal currents, movement of interstitial
atoms, twinning, etc., are represented by $q_7, q_8, \ldots, q_n$, for which no
corresponding external forces are applied. We wish to know the influence of
these hidden variables on the stress-strain relationship.

In the general case, some of the hidden variables $q_{k+1}, q_{k+2}, \ldots, q_n$ may
not participate in the entropy production: they may be reversible. These
reversible hidden variables are related to the other variables by linear
algebraic equations, and can be eliminated easily. To simplify our expressions
we shall assume that this elimination has been done so that every flux
$\dot{q}_{k+1}, \dot{q}_{k+2}, \ldots, \dot{q}_n$ participates in the entropy production.
We now divide all the variables into two groups:

(1)    $\mathbf{q}^{(1)} = [q_1, q_2, \ldots, q_k]^T , \qquad \mathbf{q}^{(2)} = [q_{k+1}, q_{k+2}, \ldots, q_n]^T ,$

where $\mathbf{q}^{(1)}$ is the vector to be observed, the elements of which correspond
to the generalized forces $Q_1, \ldots, Q_k$ written as a vector

(2)    $\mathbf{Q} = [Q_1, Q_2, \ldots, Q_k]^T .$

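We may now write the equations of evolution in the form of partitioned
matrices:

(3)    $\begin{bmatrix} \mathbf{A}_{11} & \mathbf{A}_{12} \\ \mathbf{A}_{12}^T & \mathbf{A}_{22} \end{bmatrix} \begin{bmatrix} \mathbf{q}^{(1)} \\ \mathbf{q}^{(2)} \end{bmatrix} + \begin{bmatrix} \mathbf{B}_{11} & \mathbf{B}_{12} \\ \mathbf{B}_{12}^T & \mathbf{B}_{22} \end{bmatrix} \begin{bmatrix} \dot{\mathbf{q}}^{(1)} \\ \dot{\mathbf{q}}^{(2)} \end{bmatrix} = \begin{bmatrix} \mathbf{Q} \\ \mathbf{0} \end{bmatrix} ,$

where a superscript $T$ indicates the transpose of the matrix. The submatrices
$\mathbf{A}_{11}$, $\mathbf{A}_{22}$, $\mathbf{B}_{11}$, $\mathbf{B}_{22}$ are symmetric. The matrix $\mathbf{B}_{22}$ is positive definite
because we have assumed that every element of $\mathbf{q}^{(2)}$ participates in the
entropy production.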

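Our problem is to eliminate $\mathbf{q}^{(2)}$ and relate $\mathbf{Q}$ to $\mathbf{q}^{(1)}$ directly. Let us
first consider the subsystem

(4)    $\mathbf{A}_{22} \mathbf{q}^{(2)} + \mathbf{B}_{22} \dot{\mathbf{q}}^{(2)} = 0 .$

Since $\mathbf{B}_{22}$ is a positive definite square matrix of order $n - k$, there exist $n - k$
linearly independent orthonormal relaxation modes $\psi^{(1)}, \psi^{(2)}, \ldots, \psi^{(n-k)}$,
corresponding to the real-valued eigenvalues $\lambda_1, \lambda_2, \ldots, \lambda_{n-k}$. Let $\mathbf{\Psi}$ be a
square matrix composed of the columns of relaxation modes,

(5)    $\mathbf{\Psi} = [\psi^{(1)} \ \psi^{(2)} \ \cdots \ \psi^{(n-k)}] = \begin{bmatrix} \psi_1^{(1)} & \cdots & \psi_1^{(n-k)} \\ \vdots & & \vdots \\ \psi_{n-k}^{(1)} & \cdots & \psi_{n-k}^{(n-k)} \end{bmatrix} ,$

and let the relaxation modes be so normalized that

(6)    $\mathbf{\Psi}^T \mathbf{B}_{22} \mathbf{\Psi} = \mathbf{I} , \qquad \mathbf{\Psi}^T \mathbf{A}_{22} \mathbf{\Psi} = \mathbf{\Lambda} ,$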
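where $\mathbf{I}$ is a unit matrix and $\mathbf{\Lambda}$ is diagonal:

(7)    $\mathbf{I} = \begin{bmatrix} 1 & \cdots & 0 \\ \vdots & \ddots & \vdots \\ 0 & \cdots & 1 \end{bmatrix} , \qquad \mathbf{\Lambda} = \begin{bmatrix} \lambda_1 & \cdots & 0 \\ \vdots & \ddots & \vdots \\ 0 & \cdots & \lambda_{n-k} \end{bmatrix} .$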


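By introducing the normal coordinates $\boldsymbol{\xi}$ defined by the relation

(8)    $\mathbf{q}^{(2)} = \mathbf{\Psi} \boldsymbol{\xi} ,$

we are ready to simplify Eq. (3). Let us introduce Laplace transformation
with respect to time for the $q$'s and $Q$'s:

(9)    $\bar{q}_i = \int_0^\infty e^{-st} q_i(t)\, dt ,$

etc., and write $\bar{\mathbf{q}}^{(1)}$ for the column matrix whose elements are $\bar{q}_1, \bar{q}_2, \ldots, \bar{q}_k$,
etc. For our purpose it is sufficient to consider zero initial condition, $\mathbf{q}^{(1)} = 0$,
$\mathbf{q}^{(2)} = 0$, when $t = 0$. Then, since the Laplace transformation of $\dot{q}_i$
is equal to $s\bar{q}_i$, we can reduce the differential Eq. (3) into the algebraic
equation

(10)    $\left( \begin{bmatrix} \mathbf{A}_{11} & \mathbf{A}_{12} \\ \mathbf{A}_{12}^T & \mathbf{A}_{22} \end{bmatrix} + s \begin{bmatrix} \mathbf{B}_{11} & \mathbf{B}_{12} \\ \mathbf{B}_{12}^T & \mathbf{B}_{22} \end{bmatrix} \right) \begin{bmatrix} \bar{\mathbf{q}}^{(1)} \\ \bar{\mathbf{q}}^{(2)} \end{bmatrix} = \begin{bmatrix} \bar{\mathbf{Q}} \\ \mathbf{0} \end{bmatrix} .$
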

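Finally, let us define an $n \times n$ matrix

(11)    $\mathbf{\Phi} = \begin{bmatrix} \mathbf{I}^{(1)} & \mathbf{0} \\ \mathbf{0} & \mathbf{\Psi} \end{bmatrix} .$

Then it is easy to verify, on account of Eqs. (6) and (7), that

(12)    $\begin{bmatrix} \bar{\mathbf{q}}^{(1)} \\ \bar{\mathbf{q}}^{(2)} \end{bmatrix} = \mathbf{\Phi} \begin{bmatrix} \bar{\mathbf{q}}^{(1)} \\ \bar{\boldsymbol{\xi}} \end{bmatrix} ,$

(13)    $\mathbf{\Phi}^T \begin{bmatrix} \mathbf{A}_{11} & \mathbf{A}_{12} \\ \mathbf{A}_{12}^T & \mathbf{A}_{22} \end{bmatrix} \mathbf{\Phi} = \begin{bmatrix} \mathbf{A}_{11} & \mathbf{A}_{12} \mathbf{\Psi} \\ \mathbf{\Psi}^T \mathbf{A}_{12}^T & \mathbf{\Lambda} \end{bmatrix} ,$

(14)    $\mathbf{\Phi}^T \begin{bmatrix} \mathbf{B}_{11} & \mathbf{B}_{12} \\ \mathbf{B}_{12}^T & \mathbf{B}_{22} \end{bmatrix} \mathbf{\Phi} = \begin{bmatrix} \mathbf{B}_{11} & \mathbf{B}_{12} \mathbf{\Psi} \\ \mathbf{\Psi}^T \mathbf{B}_{12}^T & \mathbf{I} \end{bmatrix} .$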
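Premultiplying Eq. (10) by $\mathbf{\Phi}^T$, substituting Eq. (12), and using Eqs. (13)
and (14), we obtain

(15)    $\begin{bmatrix} \mathbf{A}_{11} + s\mathbf{B}_{11} & \mathbf{A}_{12} \mathbf{\Psi} + s\mathbf{B}_{12} \mathbf{\Psi} \\ \mathbf{\Psi}^T \mathbf{A}_{12}^T + s\mathbf{\Psi}^T \mathbf{B}_{12}^T & \mathbf{\Lambda} + s\mathbf{I} \end{bmatrix} \begin{bmatrix} \bar{\mathbf{q}}^{(1)} \\ \bar{\boldsymbol{\xi}} \end{bmatrix} = \begin{bmatrix} \bar{\mathbf{Q}} \\ \mathbf{0} \end{bmatrix} .$
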

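Expanding by rows, we see that this is equivalent to two equations:

(16)    $(\mathbf{A}_{11} + s\mathbf{B}_{11}) \bar{\mathbf{q}}^{(1)} + (\mathbf{A}_{12} + s\mathbf{B}_{12}) \mathbf{\Psi} \bar{\boldsymbol{\xi}} = \bar{\mathbf{Q}} ,$

(17)    $\mathbf{\Psi}^T (\mathbf{A}_{12}^T + s\mathbf{B}_{12}^T) \bar{\mathbf{q}}^{(1)} + (\mathbf{\Lambda} + s\mathbf{I}) \bar{\boldsymbol{\xi}} = 0 .$

From Eq. (17), we have

(18)    $\bar{\boldsymbol{\xi}} = -(\mathbf{\Lambda} + s\mathbf{I})^{-1} \mathbf{\Psi}^T (\mathbf{A}_{12}^T + s\mathbf{B}_{12}^T) \bar{\mathbf{q}}^{(1)} .$

A substitution back into Eq. (16) then gives the final result:

(19)    $\bar{\mathbf{Q}} = [\mathbf{A}_{11} + s\mathbf{B}_{11} - (\mathbf{A}_{12} + s\mathbf{B}_{12}) \mathbf{\Psi} (\mathbf{\Lambda} + s\mathbf{I})^{-1} \mathbf{\Psi}^T (\mathbf{A}_{12}^T + s\mathbf{B}_{12}^T)] \bar{\mathbf{q}}^{(1)} .$

It is clear that the matrix in the brackets is symmetric. The inverse matrix
$(\mathbf{\Lambda} + s\mathbf{I})^{-1}$ is simply

(20)    $(\mathbf{\Lambda} + s\mathbf{I})^{-1} = \begin{bmatrix} \frac{1}{s + \lambda_1} & 0 & \cdots & 0 \\ 0 & \frac{1}{s + \lambda_2} & \cdots & 0 \\ \vdots & & \ddots & \vdots \\ 0 & 0 & \cdots & \frac{1}{s + \lambda_{n-k}} \end{bmatrix} .$
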

The general term in Eq. (19) may be written in the form 


(21) A 
where 


<?< = £ 

j=i 


+ sc‘. 


•a + Xv 


J a ) 

q 


S + Aa 


( 22 ) A <* = <*, c'ij = Cji, <£>=<£>. 

By taking the inverse transform, we obtain the general form of the gener¬ 
alized force-generalized displacement relationship, 


(23) A Qi(t) = J2 


j=l 


c ijQj(t) + 


n—fc -i 

zwf 

a=l 


e T ^j(r)dr 
Fig. 13.8:1. Model suggested by Eq. (13.8:23).

If this equation is interpreted in terms of mechanical models, we see 
that any relaxation phenomena may be represented by a spring, a dashpot, 
and a sum of a great many elements made up of a Maxwell type material 
(Fig. 13.8:1). 
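
The elimination of the hidden variables in Eq. (19) can be verified numerically. The sketch below assumes symmetric positive definite A and B partitioned as in Eq. (3) and a real transform variable s > 0; it builds the bracketed matrix of Eq. (19) from the relaxation modes and compares it with a direct elimination of the hidden block. The function names are illustrative assumptions.

```python
import numpy as np

def effective_stiffness(A11, A12, A22, B11, B12, B22, s):
    """Bracketed matrix of Eq. (19), built from the relaxation modes of Eq. (4)."""
    L = np.linalg.cholesky(B22)                   # B22 = L L^T
    Linv = np.linalg.inv(L)
    lam, Y = np.linalg.eigh(Linv @ A22 @ Linv.T)
    Psi = Linv.T @ Y                              # Psi^T B22 Psi = I, Psi^T A22 Psi = diag(lam)
    G = (A12 + s * B12) @ Psi                     # (A12 + s B12) Psi
    # Dividing column a of G by (lam_a + s) applies (Lambda + s I)^{-1}, Eq. (20).
    return A11 + s * B11 - (G / (lam + s)) @ G.T

def direct_condensation(A11, A12, A22, B11, B12, B22, s):
    """Same operator obtained by eliminating q^(2) from Eq. (10) without using modes."""
    G = A12 + s * B12
    return A11 + s * B11 - G @ np.linalg.solve(A22 + s * B22, G.T)
```

The two routes agree because the normalization in Eq. (6) implies that $\mathbf{\Psi} (\mathbf{\Lambda} + s\mathbf{I})^{-1} \mathbf{\Psi}^T$ is the inverse of $\mathbf{A}_{22} + s\mathbf{B}_{22}$.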

13.9. ANISOTROPIC LINEAR VISCOELASTIC MATERIALS 

Let the six independent components of the stress tensor $\sigma_{ij}$ play the
role of the generalized forces $Q_1, \ldots, Q_6$ and let the six independent
components of the strain tensor $e_{ij}$ play the role of the generalized coordinates
$q_1, \ldots, q_6$. According to the results of Sec. 13.8, the effect of relaxation
modes renders the stress-strain relationship into the form (with summation
convention used)

(1) ▲    $\bar{\sigma}_{ij} = \bar{C}_{ij}^{kl}(s)\, \bar{e}_{kl} ,$

with

(2) ▲    $\bar{C}_{ij}^{kl}(s) = \sum_{\alpha=1}^{n} \frac{D_{ij}^{kl(\alpha)}}{s + \lambda_\alpha} + D_{ij}^{kl} + s D_{ij}^{\prime kl} .$

The restrictions imposed by thermodynamics are

(3) ▲    $\bar{C}_{ij}^{kl} = \bar{C}_{ij}^{lk} = \bar{C}_{ji}^{kl} , \qquad \bar{C}_{ij}^{kl} = \bar{C}_{kl}^{ij} .$

Strictly speaking, the last two Laplace transform terms in Eq. (2) have
no inverse in the time domain. But the inverse of $\bar{C}_{ij}^{kl}/s^2$ exists. Hence, if
$s^2 \bar{e}_{kl}$ has a Laplace transform, the inverse of $\bar{\sigma}_{ij}$ given by Eq. (1) can still
be defined. However, as the delta function and higher order delta functions
are often used in mathematical physics as generalized functions, we can
formally write

(4)    $C_{ij}^{kl}(t) = \sum_{\alpha} D_{ij}^{kl(\alpha)} e^{-\lambda_\alpha t}\, \mathbf{1}(t) + D_{ij}^{kl}\, \delta(t) + D_{ij}^{\prime kl}\, \delta_1(t) ,$

where $\mathbf{1}(t)$, $\delta(t)$, $\delta_1(t)$ are, respectively, the unit-step function, the delta
function, and the first-order delta function. The inverse of (1) then gives,
formally,

(5) ▲    $\sigma_{ij}(t) = \sum_{\alpha} \int_0^t e^{-\lambda_\alpha (t - \tau)} D_{ij}^{kl(\alpha)} e_{kl}(\tau)\, d\tau + D_{ij}^{kl} e_{kl}(t) + D_{ij}^{\prime kl} \dot{e}_{kl}(t) .$

A further generalization is possible if the hidden variables are so numerous
that the summation over $\alpha$ is replaced by an integration over $\lambda_\alpha$. See
M. A. Biot$^{13.1}$ (1956).

On the other hand, in analogy to the Neumann-Duhamel generalization
of Hooke’s law in thermoelasticity, we may wish to include the deviation of
local temperature from the uniform reference temperature, $T - T_0$, as an
observable variable in the stress-strain relationship. This can be done by
adding another coordinate $q_7$, to be the local temperature deviation, while
the corresponding force $Q_7$ is taken to mean the local value of entropy per
unit volume above that at the reference state.

PROBLEMS 

13.3. Specialize the results of Sec. 13.9 to the case of an isotropic material.
(See M. A. Biot,$^{13.1}$ 1954.)

13.4. From Eq. (13.7:9) or (13.7:11), derive the form of the coefficients $\bar{A}_{ij}^{kl}$
that relates strain to stress:

$\bar{e}_{ij} = \bar{A}_{ij}^{kl}(s)\, \bar{\sigma}_{kl} .$


13.5. If, for purely mathematical generality, we write the most general linear 
relation between the Laplace transforms of stress and strain as 

where s is the Laplace transformation variable, and 
$\bar{\sigma}_{\mu\nu}(s) = \int_0^\infty e^{-st} \sigma_{\mu\nu}(t)\, dt ,$

etc., enumerate the ways that this general expression may not satisfy
thermodynamic requirements. (See Biot,$^{13.1}$ 1958.)
Basic equations of thermoelasticity will be discussed in this chapter, and
their applications will be illustrated in some typical problems.


14.1. BASIC EQUATIONS 

We shall consider a solid body subjected to external forces and heating.
We assume that the material is linear elastic, and that it is stress-free at a
uniform temperature $T_0$ when all external forces are removed. The stress-free
state will be referred to as the reference state, and the temperature $T_0$
as the reference temperature. A system of rectangular Cartesian coordinates
$x_i$ will be chosen. The displacement $u_i$ of every particle in the instantaneous
state from its position in the reference state will be assumed to be small,
so that the infinitesimal strain components are

(1)    $e_{ij} = \frac{1}{2} (u_{i,j} + u_{j,i}) , \quad i, j = 1, 2, 3 .$

The instantaneous absolute temperature will be denoted by $T$, and the
difference $T - T_0$ by $\theta$:

(2)    $\theta = T - T_0 .$

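Under these conditions the basic equations of thermoelasticity are:

The Constitutive Equation. Duhamel-Neumann law. Eq. (12.7:4).

(3)    $\sigma_{ij} = C_{ijkl} e_{kl} - \beta_{ij} (T - T_0) .$

Conservation of Mass. Continuity equation.

(4)    $\frac{\partial \rho}{\partial t} + \frac{\partial (\rho v_i)}{\partial x_i} = 0 .$

Conservation of Momentum. Newton’s law.

(5)    $\rho \dot{v}_i = \frac{\partial \sigma_{ij}}{\partial x_j} + X_i , \qquad v_i = \frac{\partial u_i}{\partial t} .$

Conservation of Energy. Eq. (12.2:7).

(6)    $\dot{\mathscr{E}} = T \dot{\mathscr{S}} + \frac{1}{\rho} \sigma_{ij} V_{ij} , \qquad V_{ij} = \frac{1}{2} (v_{i,j} + v_{j,i}) .$

Rate of Change of Entropy. Eqs. (13.1:2) and (13.2:6).

(7)    $\rho \dot{\mathscr{S}} = -\frac{1}{T} \frac{\partial h_i}{\partial x_i} = -\frac{\partial}{\partial x_i} \left( \frac{h_i}{T} \right) - \frac{h_i}{T^2} \frac{\partial T}{\partial x_i} .$

Heat Conduction. Fourier’s law. Eqs. (13.3:4) and (13.4:10).

(8)    $h_i = -k_{ij} \frac{\partial T}{\partial x_j} .$

Definition of Specific Heat. If $\dot{e}_{ij} = 0$ ($i, j = 1, 2, 3$), then

(9)    $-\frac{\partial h_i}{\partial x_i} = \rho C_v \frac{\partial T}{\partial t} .$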

In these equations, all the indices range over 1, 2, 3; $\rho$ is the mass density,
$\sigma_{ij}$ are components of the stress tensor, $C_{ijkl}$ are the elastic moduli; $\beta_{ij}$
are the thermal moduli; $v_i$ are components of the velocity vector; $X_i$ are
components of the body force per unit volume; $\mathscr{E}$ is the internal energy
per unit mass; $\mathscr{S}$ is the entropy per unit mass; $h_i$ are the components of
the heat flux vector; $k_{ij}$ are the heat conduction coefficients; and $C_v$ the
heat capacity per unit mass at constant volume of the solid. A dot above
a variable denotes the material derivative of that variable:

$\frac{D}{Dt} = \frac{\partial}{\partial t} + v_j \frac{\partial}{\partial x_j} ,$

which, under the approximation of small velocities, is the same as the partial 
derivative with respect to time. In this chapter, we assume 


A comma indicates partial differentiation with respect to a spatial coordinate,
thus $\theta_{,i}$ means $\partial\theta/\partial x_i$. Summation convention for repeated indices is
used. Equation (3) is discussed in Sec. 12.7; Eqs. (4) and (5) are discussed
in Sec. 5.4; Eq. (6) is discussed in Sec. 12.2; Eq. (7) is discussed in Secs. 13.1
and 13.2, where $h_i/T$ is identified as the rate of change of entropy displacement
and $-(h_i/T^2)\, \partial T/\partial x_i$ as the entropy production. We have assumed
in Eqs. (7) and (9) that there is no heat source in the material; otherwise,
to the right-hand side of these equations we should add a term representing
the strength of the heat source (heat generation per unit time per unit
volume). Equation (8) is discussed in Sec. 13.3. The coefficients $C_{ijkl}$, $\beta_{ij}$,
and $k_{ij}$ have the following symmetry properties as discussed in Secs. 12.7
and 13.3.

(10)    $C_{ijkl} = C_{klij} = C_{ijlk} = C_{jikl} ,$

(11)    $\beta_{ij} = \beta_{ji} ,$

(12)    $k_{ij} = k_{ji} .$

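The energy equation may be put in a more convenient form as follows.
On introducing the free energy $\mathscr{F}$,

$\mathscr{F} = \mathscr{E} - T \mathscr{S} ,$

we obtain, according to Eq. (12.3:6),

(13)    $\mathscr{S} = -\frac{\partial \mathscr{F}}{\partial T} , \qquad \sigma_{ij} = \rho \frac{\partial \mathscr{F}}{\partial e_{ij}} .$

With Eq. (13), Eq. (7) can be written as

(14)    $-\frac{1}{T} \frac{\partial h_i}{\partial x_i} = \rho \dot{\mathscr{S}} = \rho \frac{\partial \mathscr{S}}{\partial e_{ij}} \dot{e}_{ij} + \rho \frac{\partial \mathscr{S}}{\partial T} \dot{T} = -\rho \frac{\partial^2 \mathscr{F}}{\partial e_{ij} \partial T} \dot{e}_{ij} - \rho \frac{\partial^2 \mathscr{F}}{\partial T^2} \dot{T} .$

A multiplication by $T$ and comparison with Eq. (9) when $\dot{e}_{ij} = 0$ yields

(15)    $C_v = -T \frac{\partial^2 \mathscr{F}}{\partial T^2} .$

Also, from Eqs. (13) and (3), it is seen that

(16)    $\rho \frac{\partial^2 \mathscr{F}}{\partial e_{ij} \partial T} = \frac{\partial \sigma_{ij}}{\partial T} = -\beta_{ij} .$

Equation (14) therefore takes the form

(17) ▲    $-\frac{\partial h_i}{\partial x_i} = \rho C_v \dot{T} + T \beta_{ij} \dot{e}_{ij} ,$

which reduces to Eq. (9) if $\dot{e}_{ij} = 0$. Finally, on substituting Eq. (8) into
Eq. (17), we obtain

(18)    $\frac{\partial}{\partial x_i} \left( k_{ij} \frac{\partial T}{\partial x_j} \right) = \rho C_v \dot{T} + T \beta_{ij} \dot{e}_{ij} .$

If the material is isotropic, these equations are simplified as follows.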


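(19)    $\sigma_{ij} = \lambda e_{kk} \delta_{ij} + 2G e_{ij} - \beta \delta_{ij} \theta ,$

(20)    $\beta_{ij} = \beta \delta_{ij} , \qquad \beta = \frac{\alpha E}{1 - 2\nu} = (3\lambda + 2G) \alpha ,$

(21)    $k_{ij} = k \delta_{ij} .$

The constant $\alpha$ is the thermal coefficient of linear expansion and $k$ is the
heat conductivity.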

A problem in thermoelasticity is formulated when appropriate boundary 
conditions and initial conditions are specified. Several simple examples will 
be discussed below. A proof for the existence and uniqueness of solution 
under suitable continuity conditions can be constructed in a way analo¬ 
gous to that discussed in Sec. 7.4; an explicit proof in the case of isotropic 
materials can be found in Boley and Weiner$^{14.1}$, pp. 38-40.


14.2. THERMAL EFFECTS DUE TO A CHANGE OF 
STRAIN; KELVIN’S FORMULA 

It is a familiar fact that an adiabatic expansion of a gas is accompanied 
by a drop in its temperature. Similarly, a solid body changes its temper¬ 
ature when the state of strain of the body is altered adiabatically. For a 
material like steel, there will be a fall of temperature when the body is 
strained to expand adiabatically. 

Equation (14.1:18) gives the relationship between the rate of change of
the temperature and the strain with the heat conduction. If heat conduction
is prevented ($\partial h_i/\partial x_i = 0$), the left-hand side of Eq. (14.1:18) should vanish,
and we obtain at once

(1)    $\frac{\partial T}{\partial t} = -\frac{T}{\rho C_v} \beta_{ij} \frac{\partial e_{ij}}{\partial t} ,$


which is Kelvin’s formula for the change of temperature of an insulated 
body due to strain rate. 


14.3. RATIO OF ADIABATIC TO ISOTHERMAL 
ELASTIC MODULI 

If an elastic modulus is measured on a sample that is completely insu¬ 
lated, or if the change in strain takes place so rapidly that the heat does 
not have time to escape, the measured value may be called the adiabatic 
modulus. On the other hand, it is called the isothermal modulus if it is 
measured in such a way that the temperature is kept uniform and constant
throughout the process.

The coefficients $C_{ijkl}$ in the Duhamel-Neumann law (14.1:3) are the
isothermal moduli of elasticity at $\theta = 0$:

(1)    $\sigma_{ij} = C_{ijkl} e_{kl} - \beta_{ij} \theta , \qquad \theta = T - T_0 .$

If a deformation is adiabatic, and no heat conduction takes place, then
Eq. (14.1:17) gives

(2)    $\rho C_v \dot{\theta} + T_0 \beta_{ij} \dot{e}_{ij} = 0 ,$

or

(3)    $\rho C_v \theta + T_0 \beta_{kl} e_{kl} = \text{const} .$

The constant of integration is zero if $e_{ij} = 0$ when $\theta = 0$. On solving Eq. (3)
for $\theta$ and substituting into Eq. (1), we obtain in an adiabatic process

(4)    $\sigma_{ij} = \left( C_{ijkl} + \frac{T_0}{\rho C_v} \beta_{ij} \beta_{kl} \right) e_{kl} = C'_{ijkl} e_{kl} .$

Thus, the adiabatic modulus of elasticity $C'_{ijkl}$ is related to the isothermal
modulus $C_{ijkl}$ by the equation

(5)    $C'_{ijkl} = C_{ijkl} + \frac{T_0}{\rho C_v} \beta_{ij} \beta_{kl} .$
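
Equation (5) is easy to evaluate for an isotropic solid. The sketch below is an illustration under stated assumptions: it uses 6 x 6 Voigt matrices with engineering shear strains, SI units so that the mechanical equivalent of heat is unity, and rough handbook-style values for steel that are assumptions and do not come from the text. The closed form in `adiabatic_compliance_closed_form` follows from applying the Sherman-Morrison identity to Eq. (5); it is offered as a check, not as the book's own formula.

```python
import numpy as np

def isotropic_stiffness(E, nu):
    """6x6 isothermal stiffness in Voigt notation (engineering shear strains)."""
    lam = E * nu / ((1 + nu) * (1 - 2 * nu))
    G = E / (2 * (1 + nu))
    C = np.zeros((6, 6))
    C[:3, :3] = lam
    C[np.arange(3), np.arange(3)] += 2 * G
    C[np.arange(3, 6), np.arange(3, 6)] = G
    return C

def adiabatic_stiffness(C, beta_voigt, T0, rho_cv):
    """Eq. (5): C' = C + (T0 / (rho C_v)) beta beta^T."""
    return C + (T0 / rho_cv) * np.outer(beta_voigt, beta_voigt)

def youngs_modulus(C):
    """Young's modulus from the 11-component of the compliance matrix."""
    return 1.0 / np.linalg.inv(C)[0, 0]

def adiabatic_compliance_closed_form(E, nu, alpha, T0, rho_cv):
    """1/E' obtained from Eq. (5) via the Sherman-Morrison identity (an added derivation)."""
    beta = alpha * E / (1 - 2 * nu)
    return 1.0 / E - alpha**2 * T0 / (rho_cv + 3 * alpha * beta * T0)

# Assumed illustrative values for steel (SI units), not taken from the text.
E, nu, alpha, T0 = 200e9, 0.3, 1.2e-5, 293.0
rho_cv = 7800.0 * 460.0                      # J/(m^3 K)
beta = alpha * E / (1 - 2 * nu) * np.array([1.0, 1.0, 1.0, 0.0, 0.0, 0.0])
E_ad = youngs_modulus(adiabatic_stiffness(isotropic_stiffness(E, nu), beta, T0, rho_cv))
print(E_ad / E)
```

With these assumed values the ratio of adiabatic to isothermal Young's modulus exceeds unity by a few parts in a thousand, so the distinction matters mainly for precise dynamic measurements.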



PROBLEMS 

14.1. Derive the relationship of the temperature and the elastic constant
$(\partial p/\partial V)_T$ for an ideal gas, and compare qualitatively the temperature change in
an adiabatic expansion of a gas with the temperature change of a similar process
in a solid.

14.2. Show that the Young's modulus of steel determined by an adiabatic 
process such as the propagation of elastic waves or the longitudinal vibration of a 
rod is higher than that determined by a slow static test in which the strains are 
maintained constant for a sufficiently long time for the temperature to become 
uniform. 

14.3. For steel at $T = 274.7$°K, or nearly 1.6°C, $\rho = 7.0$ gm/cm$^3$, $\alpha = 1.23 \times 10^{-5}$,
$C_v = 0.102$ cal/(gm-°C), show that $dT = -0.125$°C if a wire is
subjected adiabatically to a sudden increment of tensile stress of $1.09 \times 10^9$
dyne/sq cm. Over a century ago, Joule [Phil. Trans. Roy. Soc. London,
149 (1859), p. 91] gave an observed value $dT = -0.1620$°C for the above
case by experiments on cylindrical bars.

14.4. Consider simple tension and derive the following relationship between
the adiabatic and isothermal Young’s modulus, $E'$ and $E$, respectively:

$\frac{1}{E'} = \frac{1}{E} - \frac{\alpha^2 T_0}{J C_v \rho} ,$

which shows that the adiabatic modulus $E'$ is always greater than the isothermal
modulus $E$. In the equation above $\alpha$ is the linear coefficient of expansion, $T_0$ is
the equilibrium temperature, $J$ is the mechanical equivalent of heat, $C_v$ is the
specific heat per unit mass at zero strain rate, $\rho$ is the density of the material.

14.5. Show that the theoretical value of the ratio E'/E is of order 1.003 for 
steel and copper at room temperature. 
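
A quick order-of-magnitude check of the data in Problem 14.3 can be made in a few lines. The sketch assumes the leading-order uniaxial form of Kelvin's formula, $\Delta T \approx -\alpha T_0 \Delta\sigma / (\rho C_v J)$, which neglects a small correction from the temperature-induced strain, and it assumes $J = 4.186 \times 10^7$ erg/cal; both are assumptions of this illustration and are not quoted from the text.

```python
J_ERG_PER_CAL = 4.186e7  # assumed mechanical equivalent of heat, erg/cal

def kelvin_temperature_change(alpha, T0, d_sigma, rho, c_v_cal):
    """Leading-order adiabatic temperature change under a uniaxial stress increment.

    CGS units: d_sigma in dyne/cm^2, rho in gm/cm^3, c_v_cal in cal/(gm-degC).
    """
    return -alpha * T0 * d_sigma / (rho * c_v_cal * J_ERG_PER_CAL)

# Data quoted in Problem 14.3.
dT = kelvin_temperature_change(alpha=1.23e-5, T0=274.7, d_sigma=1.09e9,
                               rho=7.0, c_v_cal=0.102)
print(f"dT = {dT:.4f} degC")
```

The result is a cooling of roughly an eighth of a degree, close to the value stated in the problem and of the same order as Joule's measurement.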


14.4. UNCOUPLED, QUASI-STATIC THERMOELASTIC 
THEORY 

The basic equations given in Sec. 14.1 combine the theory of elasticity
with heat conduction under transient conditions. Boundary-value problems
involving these equations are rather difficult to solve. Fortunately, in most
engineering applications it is possible to omit the mechanical coupling term
in the energy Eq. (14.1:6) or in the heat conduction Eq. (14.1:18) and the
inertia term in the equation of motion (14.1:5) without significant error.
When these simplifying assumptions are introduced, the theory is referred
to as an uncoupled, quasi-static theory; it degenerates into heat conduction
and thermoelasticity as two separate problems.

A plausible argument for the smallness of the thermoelastic coupling
is as follows. We have seen in Sec. 14.2 that the change of temperature
of an elastic body due to adiabatic straining is, in general, very small. If
this interaction between strain and temperature is ignored, then the only
effects of elasticity on the temperature distribution are effects of change
in dimensions of the body under investigation. The change in dimension
of a body is of the order of the product of the linear dimension of the
body $L$, the temperature rise $\theta$, and the coefficient of thermal expansion
$\alpha$. If $L = 1$ in., $\theta = 1000$°F, $\alpha = 10^{-5}$ per °F, the change in dimension is
$10^{-2}$ in., which is negligible in problems of heat conduction.

Again, if the temperature rise from 0 to 1000°F were achieved in a time
interval of 0.1 sec, then the acceleration is of the order $10^{-2} \div (0.1)^2 =
1$ in./sec$^2$. The change of stress due to this acceleration may be estimated
from the equation of equilibrium

$\Delta\sigma_{xx} \approx \Delta x\, \rho\, \frac{\partial^2 u}{\partial t^2} .$

If the specific gravity of the material is ten and the material is 1 in. thick,
we have

$\Delta\sigma_{xx} = 1 \times 1 \times 10 \times 62.4 \times \frac{1}{12^3} \times \frac{1}{32.2 \times 12} \approx 0.001$ lb/sq in.

This stress is negligible in most structural problems in which the
magnitudes of the stresses concerned are of the order of the yielding stress or
ultimate stress of the material.
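
The arithmetic behind these estimates can be reproduced as follows. This is a minimal sketch in inch-pound-second units, assuming water weighs 62.4 lb per cubic foot and g = 32.2 ft/sec^2; the function names are illustrative assumptions.

```python
def thermal_growth(length_in, theta_F, alpha_per_F):
    """Change in dimension, of order L * theta * alpha (inches)."""
    return length_in * theta_F * alpha_per_F

def inertia_stress_psi(thickness_in, specific_gravity, accel_in_per_s2):
    """Stress change of order (thickness) * (mass density) * (acceleration), in lb/sq in."""
    weight_density = specific_gravity * 62.4 / 12**3   # lb per cubic inch
    mass_density = weight_density / (32.2 * 12)        # lb sec^2 / in^4
    return thickness_in * mass_density * accel_in_per_s2

growth = thermal_growth(1.0, 1000.0, 1e-5)   # change in dimension, in.
accel = growth / 0.1**2                      # in./sec^2 for a 0.1 sec rise time
stress = inertia_stress_psi(1.0, 10.0, accel)
print(growth, accel, stress)
```

Both the dimensional change and the inertia stress come out several orders of magnitude below the quantities that matter in a typical structural analysis, which is the basis of the uncoupled, quasi-static approximation.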

Some examples of the coupled theory are presented in Chapter 2 of
Boley and Weiner’s book.$^{14.1}$ It is pointed out that the thermomechanical
coupling is important in the problem of internal friction of metals.


14.5. TEMPERATURE DISTRIBUTION 

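We shall denote the temperature above a reference temperature by $\theta$,
the space coordinates by $(x, y, z)$, and time by $t$. In an uncoupled theory
for an isotropic material, Eq. (14.1:18) becomes

(1)    $\frac{\partial}{\partial x} \left( k \frac{\partial \theta}{\partial x} \right) + \frac{\partial}{\partial y} \left( k \frac{\partial \theta}{\partial y} \right) + \frac{\partial}{\partial z} \left( k \frac{\partial \theta}{\partial z} \right) = \rho C_v \frac{\partial \theta}{\partial t} .$

In the following discussion we shall assume $k$ and $C_v$ to be constants.
Then,

(2)    $k \left( \frac{\partial^2 \theta}{\partial x^2} + \frac{\partial^2 \theta}{\partial y^2} + \frac{\partial^2 \theta}{\partial z^2} \right) = \rho C_v \frac{\partial \theta}{\partial t} .$

In cylindrical coordinates $(x, r, \psi)$, we have

(3)    $k \left[ \frac{\partial^2 \theta}{\partial x^2} + \frac{\partial^2 \theta}{\partial r^2} + \frac{1}{r} \frac{\partial \theta}{\partial r} + \frac{1}{r^2} \frac{\partial^2 \theta}{\partial \psi^2} \right] = \rho C_v \frac{\partial \theta}{\partial t} .$

In spherical polar coordinates $(r, \psi, \varphi)$, we have

(4)    $k \left[ \frac{1}{r^2} \frac{\partial}{\partial r} \left( r^2 \frac{\partial \theta}{\partial r} \right) + \frac{1}{r^2 \sin\psi} \frac{\partial}{\partial \psi} \left( \sin\psi \frac{\partial \theta}{\partial \psi} \right) + \frac{1}{r^2 \sin^2\psi} \frac{\partial^2 \theta}{\partial \varphi^2} \right] = \rho C_v \frac{\partial \theta}{\partial t} .$

For the special case of steady heat flow, $\partial\theta/\partial t$ vanishes; then

(5)    $\nabla^2 \theta = 0 .$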


Numerous examples of solution of these equations can be found in books 
on the conduction of heat in solids, such as Carslaw and Jaeger, McAdams, 
etc. (See Bibliographies 14.1 and 14.2.) 

Example. Steady Temperature Distribution in a Disk Cooled by Air 
If there is a heat source in the material, of $Q$ Btu per unit volume per
unit time, the equation of heat conduction is modified into

(6)    $k \nabla^2 \theta + Q = \rho C_v \frac{\partial \theta}{\partial t} .$

An atomic reactor with a radiation source may present problems of this kind.
A turbine disk may also be described approximately by an equation of this
type. Consider a circular disk as shown in Fig. 14.5:1.

Fig. 14.5:1. A turbine disk. (Labels in the figure: hot gas, cooling, heat flux $h$.)

The heat flux from the rim represents the heat from the turbine blades
operating in hot combustion gas. The heat loss on the disk surface represents
the effect of cooling air. Let $\theta(r)$ be the average temperature (above the
cooling air) across the disk at any radius $r$. The heat loss per unit area on
each face is $-K\theta$. The total heat loss per unit volume is $Q = -2K\theta/b$. Hence,
for the axially symmetric temperature distribution, the heat conduction
equation at the steady state is

(7)    $\frac{d^2 \theta}{dr^2} + \frac{1}{r} \frac{d\theta}{dr} - \frac{2K}{kb} \theta = 0 .$

This is the differential equation for the modified Bessel functions. The
appropriate solution is

(8)    $\theta = A\, I_0\!\left( \sqrt{2K/kb}\; r \right) ,$

where $I_0(z)$ is the modified Bessel function of the first kind and zeroth
order.

The constant $A$ is determined by the boundary condition at the rim
$r = R$, where a heat flux $h$ enters the disk from the hot gas:

(9)    $k \frac{d\theta}{dr} = h$ at $r = R$.

Therefore,

(10)    $k \sqrt{\frac{2K}{kb}}\; A\, I_0'\!\left( \sqrt{\frac{2K}{kb}}\, R \right) = h ,$

where $I_0'$ is the derivative of $I_0$ and is equal to $I_1$. Thus, finally,

(11)    $\theta = \frac{h\, I_0\!\left( \sqrt{2K/kb}\; r \right)}{\sqrt{2Kk/b}\; I_1\!\left( \sqrt{2K/kb}\; R \right)} .$
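
The closed-form result in Eq. (11) can be evaluated and checked against Eqs. (7) and (9) numerically. The following is a minimal sketch that computes the modified Bessel functions from their power series, which is adequate for moderate arguments; the function names and the sample parameter values are assumptions for illustration.

```python
import math

def bessel_i(order, z, terms=60):
    """Modified Bessel function I_n(z), integer n >= 0, from its power series."""
    return sum((z / 2.0) ** (2 * m + order) / (math.factorial(m) * math.factorial(m + order))
               for m in range(terms))

def disk_temperature(r, R, h, k, K, b):
    """Eq. (11): steady temperature above the cooling air at radius r."""
    m = math.sqrt(2.0 * K / (k * b))          # note k * m = sqrt(2 K k / b)
    return h * bessel_i(0, m * r) / (k * m * bessel_i(1, m * R))

if __name__ == "__main__":
    # Assumed sample values in consistent units.
    R, h, k, K, b = 1.0, 2.0, 0.5, 1.5, 0.1
    for r in (0.0, 0.5, 1.0):
        print(r, disk_temperature(r, R, h, k, K, b))
```

Because $I_0$ grows rapidly with its argument, the temperature is concentrated near the rim when $\sqrt{2K/kb}\, R$ is large, that is, when surface cooling is strong compared with conduction along the disk.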
14.6. THERMAL STRESSES

In the uncoupled, quasi-static theory, the stress and strain fields are
computed for each instantaneous temperature distribution $\theta(x_1, x_2, x_3)$
according to Eqs. (14.1:1) to (14.1:5), with appropriate boundary conditions.
For an isotropic material, we have

(1)    $\sigma_{ij} = \lambda e_{kk} \delta_{ij} + 2G e_{ij} - \beta \delta_{ij} \theta ,$

(2)    $\sigma_{ij,j} + X_i = 0 .$

The strain field

(3)    $e_{ij} = \frac{1}{2} (u_{i,j} + u_{j,i})$

must satisfy the compatibility conditions

(4)    $e_{ij,kl} + e_{kl,ij} - e_{ik,jl} - e_{jl,ik} = 0 .$

As a conjugate to Eq. (1), we have

(5)    $e_{ij} = \frac{1 + \nu}{E} \sigma_{ij} - \frac{\nu}{E} \sigma_{kk} \delta_{ij} + \alpha \theta \delta_{ij} , \qquad \beta = \frac{E \alpha}{1 - 2\nu} .$

Let us first observe that if $\sigma_{ij} = 0$, then

(6)    $e_{ij} = \alpha \theta \delta_{ij}$

and Eq. (4) is reduced to

(7)    $\theta_{,kl} \delta_{ij} + \theta_{,ij} \delta_{kl} - \theta_{,jl} \delta_{ik} - \theta_{,ik} \delta_{jl} = 0 ,$

which is satisfied if

(8)    $\theta_{,ij} = 0 , \quad i, j = 1, 2, 3 .$

The solution is

(9)    $\theta = a_0 + a_1 x_1 + a_2 x_2 + a_3 x_3 ,$

with arbitrary coefficients $a_i$. Hence, if $\theta$ is a linear function of spatial
coordinates $x_1$, $x_2$, $x_3$, and if the displacements on the boundary are
unrestrained, then it is possible to satisfy the compatibility condition without
calling stresses into play. Generally, for an arbitrary temperature field, the
strain field corresponding to thermal expansion alone, Eq. (6), will not be
compatible, in which case thermal stresses must be called into action.

If we substitute Eqs. (1) and (3) into Eq. (2), we obtain the generalized
Navier’s equation

(10)    $G u_{i,jj} + (\lambda + G) u_{j,ji} + X_i - \beta \theta_{,i} = 0 .$
This is particularly convenient to use if the boundary condition is specified
in terms of displacements:

(11)    $u_i = f_i(x_1, x_2, x_3)$ on the boundary.

On the other hand, if tractions are specified on the boundary, i.e., if

(12)    $\overset{\nu}{T}_i = \sigma_{ij} \nu_j = g_i(x_1, x_2, x_3)$ on the boundary,

where $\nu_j$ is the outer normal vector to the boundary surface, and $g_i$'s are
specified, we must have

(13)    $\nu_j [\lambda e_{kk} \delta_{ij} + G u_{i,j} + G u_{j,i} - \beta \delta_{ij} \theta] = g_i$ on the boundary.

By comparing Eqs. (10)-(13) with the corresponding equations in linear
elasticity (Secs. 7.1 and 7.3), we see that the effect of the temperature
change $\theta$ is equivalent to replacing the body force $X_i$ in Navier’s equations
by $X_i - \beta \theta_{,i}$ and to substituting for the surface tractions $g_i$ by $g_i + \beta \theta \nu_i$.
Thus the displacements $u_1$, $u_2$, $u_3$ produced by a temperature change $\theta$ are
the same as those produced by the body forces $-\beta \theta_{,i}$ and the normal
tractions $\beta \theta$ acting on the surface of a body of the same shape but throughout
which the temperature is uniform. These facts can be stated in a theorem.*

Theorem: Duhamel-Neumann Analogy. Consider two bodies of exactly
the same shape but with conditions prescribed as shown in Fig. 14.6:1.
Then

$u_i^{(\mathrm{I})}(x_1, x_2, x_3; t) = u_i^{(\mathrm{II})}(x_1, x_2, x_3; t) ,$
$\sigma_{ij}^{(\mathrm{I})} = \sigma_{ij}^{(\mathrm{II})} - \beta \theta^{(\mathrm{I})} \delta_{ij} .$

Fig. 14.6:1. Duhamel-Neumann analogy.

*The Duhamel-Neumann theorem was stated in this form by A. J. A. Morgan.
Problem 14.6. Thermal Stresses in Plates with Clamped Edges. Consider
a flat plate of arbitrary shape clamped at all edges. Let the midplane of the plate
in the unloaded position be the $x$, $y$-plane and let the temperature be uniform
in the plane of the plate but variable throughout the thickness, i.e., $\theta = \theta(z)$.
Show that under these conditions the plate will remain flat, i.e., no bending will
be introduced except possibly near the edges. Determine the thermal stresses in
the plate. This example shows the extreme importance of the edge constraints
on the thermal stress problem.


14.7. PARTICULAR INTEGRAL-GOODIER’S METHOD 


By introducing suitable particular integrals, the problem of thermal
stresses can be reduced to the solution of the homogeneous Navier’s equations.
Let us assume a solution of Eq. (14.6:10) in the form

(1)    $u_i = \frac{\partial \phi}{\partial x_i} ,$

where $\phi$ is a displacement potential. Let us assume also that the body forces
are conservative so that

(2)    $X_i = -\frac{\partial P}{\partial x_i} ;$

then the generalized Navier’s Eq. (14.6:10) may be written as

(3)    $(\lambda + 2G) \phi_{,ikk} = P_{,i} + \beta \theta_{,i} .$

An integration yields

(4)    $\nabla^2 \phi = \frac{1}{\lambda + 2G} (P + \beta \theta) .$

The constant of integration may be absorbed in the potential $P$. Since
Eq. (4) is linear its solution may be written

(5)    $\phi = \phi^{(p)} + \phi^{(c)} ,$

where $\phi^{(p)}$ is a particular integral satisfying Eq. (4), and $\phi^{(c)}$ is the
complementary solution satisfying the equation

(6)    $\nabla^2 \phi^{(c)} = 0 .$

A particular integral can be taken in the form of the gravitational potential
due to a distribution of matter of density $(P + \beta \theta)/(\lambda + 2G)$, i.e.,

(7)    $\phi^{(p)}(x_1, x_2, x_3) = -\frac{1}{4\pi (\lambda + 2G)} \iiint \frac{1}{r} [P(x') + \beta \theta(x')]\, dx_1'\, dx_2'\, dx_3' ,$
where

(8)    $r = [(x_1 - x_1')^2 + (x_2 - x_2')^2 + (x_3 - x_3')^2]^{1/2} .$

Once $\phi^{(p)}$ is determined, the boundary conditions for the complementary
function $\phi^{(c)}$ can be derived.

There are problems which cannot be solved by the method of scalar
potential. However, the particular solution can still be used. From
$\phi^{(p)}$, the displacements $u_i^{(p)}$ are computed according to Eq. (1). Then
we put

(9)    $u_i = u_i^{(c)} + u_i^{(p)} .$

The equations governing $u_i^{(c)}$ are the homogeneous equations

(10)    $(\lambda + G) \frac{\partial e^{(c)}}{\partial x_i} + G \nabla^2 u_i^{(c)} = 0 , \qquad e^{(c)} = \frac{\partial u_j^{(c)}}{\partial x_j} , \quad i = 1, 2, 3 .$

The boundary conditions for $u_i^{(c)}$ must be derived, of course, from the original
boundary conditions by subtracting the contributions of the particular
integral on the boundary.


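Problem 14.7. Consider the special case in which $X_i = 0$. Show that, in
view of the heat conduction equation

$k \nabla^2 \theta = \rho C_v \frac{\partial \theta}{\partial t} ,$

a particular solution $\phi^{(p)}$ of Eq. (4) is

(11)    $\phi^{(p)}(x_1, x_2, x_3; t) = -\frac{k}{\rho C_v} \frac{1 + \nu}{1 - \nu}\, \alpha \int_t^\infty \theta(x_1, x_2, x_3; \xi)\, d\xi ,$

provided that $\theta \to 0$ as $t \to \infty$.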


14.8. PLANE STRAIN 

A long cylindrical body is said to be in the state of plane strain parallel
to the $x$, $y$-plane if the displacement component $w$ vanishes and the
components $u$ and $v$ are functions of $x$, $y$, but not of $z$; and the temperature
distribution is independent of $z$ also. Let the body occupy a domain
$V(x, y)$ with boundary $B$ in the $x$, $y$-plane (Fig. 14.8:1). Then, in $V + B$,

(1)    $u = u(x, y) , \quad v = v(x, y) , \quad w = 0 , \quad \theta = \theta(x, y) .$

It is easily verified that

(2)    $e_{xz} = e_{yz} = e_{zz} = 0 ,$

(3)    $\sigma_{xz} = \sigma_{yz} = 0 ,$

(4)    $\sigma_{zz} = \lambda (e_{xx} + e_{yy}) - \beta \theta = \nu (\sigma_{xx} + \sigma_{yy}) - \alpha E \theta .$

We may derive Duhamel’s analogy explicitly in this two-dimensional case.
For a body in $V + B$, subjected to a temperature rise $\theta(x, y)$, but with zero
body force and zero surface traction over $B_\sigma$, and assigned displacements
$u_i$ on $B_u$, where $B = B_\sigma + B_u$, the stress field is the same as that given by
a superposition of a hydrostatic pressure

(5)    $\sigma_{xx} = \sigma_{yy} = -\beta \theta , \qquad \sigma_{xy} = 0 ,$

with that given by another problem in which the same body in $V + B$ is
kept at the uniform reference temperature, but subjected to a body force
$X = -\beta (\partial\theta/\partial x)$, $Y = -\beta (\partial\theta/\partial y)$, and a tension $\beta \theta$ over $B_\sigma$.

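Let us see how the last-mentioned problem can be solved. The equation
of equilibrium is

(6)    $\frac{\partial \sigma_{xx}}{\partial x} + \frac{\partial \sigma_{xy}}{\partial y} - \beta \frac{\partial \theta}{\partial x} = 0 , \qquad \frac{\partial \sigma_{xy}}{\partial x} + \frac{\partial \sigma_{yy}}{\partial y} - \beta \frac{\partial \theta}{\partial y} = 0 ,$

and the compatibility condition is

(7)    $\frac{\partial^2 e_{xx}}{\partial y^2} + \frac{\partial^2 e_{yy}}{\partial x^2} = 2 \frac{\partial^2 e_{xy}}{\partial x \partial y} ,$

which can be reduced to

(8)    $\nabla^2 (\sigma_{xx} + \sigma_{yy}) = \frac{\beta}{1 - \nu} \nabla^2 \theta .$

This equation can be obtained by substituting $-\beta\, \partial\theta/\partial x$ and $-\beta\, \partial\theta/\partial y$ for
$X$ and $Y$ in Eq. (9.2:10). If we introduce the Airy stress function $\Phi$ so that

(9)    $\sigma_{xx} = \frac{\partial^2 \Phi}{\partial y^2} + \beta \theta , \qquad \sigma_{yy} = \frac{\partial^2 \Phi}{\partial x^2} + \beta \theta , \qquad \sigma_{xy} = -\frac{\partial^2 \Phi}{\partial x \partial y} ,$

then Eq. (6) is identically satisfied and Eq. (7) becomes

(10)    $\nabla^4 \Phi = -\frac{\alpha E}{1 - \nu} \nabla^2 \theta , \qquad \nabla^2 = \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} ,$

where $\alpha = (1 - 2\nu) \beta / E$ is the linear coefficient of expansion. But

(11)    $k \nabla^2 \theta = \rho C_v \frac{\partial \theta}{\partial t} .$

Hence, finally,

(12)    $\nabla^4 \Phi = -\frac{\alpha E \rho C_v}{k (1 - \nu)} \frac{\partial \theta}{\partial t} .$

The details are left to the reader.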


Example 

If a long cylinder with axis $z$ is subjected to steady heat flow with
resulting temperature distribution $\theta(x, y)$ independent of $t$, then $\nabla^2 \theta = 0$
and Eq. (12) implies that

(13)    $\nabla^4 \Phi = 0 .$

Let the boundary surface of the cylinder be unrestrained. According to
the Duhamel analogy, the boundary condition is a tensile traction $\beta \theta$. Hence,
according to Eq. (9), the boundary conditions can be satisfied by taking

(14)    $\frac{\partial^2 \Phi}{\partial x^2} = 0 , \qquad \frac{\partial^2 \Phi}{\partial y^2} = 0 , \qquad \frac{\partial^2 \Phi}{\partial x \partial y} = 0$ on the boundary.

A solution is evidently $\Phi = 0$. For this solution, a superposition of Eqs. (5)
and (9) yields

(15)    $\sigma_{xx} = \sigma_{yy} = \sigma_{xy} = 0 ,$

whereas, from Eq. (4),

(16)    $\sigma_{zz} = -\alpha E \theta .$

If the region $V(x, y)$ occupied by the cylinder is simply connected, and
if plane strain condition can be assumed, we can quote the uniqueness
theorem to say that this is the solution. Hence, we conclude that if a
cylinder is simply connected and unrestrained on the surface, then under
steady two-dimensional heat flow and plane strain condition, there will be
no thermal stress in any surface element parallel to the cylinder axis.

PROBLEMS 

14.8. Plane Stress. A body is said to be in a state of plane stress parallel 
to the xi, X 2 -plane if 

CJ 13 = <723 = <733 = 0 
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so that 

udui l + u 

633 = — - — - 1 - - ati , ei3 = e23 = 0 , 

1 — v axi 1 — 1 / 

where i = 1,2. Prove the following Duhamel analogy (Fig. 14.8:2): 

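[Fig. 14.8:2. Problem I: Heating. Problem II: Nonheating.]

Then

$e_{ij}^{(\mathrm{I})} = e_{ij}^{(\mathrm{II})}, \qquad \sigma_{ij}^{(\mathrm{I})} = \sigma_{ij}^{(\mathrm{II})} - \dfrac{\alpha E\theta}{1-\nu}\,\delta_{ij}.$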


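It is seen that the role of the parameter $\beta = \alpha E/(1 - 2\nu)$ is now played by the
parameter $\alpha E/(1 - \nu)$. If Problem II in plane stress is posed in terms of stresses
and Airy stress functions so that

$\sigma_{11} = \dfrac{\partial^2\Phi}{\partial x_2^2} + \dfrac{\alpha E\theta}{1-\nu}, \quad \sigma_{22} = \dfrac{\partial^2\Phi}{\partial x_1^2} + \dfrac{\alpha E\theta}{1-\nu}, \quad \sigma_{12} = -\dfrac{\partial^2\Phi}{\partial x_1\,\partial x_2},$

then

$\nabla^4\Phi = -\alpha E\,\nabla^2\theta, \qquad \nabla^2 = \dfrac{\partial^2}{\partial x_1^2} + \dfrac{\partial^2}{\partial x_2^2}.$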

14.9. Consider a thin plate parallel to the x, y-plane subjected to a temperature
distribution T(x,y) which is independent of z. The Duhamel analogy can
be applied either in the three-dimensional form or in the plane stress form with
the vanishing of $\sigma_{zz}$, $\sigma_{zx}$, $\sigma_{zy}$ introduced explicitly at the outset. Demonstrate
the similarities and differences of these two analogies.


14.9. AN EXAMPLE — STRESSES IN A TURBINE DISK 

Let the cooled turbine disk which is treated in Sec. 14.5 (see Fig. 14.5:1)
be rotating at an angular velocity $\omega$. The body force per unit volume is
then radial and equal to $\omega^2\rho r$, where $\rho$ is the density of the disk material.
If the thickness of the disk is small compared with its radius R, the axial
stress is negligible. The disk is then in a state of plane stress.

On account of the fact that all stresses are considered to be functions of
r only and the boundary condition $\tau_{rz} = 0$ at r = R, the shearing stress $\tau_{rz}$
vanishes and only the tension strains $e_{rr}$, $e_{\theta\theta}$ and tension stresses $\sigma_{rr}$, $\sigma_{\theta\theta}$ are
to be determined. The elastic displacement has only the radial component
u, which is a function of r. Then

(1)    $e_{rr} = \dfrac{du}{dr}, \qquad e_{\theta\theta} = \dfrac{u}{r},$

(2)    $\dfrac{du}{dr} = \dfrac{1}{E}(\sigma_{rr} - \nu\sigma_{\theta\theta}) + \alpha\theta,$

(3)    $\dfrac{u}{r} = \dfrac{1}{E}(\sigma_{\theta\theta} - \nu\sigma_{rr}) + \alpha\theta.$

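The equation for the equilibrium of stresses and body forces is

(4)    $\dfrac{d\sigma_{rr}}{dr} + \dfrac{\sigma_{rr} - \sigma_{\theta\theta}}{r} + \omega^2\rho r = 0.$

To solve for stresses directly, we eliminate u from Eqs. (2) and (3). Thus,

(5)    $\dfrac{1}{1+\nu}\left(\dfrac{d\sigma_{\theta\theta}}{dr} - \nu\dfrac{d\sigma_{rr}}{dr}\right) = \dfrac{\sigma_{rr} - \sigma_{\theta\theta}}{r} - \dfrac{\alpha E}{1+\nu}\,\dfrac{d\theta}{dr}.$

An elimination of $(\sigma_{rr} - \sigma_{\theta\theta})/r$ from Eqs. (4) and (5) yields

(6)    $\dfrac{d\sigma_{rr}}{dr} + \dfrac{d\sigma_{\theta\theta}}{dr} + \alpha E\dfrac{d\theta}{dr} + (1 + \nu)\rho\omega^2 r = 0.$

But Eq. (4) gives

(7)    $\sigma_{\theta\theta} = \dfrac{d}{dr}(r\sigma_{rr}) + \omega^2\rho r^2.$

Therefore, from Eqs. (6) and (7), the equation for $\sigma_{rr}$ is

(8)    $\dfrac{1}{r^2}\dfrac{d}{dr}\left(r^3\dfrac{d\sigma_{rr}}{dr}\right) + \alpha E\dfrac{d\theta}{dr} + (3 + \nu)\omega^2\rho r = 0.$

On integrating twice, we obtain

(9)    $\sigma_{rr} = -\dfrac{C_1}{2r^2} + C_2 - \alpha E\int_0^r \dfrac{dr}{r^3}\int_0^r r^2\dfrac{d\theta}{dr}\,dr - \dfrac{3+\nu}{8}\,\omega^2\rho r^2.$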

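The integral involving $\theta$ may be simplified by interchanging the order of
integration. Thus

$\int_0^r \dfrac{dr}{r^3}\int_0^r r^2\dfrac{d\theta}{dr}\,dr = \dfrac{1}{r^2}\int_0^r \theta r\,dr - \dfrac{\theta(0)}{2}.$

Hence, Eq. (9) may be written as

(10)    $\sigma_{rr} = -\dfrac{1}{2}\dfrac{C_1}{r^2} + C_2 + \dfrac{\alpha E\theta(0)}{2} - \dfrac{\alpha E}{r^2}\int_0^r \theta r\,dr - \dfrac{3+\nu}{8}\,\omega^2\rho r^2.$

The boundary conditions are that $\sigma_{rr} = 0$ at r = R and $\sigma_{rr}$ remains finite
at r = 0. Hence,

(11)    $\sigma_{rr} = \alpha E\left[\dfrac{1}{R^2}\int_0^R \theta r\,dr - \dfrac{1}{r^2}\int_0^r \theta r\,dr\right] + \dfrac{3+\nu}{8}\,\omega^2\rho(R^2 - r^2).$

Equation (7) then gives

(12)    $\sigma_{\theta\theta} = \alpha E\left[-\theta + \dfrac{1}{R^2}\int_0^R \theta r\,dr + \dfrac{1}{r^2}\int_0^r \theta r\,dr\right] + \omega^2\rho\,\dfrac{(3+\nu)R^2 - (1+3\nu)r^2}{8}.$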
It is interesting to note that the first integral in Eq. (11) is one-half
the average temperature throughout the whole disk. The second integral is
one-half the average temperature in the inner portion of the disk (between
0 and r). Therefore, the thermal stress in the radial direction, i.e., the part
of $\sigma_{rr}$ caused by $\theta(r)$, is proportional to the difference between the average
temperature throughout the whole disk and the average temperature for
the inside portion between 0 and r. The circumferential thermal stress
differs from that in the radial direction by

(13)    $\sigma_{\theta\theta} - \sigma_{rr} = \alpha E\left[\theta_{\mathrm{avg}(0\ \mathrm{to}\ r)} - \theta(r)\right] + \dfrac{(1-\nu)\rho r^2\omega^2}{4}.$
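
As a numerical illustration of Eqs. (11)-(13), the following Python sketch evaluates the two stresses for a prescribed temperature profile. The function name `disk_stresses`, the midpoint-rule quadrature, and the sample material data are assumptions made for this illustration only; they are not part of the analysis above.

```python
def disk_stresses(theta, r, R, alpha, E, nu, rho, omega, n=2000):
    """Return (sigma_rr, sigma_tt) at radius r from Eqs. (11) and (12).

    theta is a callable giving the temperature rise at a radius s.
    """
    def moment(a):
        # midpoint-rule value of the integral of theta(s) * s from 0 to a
        h = a / n
        return h * sum(theta((i + 0.5) * h) * (i + 0.5) * h for i in range(n))

    whole = moment(R) / R**2                                # (1/R^2) int_0^R theta r dr
    inner = moment(r) / r**2 if r > 0 else theta(0.0) / 2   # limiting value as r -> 0
    spin = rho * omega**2 / 8
    s_rr = alpha * E * (whole - inner) + (3 + nu) * spin * (R**2 - r**2)
    s_tt = (alpha * E * (whole + inner - theta(r))
            + spin * ((3 + nu) * R**2 - (1 + 3 * nu) * r**2))
    return s_rr, s_tt

# Hypothetical data: a steel-like disk with a quadratic temperature rise.
R, alpha, E, nu, rho, omega = 0.3, 1.2e-5, 2.0e11, 0.3, 7800.0, 1000.0
theta = lambda s: 100.0 * (s / R) ** 2

rim = disk_stresses(theta, R, R, alpha, E, nu, rho, omega)
mid = disk_stresses(theta, 0.5 * R, R, alpha, E, nu, rho, omega)
print("sigma_rr at the rim:", rim[0])
print("sigma_rr, sigma_tt at r = R/2:", mid)
```

The radial stress vanishes at the rim, as the boundary condition requires, and the difference of the two returned stresses can be compared directly with Eq. (13).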

For the particular problem of the turbine disk treated in Sec. 14.5, the
temperature is given by Eq. (14.5:11). Since

$\int_0^z z I_0(z)\,dz = z I_1(z),$


we have 


and 


- r = A ^ hW^/kbr) 
r 2 7o ? ? € 2KrI 1 (^/kbR) 


hi 

With these relations, the thermal stresses can be easily calculated. 


AA 

2k R

14.10. VARIATIONAL PRINCIPLE FOR UNCOUPLED
THERMOELASTICITY


In uncoupled thermoelasticity theory, the temperature field is determined
by heat conduction and the influence of the latent heat due to change
of strain is ignored. Variational principles analogous to those of Chapter 10
can be derived. In fact, the only difference between the linear elasticity
of Chapters 7-9, which is valid in isothermal or isentropic conditions, and
the thermoelasticity of the present chapter lies in the stress-strain law.
Furthermore, in linear thermoelasticity concerning infinitesimal
displacement, even this difference in the stress-strain law is hidden in the
following expressions, which apply to both elasticity and thermoelasticity:

(1)    $\dfrac{\partial W}{\partial e_{ij}} = \sigma_{ij}, \qquad \dfrac{\partial W_c}{\partial\sigma_{ij}} = e_{ij}.$

Here $W(e_{ij}, T)$ and $W_c(\sigma_{ij}, T)$ are the strain energy function and the
complementary strain energy function, respectively. $W(e_{ij}, T)$ is expressed in
strain components; $W_c(\sigma_{ij}, T)$ is expressed in stress components; in linear
theory they are equal: $W = W_c$. For isothermal or isentropic elasticity
(Chapter 7) W is given by Eq. (12.6:3). For general thermoelasticity
(Chapter 14), W is given by Eq. (12.7:2) or (12.7:11).

With these remarks we may derive the following theorems. 

Let $\mathcal{U}$ be the total strain energy of a body which occupies a region V
with boundary B,

(2)    $\mathcal{U} = \int_V W\,dv,$

where $W(e_{ij}, T)$ is given by Eq. (12.7:2).

Let the body be subjected to a body force $X_i$ per unit volume in V,
and surface tractions $T_i$ on $B_\sigma$ (in B). Both $X_i$ and $T_i$ are assumed to
be functions of time and space not influenced by the small elastic
displacements. Let $\mathcal{K}$ be the kinetic energy of the body and $\mathcal{A}$ be the potential
of the external loading:

(3)    $\mathcal{K} = \dfrac{1}{2}\int_V \dfrac{\partial u_i}{\partial t}\dfrac{\partial u_i}{\partial t}\,\rho\,dv,$

(4)    $\mathcal{A} = -\int_V X_i u_i\,dv - \int_{B_\sigma} T_i u_i\,dS.$

Then Hamilton's principle states that of all displacements $u_i$ that satisfy
the boundary condition that

(5)    $u_i = g_i(x_1, x_2, x_3)$  over  $B_u = B - B_\sigma,$

the one that also satisfies the equations of motion minimizes the integral

(6)    $\int_{t_0}^{t_1} (\mathcal{U} - \mathcal{K} + \mathcal{A})\,dt = \text{minimum},$

where $t_0$, $t_1$ are two arbitrary instants, and where the admissible variations
$\delta u_i$ are triply differentiable and satisfy the conditions

(7)    $\delta u_i = 0$  on $B_u$ in $t_0 \le t \le t_1,$

(8)    $\delta u_i = 0$  at $t = t_0$ and $t = t_1$ in V + B.

If we consider static equilibrium, then the principle of minimum potential
energy for thermoelasticity states that of all displacements that are
continuous and triply differentiable in a body V + B, and satisfying the
specified boundary displacements $g_i$ over the surface $B_u$, the one that also
satisfies the condition of equilibrium minimizes the potential

(9)    $\int_V W(e_{ij}, T)\,dv - \int_V X_i u_i\,dv - \int_{B_\sigma} T_i u_i\,dS = \text{minimum}.$

Similarly, on varying the stresses $\sigma_{ij}$, the complementary energy theorem
in thermoelasticity states that the stress system $\sigma_{ij}$ that satisfies the
equations of elasticity minimizes the complementary energy with respect to
variations of $\sigma_{ij}$:

(10)    $\int_V W_c(\sigma_{ij}, T)\,dv - \int_V X_i u_i\,dv - \int_{B_u} T_i u_i\,dS = \text{minimum}.$

$W_c = -\rho\Phi$ is given by Eq. (12.8:15). The variations $\delta\sigma_{ij}$ are subjected to
the conditions $(\delta\sigma_{ij})_{,j} = 0$ in V and $\delta\sigma_{ij}\nu_j = 0$ over $B_\sigma$, where surface
tractions are specified.

Problem 14.10. Derive Euler equations for the functionals in Eqs. (6), (9), 
and (10) and compare the results with the basic equations of Sec. 14.1. 

14.11. VARIATIONAL PRINCIPLE FOR HEAT CONDUCTION 

The basic equations for heat conduction, Eqs. (14.1:8) and (14.1:9), are

(1)    $h_i = -k_{ij}\dfrac{\partial\theta}{\partial x_j}, \qquad i = 1,2,3,$

(2)    $\dfrac{\partial h_i}{\partial x_i} = -\rho C_v\dfrac{\partial\theta}{\partial t},$

where $C_v$ is the specific heat per unit mass at constant volume, $\rho$ is the
density, $k_{ij}$ are the coefficients of heat conduction, $\theta = T - T_0$ is the
difference of local temperature T from a uniform reference temperature $T_0$,
and h $(h_1, h_2, h_3)$ is the heat flux vector per unit area. A problem in heat
conduction for a body occupying a volume V with boundary B is to find a
continuously differentiable function $\theta(x_1, x_2, x_3; t)$ which satisfies Eqs. (1)
and (2), the boundary conditions

(3)    $\theta = \theta_0(x, t)$  on $B_1$,

(4)    $h_n = k(x, t)[\theta_0(x, t) - \theta(x, t)]$  on $B - B_1$,

and the initial condition at t = 0

(5)    $\theta = \theta_0(x, 0)$  in V and B,

where $h_n$ is the component of the heat flux vector h in the direction of
the outer normal to the boundary, i.e., $h_n = h_i\nu_i$, and $\theta_0$ is an assigned
temperature. Equation (4) represents Newton's law of surface heat transfer,
$\theta_0(x, t)$ is the temperature just outside the boundary layer, and k is the heat
transfer coefficient, which usually depends on the space coordinate x.

We shall now prove Biot's variational principle which states that if
$C_v$ is a constant, then Eqs. (1)-(4) are the necessary conditions for the
following variational equation to hold for an arbitrary variation of the
vector $H_i$:

(6)    $\delta\int_V \dfrac{\rho C_v\theta^2}{2}\,dv + \int_V \lambda_{ij}\dot{H}_j\,\delta H_i\,dv + \int_{B_1} \theta_0\,\delta H_i\,\nu_i\,dS = 0,$

under the stipulations that

(7)    $\rho C_v\theta = -H_{i,i}$

and that $\delta H_i = 0$ over the boundary $B - B_1$, where heat flux is so specified
that $\partial H_i/\partial t = h_i$ satisfies Eq. (4). The matrix $(\lambda_{ij})$, called the thermal
resistivity matrix, is the inverse of the conductivity matrix $(k_{ij})$:


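(8)    $(\lambda_{ij}) = (k_{ij})^{-1}.$

Proof. First, we have

$\delta\int_V \dfrac{\rho C_v\theta^2}{2}\,dv = \delta\int_V \dfrac{1}{2\rho C_v}H_{i,i}H_{j,j}\,dv$    (by substitution)

$= \int_V \dfrac{1}{\rho C_v}H_{i,i}\,\delta H_{j,j}\,dv$    (i, j being dummy indices)

$= \int_B \dfrac{1}{\rho C_v}H_{i,i}\,\delta H_j\,\nu_j\,dS - \int_V \left(\dfrac{1}{\rho C_v}H_{i,i}\right)_{,j}\delta H_j\,dv$    (by Gauss' theorem)

$= -\int_B \theta\,\delta H_j\,\nu_j\,dS + \int_V \theta_{,j}\,\delta H_j\,dv.$

Hence, Eq. (6) is

$0 = \int_V (\theta_{,i} + \lambda_{ij}\dot{H}_j)\,\delta H_i\,dv - \int_{B_1} (\theta - \theta_0)\,\delta H_i\,\nu_i\,dS - \int_{B - B_1} \theta\,\delta H_i\,\nu_i\,dS.$

To satisfy this equation by arbitrary $\delta H_i$ in V and on $B_1$, the necessary
conditions are

$\theta_{,i} + \lambda_{ij}\dot{H}_j = 0$, or $\dot{H}_j = -k_{ij}\theta_{,i}$  in V,

$\theta = \theta_0$  on $B_1$,

$\delta H_j\nu_j = 0$  on $B - B_1$.

The first equation, together with Eq. (7), is exactly Eq. (1). The second
equation is the same as Eq. (3); the last equation is in accordance with the
stipulation stated. Q.E.D.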

Applications of this variational principle will be shown later in
Sec. 14.13. A few remarks will be made here. First, the vector $H_i$ is related
to the heat flux $h_i$:

(9)    $h_i = \dot{H}_i.$

If we impose the condition that

(10)    $H_i = 0$  when $\theta = 0$,

then Eqs. (2) and (7) are consistent since $C_v$ is a constant. Comparing
Eq. (9) with Eq. (13.1:3), it is seen that $H_i$ is proportional to the entropy
displacement introduced by Biot.

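Second, the special way in which the second term in Eq. (6) is posed
should be noted. Biot denotes this term by $\delta D$ and calls it the variation
of a dissipation function. The justification of this terminology becomes
clear when generalized coordinates are introduced. Let $q_1, \ldots, q_n$ be the
generalized coordinates so that

(11)    $H_j = H_j(q_1, q_2, \ldots, q_n; x_1, x_2, x_3).$

Then

$\dot{H}_j = \dfrac{\partial H_j}{\partial q_k}\,\dot{q}_k.$

Hence,

(12)    $\dfrac{\partial\dot{H}_j}{\partial\dot{q}_k} = \dfrac{\partial H_j}{\partial q_k}.$

Therefore,

$\int_V \lambda_{ij}\dot{H}_j\,\delta H_i\,dv = \delta q_k\int_V \lambda_{ij}\dot{H}_j\dfrac{\partial H_i}{\partial q_k}\,dv = \delta q_k\int_V \lambda_{ij}\dot{H}_j\dfrac{\partial\dot{H}_i}{\partial\dot{q}_k}\,dv = \delta q_k\dfrac{\partial}{\partial\dot{q}_k}\,\dfrac{1}{2}\int_V \lambda_{ij}\dot{H}_i\dot{H}_j\,dv.$

It is then clear that if we define a dissipation function

(13)    $D = \dfrac{1}{2}\int_V \lambda_{ij}\dot{H}_j\dot{H}_i\,dv,$

then

(14)    $\int_V \lambda_{ij}\dot{H}_j\,\delta H_i\,dv = \dfrac{\partial D}{\partial\dot{q}_k}\,\delta q_k.$

The application of generalized coordinates will be illustrated in Sec. 14.13.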

It may appear unnatural to introduce the vector field $H_i$ rather than
vary the temperature $\theta$ directly. A little reflection will show, however, that
the thermal evolution is determined by the heat flow. The three components
of the heat flow vector are capable of independent variations. Hence, as a
proper choice of variables, we use the components of the heat flow vector.
Equivalently, the three components of the temperature gradient may be
used also, but then the analysis will be purely formal; the functionals do
not have as simple an interpretation as those encountered above.

14.12. COUPLED THERMOELASTICITY 

A variational equation corresponding to the basic equations of Sec. 14.1
is given by Biot under the assumptions that the elastic constants $C_{ijkl}$, the
thermal stress constants $\beta_{ij}$, the specific heat per unit mass at constant
strain $C_v$, and the heat conductivity matrix $(k_{ij})$ or its inverse $(\lambda_{ij}) =
(k_{ij})^{-1}$, are independent of the temperature and time, and that the
temperature change $\theta$ is small compared with $T_0$, the absolute temperature of
the reference state. Under these assumptions, Eq. (14.1:17) can be
integrated to give

(1)    $-H_{i,i} = \rho C_v\theta + T_0\beta_{ij}e_{ij},$

where

$H_i = 0$  when $\theta = e_{ij} = 0$.

The vector $H_i$ so defined is proportional to the entropy displacement. Biot's
principle now states that the thermoelastic equations are necessary
conditions for the following variational equation to hold for arbitrary $\delta u_i$ and
$\delta H_i$, except that $\delta u_i = 0$ over the portion of the boundary where the
displacement $u_i$ is prescribed, and $\delta H_i = 0$ where the heat flow is prescribed:

(2)    $\delta\mathcal{V} + \delta\mathcal{D} = \int_V (X_i - \rho\ddot{u}_i)\,\delta u_i\,dv + \int_B \left(T_i\,\delta u_i - \dfrac{\theta_0}{T_0}\,\nu_i\,\delta H_i\right)dS,$

where

(3)    $\mathcal{V} = \int_V \left(\dfrac{1}{2}C_{ijkl}e_{ij}e_{kl} - \beta_{ij}e_{ij}\theta + \dfrac{1}{2}\dfrac{\rho C_v\theta^2}{T_0}\right)dv,$

(4)    $\delta\mathcal{D} = \int_V \dfrac{1}{T_0}\,\lambda_{ij}\dot{H}_j\,\delta H_i\,dv,$

$\theta_0$ is the boundary value of $\theta$, $\nu_i$ are the direction cosines of the outward
normal to the surface B (see Sec. 14.11), and $\mathcal{V}$ is the thermoelastic potential
of Biot. On solving Eq. (1) for $\theta$, substituting $\theta$ and $e_{ij} = \frac{1}{2}(u_{i,j} + u_{j,i})$ into
Eq. (3), and varying $u_i$ and $H_i$, the statement above can be verified readily.
We may also use the method of Lagrangian multipliers as in Sec. 14.10.
The details are left to the reader.

Note that if Eq. (2) is integrated with respect to time over an arbitrary
interval $(t_0, t_1)$, under the stipulation that $\delta u_i = 0$ at $t = t_0, t_1$, the
term involving the inertia force $-\rho\ddot{u}_i$ can be expressed as $-\delta\int_{t_0}^{t_1}\mathcal{K}\,dt$, where $\mathcal{K}$
is the kinetic energy. Then a formula similar to Hamilton's principle,
now including a dissipation function, is obtained.

Biot$^{14.4}$ derived the expressions above for $\mathcal{V}$ and $\mathcal{D}$ from a reasoning
which has been discussed in Sec. 13.5. For small deviations from
equilibrium, the equation of evolution (13.5:13) is a necessary condition for the
variational equation

(5)    $\left(\dfrac{\partial\mathcal{V}}{\partial q_i} + \dfrac{\partial\mathcal{D}}{\partial\dot{q}_i} - Q_i\right)\delta q_i = 0$

to hold for arbitrary variations $\delta q_i$. It was shown in Sec. 13.5 that $\mathcal{V}$ is
the sum of $\mathcal{E} - T_0\mathcal{S}$ over the entire system, and $2\mathcal{D}/T_0$ is equal to the
rate of entropy production. Although no heat input on the boundary was
considered in Sec. 13.5, the generalization of $Q_i$ to include heat conduction
on the boundary is justified by the theorem just proved above.

Let us now evaluate $\mathcal{V} = \int \rho(\mathcal{E} - T_0\mathcal{S})\,dv$ when a temperature
increment $\theta(x_1, x_2, x_3; t)$ is imposed on the body without changing any other
state variables. The absolute temperature distribution is $T_0 + \theta$. To change
the temperature by $d\theta$ at a particular point requires an amount of heat
$\rho C_v\,d\theta$ per unit volume, where $C_v$ is the specific heat per unit mass at
constant strain. Hence, the change of internal energy per unit volume is
$\rho C_v\,d\theta$, the change in complementary work is $-\beta_{ij}e_{ij}\,d\theta$, and the change of
entropy per unit volume is $\rho C_v\,d\theta/(T_0 + \theta)$. Thus, the imposition of a
temperature increment $\theta$ changes $\mathcal{V}$ by

(6)    $\mathcal{V}_c = \int_V dv\left(\int_0^\theta \rho C_v\,d\theta - T_0\int_0^\theta \dfrac{\rho C_v\,d\theta}{T_0 + \theta} - \int_0^\theta \beta_{ij}e_{ij}\,d\theta\right)$

$\qquad = \int_V dv\left(\int_0^\theta \dfrac{\rho C_v\theta}{T_0 + \theta}\,d\theta - \int_0^\theta \beta_{ij}e_{ij}\,d\theta\right).$

If $\theta \ll T_0$, then

(7)    $\mathcal{V}_c \doteq \int_V \left(\dfrac{1}{2}\dfrac{\rho C_v\theta^2}{T_0} - \beta_{ij}e_{ij}\theta\right)dv.$

The total value of $\mathcal{V}$ is obtained by adding to $\mathcal{V}_c$ the value of $\mathcal{V}$ at the
constant temperature $T_0$. The latter is the classical Helmholtz free energy
at $T_0$. The term $\mathcal{V}_c$ is familiar to power engineering: it represents the
heat that may be transformed into useful work. Indeed, $\mathcal{V}_c$ is an integral
of the heat $\rho C_v\,d\theta$ multiplied by the Carnot efficiency $\theta/(T_0 + \theta) \doteq \theta/T_0$,
integrated first with respect to $\theta$ from 0 to $\theta$, then over the region V.

The special treatment of the dissipation function $\mathcal{D}$ in the variational
equations should be noted. The variational invariant $\delta\mathcal{D}$ was defined
directly. When the equations of motion are written in the Lagrangian form,
Eq. (5), $\mathcal{D}$ can be identified as a functional which is equal to $T_0/2$ times the
rate of entropy production of the entire system [see Eq. (13.5:11), assuming
$\theta \ll T_0$]. From Eq. (13.2:6), the entropy production per unit volume is
$-h_jT_{,j}/T_0^2$, where $h_j$ is the heat flux and $T_{,j}$ is the temperature gradient.
On introducing Fourier's law of heat conduction in the form $T_{,j} = -\lambda_{jl}h_l$,
the rate of entropy production per unit volume is $\lambda_{jl}h_jh_l/T_0^2$. Hence,

(8)    $\mathcal{D} = \dfrac{1}{2T_0}\int_V \lambda_{ij}h_jh_i\,dv.$

In Sec. 14.11, for the boundary condition over $B - B_1$, where the wall
temperature is unspecified, it is required that the heat flow be specified
exactly. For a variational approach, this boundary condition can be relaxed
by including in the dissipation function a term corresponding to entropy
production at the boundary. Let there be a heat flux per unit area $h_j\nu_j$
leaving the body at a boundary, where $\nu_j$ is the direction cosine of the
outer normal to the surface. Let $T_a$ be the temperature outside the heat
transfer layer at the wall and T be that of the body inside. (For example,
$T_a$ represents the adiabatic wall temperature in the boundary-layer flow in
aerodynamic heating.) If $T_a \ne T$, the entropy production per unit wall
area is $-h_j\nu_j(T^{-1} - T_a^{-1})$. A phenomenological law may be posed as

( 9 ) = = 

Therefore, the contribution of heat transfer at the boundary to the
dissipation function is



If $\mathcal{D}_b$ is added to $\mathcal{D}$ from Eq. (8), we have


(H) ® = X ^ hldv + y / fl ’ 

which must be used if the heat flow at the wall is to be considered as a
natural boundary condition. When the expression (11) is used, the variation
$\delta H_i$ can be regarded as arbitrary over the entire boundary.

14.13. LAGRANGIAN EQUATIONS FOR HEAT
CONDUCTION AND THERMOELASTICITY

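When n generalized coordinates $q_1, q_2, \ldots, q_n$ are introduced to
represent the displacement vector and heat flow vector by expressions

(1)    $u_i = u_i(q_1, q_2, \ldots, q_n; x_1, x_2, x_3), \qquad i = 1,2,3,$

$\qquad H_i = H_i(q_1, q_2, \ldots, q_n; x_1, x_2, x_3), \qquad i = 1,2,3,$

the variational principles of the preceding section may be written as

(2)    $\dfrac{\partial\mathcal{V}}{\partial q_i} + \dfrac{\partial\mathcal{D}}{\partial\dot{q}_i} = Q_i, \qquad i = 1, 2, \ldots, n,$

where $\mathcal{V}$ and $\mathcal{D}$ are given by Eqs. (14.12:3) and (14.12:8) respectively, and

(3)    $Q_i = \int_V (X_j - \rho\ddot{u}_j)\dfrac{\partial u_j}{\partial q_i}\,dv + \int_B \left(T_j\dfrac{\partial u_j}{\partial q_i} - \dfrac{\theta_0}{T_0}\,\nu_j\dfrac{\partial H_j}{\partial q_i}\right)dS.$

The applications of these Lagrangian equations are well-known for the stress
problems (Chapters 10 and 11). In what follows we shall illustrate their
application to heat conduction problems.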


Example 1. Prescribed Wall Temperature

Consider a semi-infinite homogeneous, isotropic solid, with constant
parameters k and $C_v$, initially at a uniform temperature $\theta = 0$
(Fig. 14.13:1). The boundary at x = 0 is heated to a variable temperature
$\theta_0(t)$. On ignoring the inertia force and thermoelastic coupling, it is
desired to find the temperature distribution in the half-space.

We propose to solve this problem approximately. Following Biot$^{14.4}$
(1957), we assume the temperature distribution to be parabolic and
represented by

(4)    $\theta(x, t) = \theta_0(t)\left[1 - \dfrac{x}{q_1(t)}\right]^2$  for $0 \le x \le q_1$,  $\qquad \theta(x, t) = 0$  for $x > q_1$.

Fig. 14.13:1. Penetration of heat in one-dimensional flow.

The function $q_1(t)$ can be interpreted physically as a "penetration depth,"
and will be taken as our generalized coordinate (see Fig. 14.13:1).

The heat flow vector $H_i$ is defined by Eqs. (14.11:7), (14.11:9), and
(14.11:10):

(5)    $\dot{H}_i = h_i, \qquad -H_{i,i} = \rho C_v\theta.$

In the one-dimensional problem posed here, we may assume that only one
component $H_1$ is different from zero and that $H_1$ depends on the single space
coordinate x. Hence, from Eq. (5),

(6)    $\dfrac{\partial H_1}{\partial x} = -\rho C_v\theta.$

For $x > q_1$, thermal equilibrium is undisturbed so that $H_1 = 0$. Hence,
Eqs. (6) and (4) can be integrated to give

(7)    $H_1(x, t) = \rho C_v\int_x^{q_1}\theta\,dx = \rho C_v\theta_0\left(\dfrac{q_1}{3} - x + \dfrac{x^2}{q_1} - \dfrac{x^3}{3q_1^2}\right).$

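To evaluate $\mathcal{V}$ and $\mathcal{D}$ it is sufficient to consider a semi-infinite cylinder of
unit cross section with axis parallel to x. Hence,

(8)    $\mathcal{V} = \dfrac{1}{2T_0}\int_0^{q_1}\rho C_v\theta^2\,dx = \dfrac{\rho C_v\theta_0^2 q_1}{10T_0}, \qquad \mathcal{D} = \dfrac{1}{2kT_0}\int_0^{q_1}\dot{H}_1^2\,dx,$

which, from Eq. (7), is

(9)    $\mathcal{D} = \dfrac{\rho^2C_v^2}{2kT_0}\int_0^{q_1}\left[\dot{\theta}_0\left(\dfrac{q_1}{3} - x + \dfrac{x^2}{q_1} - \dfrac{x^3}{3q_1^2}\right) + \theta_0\dot{q}_1\left(\dfrac{1}{3} - \dfrac{x^2}{q_1^2} + \dfrac{2x^3}{3q_1^3}\right)\right]^2 dx$

$\qquad = \dfrac{\rho^2C_v^2}{2kT_0}\,q_1\left[\dfrac{13}{315}\,\theta_0^2\dot{q}_1^2 + \dfrac{1}{21}\,q_1\dot{q}_1\theta_0\dot{\theta}_0 + \dfrac{1}{63}\,q_1^2\dot{\theta}_0^2\right].$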
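
The numerical coefficients that arise when the square in Eq. (9) is integrated can be checked exactly with rational arithmetic. The sketch below is only an illustrative check: the helper names `poly_mul` and `integrate_01` and the substitution $s = x/q_1$ are assumptions introduced here, not part of the original derivation.

```python
from fractions import Fraction

def poly_mul(p, q):
    """Multiply two polynomials given as coefficient lists (lowest power first)."""
    out = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out

def integrate_01(p):
    """Exact integral of the polynomial over 0 <= s <= 1."""
    return sum(c / (k + 1) for k, c in enumerate(p))

# With s = x/q1, Eq. (7) gives
#   dH1/dt = rho*C_v*(q1*dtheta0/dt*a(s) + theta0*dq1/dt*b(s)).
a = [Fraction(1, 3), Fraction(-1), Fraction(1), Fraction(-1, 3)]   # 1/3 - s + s^2 - s^3/3
b = [Fraction(1, 3), Fraction(0), Fraction(-1), Fraction(2, 3)]    # 1/3 - s^2 + 2 s^3/3

c_aa = integrate_01(poly_mul(a, a))      # multiplies q1^2 * (dtheta0/dt)^2
c_ab = 2 * integrate_01(poly_mul(a, b))  # multiplies q1 * dq1/dt * theta0 * dtheta0/dt
c_bb = integrate_01(poly_mul(b, b))      # multiplies theta0^2 * (dq1/dt)^2
print(c_aa, c_ab, c_bb)                  # prints 1/63 1/21 13/315
```

Each integral carries an additional factor $q_1$ from the change of variable $dx = q_1\,ds$, which is the factor $q_1$ standing in front of the bracket in Eq. (9).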


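The generalized force $Q_1$ is, according to Eq. (3),

(10)    $Q_1 = \dfrac{\theta_0}{T_0}\left.\dfrac{\partial H_1}{\partial q_1}\right|_{x=0} = \dfrac{\rho C_v\theta_0^2}{3T_0}.$

Hence, the Lagrangian equation for heat conduction (2) is

(11)    $\dfrac{\rho C_v\theta_0^2}{10T_0} + \dfrac{\rho^2C_v^2}{2kT_0}\,q_1\left[\dfrac{26}{315}\,\dot{q}_1\theta_0^2 + \dfrac{1}{21}\,q_1\dot{\theta}_0\theta_0\right] = \dfrac{\rho C_v\theta_0^2}{3T_0}.$

Let

(12)    $z = q_1^2.$

Then Eq. (11) reduces to

(13)    $\dot{z} + \dfrac{15}{13}\dfrac{\dot{\theta}_0}{\theta_0}\,z = \dfrac{147}{13}\dfrac{k}{\rho C_v}.$

This is a standard differential equation with an integrating factor

$\exp\left[\int \dfrac{15}{13}\dfrac{\dot{\theta}_0}{\theta_0}\,dt\right] = [\theta_0(t)]^{15/13},$

i.e.,

$\dfrac{d}{dt}\left\{z\,[\theta_0(t)]^{15/13}\right\} = \dfrac{147}{13}\dfrac{k}{\rho C_v}\,[\theta_0(t)]^{15/13}.$

Hence, on noting the initial condition $q_1 = z = 0$ when t = 0, we obtain
the solution

(14)    $q_1^2 = z = \dfrac{147}{13}\dfrac{k}{\rho C_v}\,[\theta_0(t)]^{-15/13}\int_0^t [\theta_0(\tau)]^{15/13}\,d\tau.$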

If it is assumed that the wall temperature follows a power law,

(15)    $\theta_0(t) = at^n, \qquad n \ge 0,$

we find

(16)    $q_1^2 = z = \dfrac{147}{13}\dfrac{k}{\rho C_v}\,\dfrac{t}{\frac{15}{13}n + 1}.$

This shows that the penetration depth $q_1$ varies with $\sqrt{t}$. It is independent
of a and depends on the exponent n only through a constant factor
$[\frac{15}{13}n + 1]^{-1/2}$. The case where $\theta_0$ = const. corresponds to n = 0 and yields
the result

(17)    $q_1 = 3.36\left(\dfrac{kt}{\rho C_v}\right)^{1/2}.$
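
Equation (14) is easy to evaluate numerically for any wall-temperature history. The following sketch assumes a midpoint-rule quadrature; the function name `penetration_depth`, the symbol `kappa` for $k/(\rho C_v)$, and the sample numbers are hypothetical choices made for illustration.

```python
import math

def penetration_depth(theta0, t, kappa, n=20000):
    """Penetration depth q1(t) from Eq. (14), with kappa = k/(rho*C_v)."""
    p = 15.0 / 13.0
    h = t / n
    # midpoint rule for the integral of theta0(tau)**(15/13) over 0..t
    integral = h * sum(theta0((i + 0.5) * h) ** p for i in range(n))
    z = (147.0 / 13.0) * kappa * integral / theta0(t) ** p
    return math.sqrt(z)

kappa, t = 1.0e-5, 50.0   # hypothetical diffusivity (m^2/s) and time (s)

q_const = penetration_depth(lambda tau: 80.0, t, kappa)             # n = 0
q_power = penetration_depth(lambda tau: 3.0 * tau ** 2, t, kappa)   # n = 2

coeff_const = q_const / math.sqrt(kappa * t)
coeff_power = q_power / math.sqrt(kappa * t)
print(round(coeff_const, 2))   # prints 3.36, the factor in Eq. (17)
print(coeff_power)             # compare with sqrt(147/43) from Eq. (16) with n = 2
```

Neither coefficient depends on the amplitude of the wall temperature, in agreement with the remark that $q_1$ is independent of a.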

Comparison of this simple solution, with only one generalized coordinate,
with the exact solution of the problem was made by Biot$^{14.4}$ (1957),
who showed that the approximation (4) is valid if the temperature increases
or decreases monotonically. Biot also points out that if this is not the case,
one may split up the time interval into segments for which $\theta_0$ varies
monotonically, and then apply Eq. (13) to each segment, using the principle of
superposition. Use can also be made of the power law solution (16) by
dividing the time history of the temperature into segments, each of which
may be approximated by a power law, or by an additive combination of
such terms, including the constant value.

Example 2. Prescribed Heat Flux



Fig. 14.13:2. Heating of a half-space with prescribed heat flux.

The semi-infinite solid described in Example 1 is heated at the wall
x = 0 with a uniform, constant heat flux $h_x = F$ (see Fig. 14.13:2)
(Lardner, T. J., AIAA J. 1, 1, 1963, p. 196-206).

The problem is one-dimensional with one component of the vector
H, which will be written as H. Then

(18)    $\dfrac{\partial H}{\partial x} = -\rho C_v\theta.$

Again, we assume a parabolic temperature distribution

(19)    $\theta(x, t) = A(t)\left[1 - \dfrac{x}{q_1(t)}\right]^2,$

with $q_1$ representing a "penetration depth." The function A(t) must be
so chosen to satisfy the boundary condition at x = 0. An integration of
Eq. (18) gives

(20)    $H = \dfrac{\rho C_vAq_1}{3}\left(1 - \dfrac{x}{q_1}\right)^3,$

whence

(21)    $\dot{H} = \dfrac{\rho C_v}{3}\dot{A}q_1\left(1 - \dfrac{x}{q_1}\right)^3 + \dfrac{\rho C_v}{3}A\dot{q}_1\left(1 - \dfrac{x}{q_1}\right)^3 + xA\,\dfrac{\rho C_v\dot{q}_1}{q_1}\left(1 - \dfrac{x}{q_1}\right)^2.$

At x = 0, the boundary condition $h_x = F$ requires that

$(\dot{A}q_1 + \dot{q}_1A)\,\dfrac{\rho C_v}{3} = F,$

or

(22)    $A = \dfrac{3Ft}{\rho C_vq_1}.$

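On substituting this expression for A into Eqs. (19) and (20) and then
computing the thermal potential $\mathcal{V}$ and dissipation function $\mathcal{D}$ according
to Eqs. (14.12:3) and (14.12:11), we obtain

$\dfrac{\partial\mathcal{V}}{\partial q_1} = -\dfrac{9F^2t^2}{10\rho C_vq_1^2}, \qquad \dfrac{\partial\mathcal{D}}{\partial\dot{q}_1} = \dfrac{F^2t}{k}\left(\dfrac{3\dot{q}_1t}{35q_1} + \dfrac{3}{42}\right), \qquad Q_1 = 0,$

in which the common factor $1/T_0$ has been dropped because it cancels in
Eq. (2). The governing differential equation (2) becomes

(23)    $\dfrac{3}{35}\,q_1\dot{q}_1t + \dfrac{3}{42}\,q_1^2 = \dfrac{9}{10}\dfrac{k}{\rho C_v}\,t.$

The solution for the penetration depth is

(24)    $q_1 = 2.81\left(\dfrac{kt}{\rho C_v}\right)^{1/2}.$

The surface temperature is

(25)    $\theta_0 = \dfrac{3Ft}{\rho C_vq_1} = 1.065\,\dfrac{F}{k}\left(\dfrac{kt}{\rho C_v}\right)^{1/2}.$

The exact solution is known to be

(26)    $\theta_0 = 1.128\,\dfrac{F}{k}\left(\dfrac{kt}{\rho C_v}\right)^{1/2}.$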

PROBLEMS 

14.11. Consider a slab of thickness b, heated on the surface x = 0. The initial
condition at t = 0 is $\theta = 0$ throughout the plate. The boundary condition at the
surface x = 0 is $\theta = \theta_0(t)$ for t > 0 (same as in Example 1), and that at x = b is
$h_x = 0$, i.e., $\partial\theta/\partial x = 0$. At a certain time $t_0$, the penetration depth $q_1$ will be equal to b.
For time $t > t_0$, the temperature profile will be as shown in Fig. P14.11. Obtain
an approximate solution by taking <73 as the generalized coordinate. 

14.12. Consider the same problem as in Problem 14.11 except that the
boundary condition at x = 0 is, as shown in Fig. P14.12, $h_x = F$, a constant (same
as in Example 2).

[Figs. P14.11 and P14.12]

14.13. A long, circular, cylindrical, solid-fuel rocket, upon curing, releases
its heat of polymerization throughout its mass at a steady rate, say, $Q_0$. The
external surfaces may be assumed held at constant temperature $T_0$. Assuming
that the cylinder is so long that over the body of the cylinder the heat flow is
essentially two-dimensional, determine approximately the temperature distribution
in the cross section of the cylinder.

Consider other cylindrical rockets whose cross-sectional shapes are (1) an 
isosceles right triangle, (2) an equilateral triangle, (3) a square. Let all of these 
rockets be made of the same material and have the same cross-sectional area. 
Compare the steady-state temperatures at the centroids of these rockets.

VISCOELASTICITY


In this chapter we shall generalize the ideas discussed in Sec. 1.2 to deal with 
a three-dimensional continuum. It is popular to call a material that obeys 
a linear hereditary law viscoelastic, although the logic of this terminology 
is by no means certain. The origin of the term lies in the simple models 
such as those of Maxwell, Voigt, or the standard linear solid, which are 
built of springs and dashpots. The springs are elastic, the dashpots are 
viscous; hence the name. Such an etymology is entirely different from 
that of viscoplasticity, hyperelasticity, hypoelasticity, elastic-ideally plastic 
materials, etc. However, the terminology of identifying a material obeying 
a linear hereditary law as viscoelastic is popular and well-accepted. 


15.1. VISCOELASTIC MATERIAL 

Let a rectangular Cartesian frame of reference be chosen, and let the
position vector of a point be denoted by $(x_1, x_2, x_3)$. A function of position
$f(x_1, x_2, x_3)$ will be written as f(x) for short. Let $\sigma_{ij}$ and $e_{ij}$ be the stress
and strain tensors defined at every point x of a body and in the time interval
$(-\infty < t < \infty)$. The strain field $e_{ij}(x, t)$ and the displacement field $u_i(x, t)$,
as well as the velocity field $v_i(x, t)$, will be assumed to be infinitesimal, and

(1)    $e_{ij} = \dfrac{1}{2}(u_{i,j} + u_{j,i}),$

where a comma indicates a partial differentiation. Under the assumption
of infinitesimal strain and velocity, the partial derivative with respect to
time $\partial e_{ij}/\partial t$ is equal to the material derivative $\dot{e}_{ij}$ within the first order.
Now we define a linear viscoelastic material to be one for which $\sigma_{ij}(x, t)$ is
related to $e_{ij}(x, t)$ by a convolution integral as follows:

(2) ▲    $\sigma_{ij}(x, t) = \int_{-\infty}^{t} G_{ijkl}(x, t - \tau)\,\dfrac{\partial e_{kl}}{\partial\tau}(x, \tau)\,d\tau,$

where $G_{ijkl}$ is a tensor field of order 4 and is called the tensorial relaxation
function of the material. Equation (2) is called the stress-strain law of the
relaxation type. Its inverse,

(3) ▲    $e_{ij}(x, t) = \int_{-\infty}^{t} J_{ijkl}(x, t - \tau)\,\dfrac{\partial\sigma_{kl}}{\partial\tau}(x, \tau)\,d\tau,$

if it exists, is called the stress-strain law of the creep type. The fourth-order
tensor $J_{ijkl}$ is called the tensorial creep function. It can be shown (Sternberg
and Gurtin$^{15.1}$ 1962, Theorem 3.3) that the inverse (3) of Eq. (2) exists
if $G_{ijkl}(x, t)$ is twice differentiable and if the initial value of $G_{ijkl}(x, t)$ at
t = 0 is not zero.

The lower limits of integration in Eqs. (2) and (3) are taken as $-\infty$,
which is to mean that the integration is to be taken from before the very
beginning of motion. If the motion starts at time t = 0, and $\sigma_{ij} = e_{ij} = 0$ for
t < 0, Eq. (2) reduces to

(4)    $\sigma_{ij}(x, t) = e_{kl}(x, 0+)\,G_{ijkl}(x, t) + \int_0^t G_{ijkl}(x, t - \tau)\,\dfrac{\partial e_{kl}}{\partial\tau}(x, \tau)\,d\tau,$

where $e_{ij}(x, 0+)$ is the limiting value of $e_{ij}(x, t)$ when $t \to 0$ from the
positive side. The first term in Eq. (4) gives the effect of initial disturbance;
it arises from the jump of $e_{ij}(x, t)$ at t = 0. In fact, it was tacitly assumed
that $e_{ij}(x, t)$ is continuous and differentiable when Eq. (2) was written.
Any discontinuity of $e_{ij}(x, t)$ in the form of a jump will contribute a term
similar to the first term in Eq. (4). For example, if $e_{ij}(x, t)$ has another
jump $\Delta e_{ij}(x, \xi)$ at $t = \xi$, as shown in Fig. 15.1:1, while $G_{ijkl}$ and $\partial e_{ij}/\partial t$
are continuous elsewhere, then we have

(5)    $\sigma_{ij}(x, t) = e_{kl}(x, 0+)\,G_{ijkl}(x, t) + \Delta e_{kl}(x, \xi)\,G_{ijkl}(x, t - \xi)\,\mathbf{1}(t - \xi) + \int_0^t G_{ijkl}(x, t - \tau)\,\dfrac{\partial e_{kl}}{\partial\tau}(x, \tau)\,d\tau,$

where $\mathbf{1}(t)$ is the unit-step function. In such cases the following forms, which
are equivalent to Eq. (4) when $\partial e_{ij}/\partial t$, $\partial G_{ijkl}/\partial t$ exist and are continuous
in $0 \le t < \infty$, may be used to advantage:

(6)    $\sigma_{ij}(x, t) = G_{ijkl}(x, 0)\,e_{kl}(x, t) + \int_0^t e_{kl}(x, t - \tau)\,\dfrac{\partial G_{ijkl}}{\partial\tau}(x, \tau)\,d\tau = \dfrac{\partial}{\partial t}\int_0^t e_{kl}(x, t - \tau)\,G_{ijkl}(x, \tau)\,d\tau.$

Fig. 15.1:1. Illustration for a loading history with jumps.

These constitutive equations are appropriate for isothermal conditions.
For the influence of temperature, see references listed in Bibliography 15.2,
p. 505.
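
The equivalence of the relaxation form (4) and the first form of Eq. (6) can be seen numerically in one dimension. The sketch below assumes, purely for illustration, a relaxation function of the standard-linear-solid type, $G(t) = E_\infty + E_1 e^{-t/\tau_R}$, and a strain history consisting of a jump at t = 0 followed by a constant strain rate. The constants, the function names, and the midpoint-rule quadrature are all hypothetical choices.

```python
import math

E_INF, E_1, TAU_R = 1.0, 2.0, 0.5   # hypothetical standard-linear-solid constants

def G(t):
    """Assumed one-dimensional relaxation function."""
    return E_INF + E_1 * math.exp(-t / TAU_R)

def dG(t):
    """Time derivative of the assumed relaxation function."""
    return -(E_1 / TAU_R) * math.exp(-t / TAU_R)

E0, RATE = 0.01, 0.02               # strain jump at t = 0+ and subsequent strain rate

def e(t):
    return E0 + RATE * t            # strain history for t > 0; e(0) stands for e(0+)

def de(t):
    return RATE

def stress_eq4(t, n=4000):
    """Eq. (4): e(0+) G(t) + int_0^t G(t - tau) de/dtau dtau, by the midpoint rule."""
    h = t / n
    integral = h * sum(G(t - (i + 0.5) * h) * de((i + 0.5) * h) for i in range(n))
    return e(0.0) * G(t) + integral

def stress_eq6(t, n=4000):
    """First form of Eq. (6): G(0) e(t) + int_0^t e(t - tau) dG/dtau dtau."""
    h = t / n
    integral = h * sum(e(t - (i + 0.5) * h) * dG((i + 0.5) * h) for i in range(n))
    return G(0.0) * e(t) + integral

def stress_closed_form(t):
    """Analytical value of Eq. (4) for the assumed G and strain history."""
    ramp = RATE * (E_INF * t + E_1 * TAU_R * (1.0 - math.exp(-t / TAU_R)))
    return E0 * G(t) + ramp

t = 1.5
print(stress_eq4(t), stress_eq6(t), stress_closed_form(t))
```

The three printed values agree to within the quadrature error, which illustrates that the jump term $e(0+)G(t)$ in Eq. (4) is absorbed into $G(0)e(t)$ when the integration by parts leading to Eq. (6) is carried out.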

A viscoelastic material is defined by a specific relaxation function or
creep function. In the following discussions we shall regard $G_{ijkl}$ or $J_{ijkl}$ as
experimentally determined functions. Restrictions imposed on these functions
by thermodynamics, as well as their origin in hidden coordinates, are
discussed in Chapter 13.

In the treatment of viscoelasticity, we would have to write many convolution
integrals. A shorthand is therefore desired. We shall introduce the
notation of composition products. Let $\varphi$ and $\psi$ be functions defined on the
intervals $0 \le t < \infty$ and $-\infty < t < \infty$, respectively, and let the integral

(7)    $I(t) = \int_0^t \varphi(t - \tau)\,\dfrac{d\psi}{d\tau}(\tau)\,d\tau + \varphi(t)\,\psi(0)$

exist for all t in $(0, \infty)$. Then the function I(t) is called the convolution of
$\varphi$ and $\psi$ and is denoted by a "composition" product

(8)    $I(t) = \varphi * d\psi.$

The integral in Eq. (7) may be understood in Riemannian or Stieltjes' sense,
the latter being more general and better suited to our purpose. Sternberg
and Gurtin$^{15.1}$ developed the theory of viscoelasticity on the basis of
Stieltjes' convolution.

The following properties of the convolution of $\varphi$ with $\psi$ and $\theta$ (also
defined over $-\infty < t < \infty$) can be verified.

(9)    $\varphi * d\psi = \psi * d\varphi$    (commutativity),

(10)    $\varphi * d(\psi * d\theta) = (\varphi * d\psi) * d\theta = \varphi * d\psi * d\theta$    (associativity),

(11)    $\varphi * d(\psi + \theta) = \varphi * d\psi + \varphi * d\theta$    (distributivity),

(12)    $\varphi * d\psi = 0$ implies $\varphi = 0$ or $\psi = 0$    (Titchmarsh theorem).

With these notations, Eqs. (2) and (3) may be written as

(2a) ▲    $\sigma_{ij} = G_{ijkl} * de_{kl} = e_{kl} * dG_{ijkl},$

(3a) ▲    $e_{ij} = J_{ijkl} * d\sigma_{kl} = \sigma_{kl} * dJ_{ijkl}.$

The symmetry of the stress and strain tensors either requires or permits
that

(13)    $G_{ijkl} = G_{jikl} = G_{ijlk},$

(14)    $J_{ijkl} = J_{jikl} = J_{ijlk}.$

Furthermore, it is natural to require that the action of a loading at a
time $t_0$ will produce a response only for $t \ge t_0$. Hence, we must have

(15)    $G_{ijkl} = 0, \quad J_{ijkl} = 0$,  for $-\infty < t < 0$.

This requirement is sometimes called the axiom of nonretroactivity.

If $G_{ijkl}$ is invariant with respect to rotation of Cartesian coordinates,
the material is said to be isotropic. It can be shown that a material is
isotropic if and only if $G_{ijkl}$ is an isotropic tensor. A fourth-order isotropic
tensor with the symmetry properties of Eq. (13) can be written as

(16)    $G_{ijkl} = \dfrac{G_2 - G_1}{3}\,\delta_{ij}\delta_{kl} + \dfrac{G_1}{2}\,(\delta_{ik}\delta_{jl} + \delta_{il}\delta_{jk}),$

where $G_1$, $G_2$ are scalar functions satisfying Eq. (15). If Eq. (16) is
substituted into Eq. (2a), we obtain the stress-strain law which can be put in the
form

(17) ▲    $\sigma'_{ij} = e'_{ij} * dG_1 = G_1 * de'_{ij}, \qquad \sigma_{kk} = e_{kk} * dG_2 = G_2 * de_{kk},$

where $\sigma'_{ij}$ and $e'_{ij}$ are the components of the stress and strain deviators:

(18)    $\sigma'_{ij} = \sigma_{ij} - \dfrac{1}{3}\delta_{ij}\sigma_{kk}, \qquad e'_{ij} = e_{ij} - \dfrac{1}{3}\delta_{ij}e_{kk}.$

The functions $G_1$ and $G_2$ are referred to as the relaxation functions in
shear and in isotropic compression, respectively.

The corresponding stress-strain law of the creep type for an isotropic
material is

(19) ▲ $e'_{ij} = \sigma'_{ij} * dJ_1 = J_1 * d\sigma'_{ij}$, $\quad e_{kk} = \sigma_{kk} * dJ_2 = J_2 * d\sigma_{kk}$,

where $J_1$, $J_2$ are called the creep functions in shear and in isotropic
compression, respectively.

If $G_{ijkl}$, $J_{ijkl}$ or $G_1$, $G_2$, $J_1$, $J_2$ are step functions in time, then the
stress-strain laws (2), (3), (17), and (19) reduce to those of linear elastic
solids.

The Fourier transforms of $G$ and $J$ can be interpreted as the responses
to appropriate harmonic forcing functions. Within a certain range of
frequencies, measurements of harmonic responses of many materials are
feasible. The bulk of our information about the viscoelastic behavior of high
polymers is presented in the form of frequency responses.


15.2. STRESS-STRAIN RELATIONS IN DIFFERENTIAL 
EQUATION FORM 

In Secs. 13.8 and 13.9, it was shown that when the relaxation function
consists of a finite discrete spectrum, the stress-strain relation may be put
in the form of a differential equation. Simple examples are the Maxwell,
Voigt, or the standard linear models discussed in Sec. 1.2. A more general
expression may be given as follows. Let $D$ denote the time-derivative
operator, or differential operator defined by

(1) $Df = \frac{\partial f}{\partial t}$, $\quad D^2 f = \frac{\partial^2 f}{\partial t^2}$, etc.,

where $f$ is a function of time. Let us consider the polynomials

(2) $P_1(D) = \sum_{k=0}^{n_1} a_k D^k$, $\quad Q_1(D) = \sum_{k=0}^{m_1} b_k D^k$,

$\quad\;\; P_2(D) = \sum_{k=0}^{n_2} c_k D^k$, $\quad Q_2(D) = \sum_{k=0}^{m_2} d_k D^k$,

where $a_k$, $b_k$, $c_k$, $d_k$ are real-valued functions of the spatial coordinates $x_1$,
$x_2$, $x_3$. We assume that the coefficients $a_{n_1}$, $b_{m_1}$, $c_{n_2}$, $d_{m_2}$ are different
from zero, so that $P_1$, $P_2$, $Q_1$, $Q_2$ are polynomials of degree $n_1$, $n_2$, $m_1$,
$m_2$, respectively. Then the stress-strain relations

(3) ▲ $P_1(D)\sigma'_{ij} = Q_1(D)e'_{ij}$, $\quad P_2(D)\sigma_{kk} = Q_2(D)e_{kk}$

specify an isotropic linear viscoelastic material. Here $\sigma'_{ij}$, $e'_{ij}$, $\sigma_{kk}$, $e_{kk}$
[see Eq. (15.1:17)] shall be understood to be functions of $x_1$, $x_2$, $x_3$ and
$t$. In Eq. (3), the dependence on $x$ is not explicitly shown, but shall be
understood.

Fig. 15.2:1. Generalized Kelvin and Maxwell models.


If we consider the load-deflection relationship of a network of springs
and dashpots such as those pictured in Fig. 15.2:1, it can be shown that the
load $\sigma$ and the deflection $e$ are related by an equation of the form of Eq. (3)
(see Problem 15.1). For this reason, a viscoelastic body is often represented
by a mechanical model. Naturally, not all conceivable polynomials (2) may
represent physically realizable systems. Some restrictions are imposed by
thermodynamic considerations, as discussed in Sec. 13.9 and Problem 13.5.

If the stress-strain law for a given material can be expressed in both
the integral form and the differential form, then the relationship between
the relaxation (or creep) function and the differential operators can be
determined. The result is particularly simple if we assume that

(4) $\sigma'_{ij}(t) = e'_{ij}(t) = 0$ for $t < 0$,

and that the Laplace transformations (with respect to time) of all the functions
concerned exist. Let the Laplace transformation of a function $f(t)$ be
indicated by a bar, thus

$\bar f(s) = \int_0^\infty e^{-st} f(t)\,dt$, $\quad \mathrm{Rl}\; s > s_0$,

where $s_0$ is the abscissa of convergence of the Laplace integral.

The Laplace transformations of Eqs. (15.1:17) and (15.1:19) are

(5) $\bar\sigma'_{ij}(s) = s\bar G_1(s)\,\bar e'_{ij}(s)$, $\quad \bar\sigma_{kk}(s) = s\bar G_2(s)\,\bar e_{kk}(s)$,

(6) $\bar e'_{ij}(s) = s\bar J_1(s)\,\bar\sigma'_{ij}(s)$, $\quad \bar e_{kk}(s) = s\bar J_2(s)\,\bar\sigma_{kk}(s)$.

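The Laplace transformations of Eqs. (3) are

(7a) $P_1(s)\bar\sigma'_{ij}(s) - \sum_{k=1}^{n_1} a_k \left[ s^{k-1}\sigma'_{ij}(0) + s^{k-2}\frac{\partial\sigma'_{ij}}{\partial t}(0) + \cdots + \frac{\partial^{k-1}\sigma'_{ij}}{\partial t^{k-1}}(0) \right]$

$\quad = Q_1(s)\bar e'_{ij}(s) - \sum_{k=1}^{m_1} b_k \left[ s^{k-1}e'_{ij}(0) + \cdots + \frac{\partial^{k-1}e'_{ij}}{\partial t^{k-1}}(0) \right]$,

(7b) $P_2(s)\bar\sigma_{jj}(s) - \sum_{k=1}^{n_2} c_k \left[ s^{k-1}\sigma_{jj}(0) + \cdots + \frac{\partial^{k-1}\sigma_{jj}}{\partial t^{k-1}}(0) \right]$

$\quad = Q_2(s)\bar e_{jj}(s) - \sum_{k=1}^{m_2} d_k \left[ s^{k-1}e_{jj}(0) + \cdots + \frac{\partial^{k-1}e_{jj}}{\partial t^{k-1}}(0) \right]$,

where

(8) $P_1(s) = \sum_{k=0}^{n_1} a_k s^k$, $\quad Q_1(s) = \sum_{k=0}^{m_1} b_k s^k$,

$\quad\;\; P_2(s) = \sum_{k=0}^{n_2} c_k s^k$, $\quad Q_2(s) = \sum_{k=0}^{m_2} d_k s^k$,

and $\sigma'_{ij}(0)$, $(\partial\sigma'_{ij}/\partial t)(0)$, etc., are the initial values of $\sigma'_{ij}$, $\partial\sigma'_{ij}/\partial t$, etc., i.e.,
the values of $\sigma'_{ij}(x, t)$ and its time derivatives as $t \to 0$ from the positive side.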

Whether Eqs. (7) should be identified with the relaxation law (5) or
with the creep law (6) depends on whether $n_1 \ge m_1$, $n_2 \ge m_2$ or not. If
$n_1 \ge m_1$, then Eq. (7a) may be written as*

(9) $\bar\sigma'_{ij}(s) = \frac{Q_1(s)}{P_1(s)}\,\bar e'_{ij}(s)$,

*Note that a polynomial of $s$ has no continuous inverse function. The inverse of $s$ is $\dot\delta(t)$;
those of $s^2$, $s^3$ are higher order singularities.

provided that the following initial condition holds:

(10) $\sum_{k=1}^{n_1} a_k \left[ s^{k-1}\sigma'_{ij}(0) + \cdots + \frac{\partial^{k-1}\sigma'_{ij}}{\partial t^{k-1}}(0) \right] - \sum_{k=1}^{m_1} b_k \left[ s^{k-1}e'_{ij}(0) + \cdots + \frac{\partial^{k-1}e'_{ij}}{\partial t^{k-1}}(0) \right] = 0$.


In this case we may identify Eq. (9) with the relaxation law (5), with

(11) ▲ $\bar G_1(s) = \frac{Q_1(s)}{sP_1(s)}$.

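The initial condition (10) is an identity as a polynomial in $s$; every coefficient
of the polynomial must vanish. Thus, if $m_1 = n_1$,

(12) ▲ $a_{n_1}\sigma'_{ij}(0) = b_{n_1}e'_{ij}(0)$,

$\quad\vdots$

$\quad a_{n_1}\frac{\partial^{n_1-1}\sigma'_{ij}}{\partial t^{n_1-1}}(0) + a_{n_1-1}\frac{\partial^{n_1-2}\sigma'_{ij}}{\partial t^{n_1-2}}(0) + \cdots + a_1\sigma'_{ij}(0)$

$\quad = b_{n_1}\frac{\partial^{n_1-1}e'_{ij}}{\partial t^{n_1-1}}(0) + \cdots + b_1 e'_{ij}(0)$.

If $m_1 < n_1$, then those coefficients $b_k$ in Eqs. (12) with subscript $k > m_1$
must be replaced by zero.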

In the alternative case $n_1 \le m_1$, Eq. (7a) may be written as

(13) $\bar e'_{ij}(s) = \frac{P_1(s)}{Q_1(s)}\,\bar\sigma'_{ij}(s)$,

provided that the initial condition (10) holds. We may identify Eq. (13)
with the creep law (6), with

(14) $\bar J_1(s) = \frac{P_1(s)}{sQ_1(s)}$.

If $n_1 = m_1$, then the material may be represented by stress-strain laws of
both the relaxation and the creep type.
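As an illustration of Eqs. (11) and (14), the sketch below evaluates $Q_1(s)/[sP_1(s)]$ and $P_1(s)/[sQ_1(s)]$ for an assumed Maxwell model, $P_1(D) = 1 + \tau D$, $Q_1(D) = E\tau D$ (so $n_1 = m_1 = 1$), and compares them with the transforms of $G(t) = E e^{-t/\tau}$ and $J(t) = (1 + t/\tau)/E$. The parameter values and the helper name `poly` are hypothetical choices made only for this example.

```python
def poly(coeffs, s):
    """Evaluate sum_k coeffs[k] * s**k by Horner's rule."""
    value = 0.0
    for c in reversed(coeffs):
        value = value * s + c
    return value


E, TAU = 2.0, 0.5        # assumed Maxwell parameters
a = [1.0, TAU]           # P1(D) = 1 + TAU * D
b = [0.0, E * TAU]       # Q1(D) = E * TAU * D


def G_bar(s):
    """Transformed relaxation function, Eq. (11)."""
    return poly(b, s) / (s * poly(a, s))


def J_bar(s):
    """Transformed creep function, Eq. (14)."""
    return poly(a, s) / (s * poly(b, s))


s = 3.0 + 1.0j                                    # sample point, Rl s > 0
G_exact = E / (s + 1.0 / TAU)                     # transform of E*exp(-t/TAU)
J_exact = 1.0 / (E * s) + 1.0 / (E * TAU * s * s) # transform of (1 + t/TAU)/E
print(abs(G_bar(s) - G_exact), abs(J_bar(s) - J_exact))
```

Because $n_1 = m_1$ here, both identifications are available, and the product $s^2 \bar G_1 \bar J_1$ equals 1, in agreement with Eqs. (5) and (6).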

An analogous situation exists for the laws governing the mean stress
and mean strain, $\sigma_{kk}$ and $e_{kk}$.

The initial conditions (12) are interesting and important. They represent
the proper initial conditions that must be imposed when Eqs. (3) are
regarded as differential equations to solve for $\sigma'_{ij}$, with $e'_{ij}$ regarded as given
forcing functions; or vice versa.

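What is the physical significance of the initial conditions (12)? We
recognized their necessity through an identification of Eq. (15.1:17) in the
preceding Section, which is valid for $t$ in $(-\infty, \infty)$, with Eq. (3) of this
Section, which applies for $t$ in $(0, \infty)$. For the purpose of identification we
have assumed $e'_{ij} = \sigma'_{ij} = 0$ for $t$ in $(-\infty, 0)$. If

(15) $e'_{ij} = \frac{\partial e'_{ij}}{\partial t} = \cdots = \frac{\partial^{n_1-1}e'_{ij}}{\partial t^{n_1-1}} = 0$ at $t = 0$,

then Eqs. (12) imply

(16) $\sigma'_{ij} = \frac{\partial\sigma'_{ij}}{\partial t} = \cdots = \frac{\partial^{n_1-1}\sigma'_{ij}}{\partial t^{n_1-1}} = 0$ at $t = 0$.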

Thus the transition is smooth, as expected. The surprising aspect of
Eqs. (12) arises only when $e'_{ij}(t)$ is continuous in $(0, \infty)$ but the limiting
values $e'_{ij}(0+)$, $(\partial e'_{ij}/\partial t)(0+)$, etc., are nonzero as $t \to 0$ from the
positive side. In this case there is a jump in the value of $e'_{ij}$ or its
derivatives in the neighborhood of $t = 0$. The initial conditions (12)
must stem from the fact that the differential operator in Eq. (3) is expected
to hold, in a certain sense, during the jump.

Fig. 15.2:2. Smoothing of a jump.

To see that this is indeed the case, let us smooth off the jump in
an interval $(-\epsilon, \epsilon)$ (see Fig. 15.2:2). Let $e'_{ij}(t, \epsilon)$ be a smooth function
(bounded, continuous, and $n_1$ times differentiable) in the range
$(-\epsilon, \epsilon)$, which has the property that $e'_{ij}(t, \epsilon)$ and its derivatives
vanish at the left end, $t = -\epsilon$, whereas they assume the values $e'_{ij}(0+)$,
$(\partial e'_{ij}/\partial t)(0+)$, etc., at the right end, $t = +\epsilon$. Such a function serves
to describe the jump if we let $\epsilon \to 0$. Let $\sigma'_{ij}(t, \epsilon)$ be the corresponding
smoothing function describing the jump in the stress deviator. We now
assume that $e'_{ij}(t, \epsilon)$ and $\sigma'_{ij}(t, \epsilon)$ are connected
by the viscoelasticity law (3); i.e.,


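(17) $\sum_{k=0}^{N} a_k \frac{\partial^k \sigma'_{ij}(t, \epsilon)}{\partial t^k} = \sum_{k=0}^{N} b_k \frac{\partial^k e'_{ij}(t, \epsilon)}{\partial t^k}$,

where $N$ is equal to the larger of $m_1$ and $n_1$, with some of the coefficients
$a_k$, $b_k$ set equal to zero to eliminate any terms which may be absent from
Eq. (3).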

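An integration of Eq. (17) with respect to $t$ gives

(18) $\sum_{k=1}^{N} a_k \frac{\partial^{k-1}\sigma'_{ij}(t, \epsilon)}{\partial t^{k-1}} + a_0 \int_{-\epsilon}^{t} \sigma'_{ij}(\tau, \epsilon)\,d\tau = \sum_{k=1}^{N} b_k \frac{\partial^{k-1}e'_{ij}(t, \epsilon)}{\partial t^{k-1}} + b_0 \int_{-\epsilon}^{t} e'_{ij}(\tau, \epsilon)\,d\tau$.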

Now let $t = \epsilon$, and take the limit $\epsilon \to 0$. Since $\sigma'_{ij}(t, \epsilon)$, $e'_{ij}(t, \epsilon)$ are bounded
in $(-\epsilon, \epsilon)$, the integrals in Eq. (18) are of order $\epsilon$ and vanish as $\epsilon \to 0$.
Therefore, at $t = \epsilon$ and in the limit $\epsilon \to 0$, we obtain the necessary initial
condition

$\sum_{k=1}^{N} a_k \frac{\partial^{k-1}\sigma'_{ij}}{\partial t^{k-1}}(0+) = \sum_{k=1}^{N} b_k \frac{\partial^{k-1}e'_{ij}}{\partial t^{k-1}}(0+)$,

which is one of the equations in Eq. (12). Integrating Eq. (18) again from
$-\epsilon$ to $\epsilon$ and repeating the arguments as above, we obtain the second initial
condition

$\sum_{k=2}^{N} a_k \frac{\partial^{k-2}\sigma'_{ij}}{\partial t^{k-2}}(0+) = \sum_{k=2}^{N} b_k \frac{\partial^{k-2}e'_{ij}}{\partial t^{k-2}}(0+)$.

All Eqs. (12) can be obtained in this way by repeated applications of the
same process.
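The smoothing argument can be tried numerically. The sketch below is an illustration under stated assumptions, not the book's procedure: it takes a first-order law $a_1\dot\sigma + a_0\sigma = b_1\dot e + b_0 e$ with made-up coefficients, raises the strain from 0 to $e_0$ over $(-\epsilon, \epsilon)$ with a cubic ramp (one possible smoothing function), integrates by a classical Runge-Kutta scheme, and compares $\sigma(\epsilon)$ with the value $b_1 e_0/a_1$ predicted by the first of Eqs. (12).

```python
def smooth_step(x):
    """Cubic ramp from 0 at x = 0 to 1 at x = 1 (chosen for illustration)."""
    return x * x * (3.0 - 2.0 * x)


def smooth_step_rate(x):
    """Derivative of smooth_step with respect to x."""
    return 6.0 * x * (1.0 - x)


def stress_after_jump(a0, a1, b0, b1, e0, eps, n=4000):
    """Integrate a1*sigma' + a0*sigma = b1*e' + b0*e over (-eps, eps) by RK4.

    The strain rises smoothly from 0 to e0; the stress starts from rest.
    """
    def rhs(t, sigma):
        x = (t + eps) / (2.0 * eps)
        e = e0 * smooth_step(x)
        e_dot = e0 * smooth_step_rate(x) / (2.0 * eps)
        return (b1 * e_dot + b0 * e - a0 * sigma) / a1

    h = 2.0 * eps / n
    t, sigma = -eps, 0.0
    for _ in range(n):
        k1 = rhs(t, sigma)
        k2 = rhs(t + 0.5 * h, sigma + 0.5 * h * k1)
        k3 = rhs(t + 0.5 * h, sigma + 0.5 * h * k2)
        k4 = rhs(t + h, sigma + h * k3)
        sigma += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
        t += h
    return sigma


a0, a1, b0, b1, e0 = 1.0, 0.5, 2.0, 3.0, 1.0   # assumed coefficients
predicted = b1 * e0 / a1                        # Eq. (12) with n1 = m1 = 1
errors = [abs(stress_after_jump(a0, a1, b0, b1, e0, eps) - predicted)
          for eps in (1e-1, 1e-2, 1e-3)]
print(errors)
```

The discrepancy shrinks roughly in proportion to $\epsilon$, which is consistent with the statement that the integrals in Eq. (18) are of order $\epsilon$.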


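Problem 15.1. Let $\sigma$ denote loads and $e$ denote displacements. Show that,
for the generalized Kelvin model shown in Fig. 15.2:1(a),

$G_0 e_1 = \sigma$, $\quad G_1 e_2 + \eta_1 D e_2 = \sigma$, $\quad \ldots$, $\quad e = e_1 + e_2 + \cdots + e_n$,

so that

$e = \left( \frac{1}{G_0} + \frac{1}{G_1 + \eta_1 D} + \cdots + \frac{1}{G_n + \eta_n D} + \frac{1}{\eta_{n+1} D} \right) \sigma$.

For the generalized Maxwell model shown in Fig. 15.2:1(b), we have

$G_0 e = \sigma_0$, $\quad \left( \frac{1}{G_1} + \frac{1}{\eta_1 D} \right) \sigma_1 = e$, $\quad \ldots$, $\quad \sigma = \sigma_0 + \sigma_1 + \cdots + \sigma_{n+1}$,

so that

$\sigma = \left( G_0 + \frac{G_1 \eta_1 D}{G_1 + \eta_1 D} + \cdots + \frac{G_n \eta_n D}{G_n + \eta_n D} + \eta_{n+1} D \right) e$.

Reduce these relations to the form of Eq. (3).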
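The operator expressions for spring-dashpot networks can be evaluated by replacing $D$ with the Laplace parameter $s$ (zero initial conditions assumed). The following sketch is illustrative only; the function names and the sample values are assumptions. It sums compliances for elements in series (Kelvin-type chain) and moduli for elements in parallel (Maxwell-type array), and checks that a spring in series with a dashpot gives the same operator either way.

```python
def kelvin_compliance(s, g0, units, eta_free):
    """e/sigma for a spring g0, Kelvin units (G_k, eta_k), and a dashpot, all in series."""
    total = 1.0 / g0 + 1.0 / (eta_free * s)
    for g_k, eta_k in units:
        total += 1.0 / (g_k + eta_k * s)   # spring and dashpot in parallel
    return total


def maxwell_modulus(s, g0, units, eta_free):
    """sigma/e for a spring g0, Maxwell units (G_k, eta_k), and a dashpot, all in parallel."""
    total = g0 + eta_free * s
    for g_k, eta_k in units:
        total += g_k * eta_k * s / (g_k + eta_k * s)   # spring and dashpot in series
    return total


s = 0.7 + 0.2j                      # sample value of the transform parameter
g, eta = 4.0, 2.0                   # assumed spring constant and viscosity

# A single Maxwell element, described in two ways.
as_series_chain = 1.0 / kelvin_compliance(s, g, [], eta)
as_parallel_array = maxwell_modulus(s, 0.0, [(g, eta)], 0.0)
print(as_series_chain, as_parallel_array)
```

Clearing the denominators of either expression yields polynomials in $s$, which is the step needed to reach the form of Eq. (3).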

Problem 15.2. Let a special case of Eq. (3) be specified by

$P_1\sigma'_{ij} = a_2\ddot\sigma'_{ij} + a_1\dot\sigma'_{ij} + a_0\sigma'_{ij}$,

$Q_1 e'_{ij} = b_2\ddot e'_{ij} + b_1\dot e'_{ij} + b_0 e'_{ij}$,

where a dot denotes a differentiation with respect to time. Deduce proper initial
conditions. Interpret the usual statement that "the initial response of a viscoelastic
body is purely elastic."


15.3. BOUNDARY-VALUE PROBLEMS AND INTEGRAL 
TRANSFORMATIONS 

The motion of a viscoelastic body is governed by the laws of conservation
of mass and momentum, the stress-strain relations, and the boundary
conditions and initial conditions. Let $u_i$, $e_{ij}$, $\sigma_{ij}$, and $X_i$ denote the Cartesian
components of the displacement, strain, stress, and body force per unit
volume, respectively, and $\rho$ be the mass density. Let us restrict our consideration
to homogeneous and isotropic viscoelastic solids, and infinitesimal
displacements. Under these limitations the field equations are:

(1) Definition of strain: $e_{ij} = \frac{1}{2}(u_{i,j} + u_{j,i})$,

(2) Equation of continuity: $\frac{\partial\rho}{\partial t} + \frac{\partial}{\partial x_i}\left(\rho\frac{\partial u_i}{\partial t}\right) = 0$,

(3) Equations of motion: $\sigma_{ij,j} + X_i = \rho\frac{\partial^2 u_i}{\partial t^2}$, $\quad \sigma_{ij} = \sigma_{ji}$.

The stress-strain relation may assume any of the following forms (see
Sec. 15.2):

(4a) Relaxation law: $\sigma'_{ij} = e'_{ij} * dG_1$, $\quad \sigma_{kk} = e_{kk} * dG_2$,

(4b) Creep law: $e'_{ij} = \sigma'_{ij} * dJ_1$, $\quad e_{kk} = \sigma_{kk} * dJ_2$,

(4c) Differential operator law: $P_1(D)\sigma'_{ij} = Q_1(D)e'_{ij}$, $\quad P_2(D)\sigma_{kk} = Q_2(D)e_{kk}$,

where $\sigma'_{ij}$, $e'_{ij}$ are the stress and strain deviators defined by

(5) $\sigma'_{ij} = \sigma_{ij} - \frac{1}{3}\delta_{ij}\sigma_{kk}$, $\quad e'_{ij} = e_{ij} - \frac{1}{3}\delta_{ij}e_{kk}$,

$G_1$, $G_2$ are the relaxation functions, $J_1$, $J_2$ are the creep functions, and $P_1$,
$P_2$, $Q_1$, $Q_2$ are polynomials of the time-derivative operator $D$ as discussed
in Sec. 15.2. In view of the assumed homogeneity of the material, these
functions and operators are independent of position.

If the body is originally undisturbed, then the initial conditions are

(6) $u_i = e_{ij} = \sigma_{ij} = 0$ for $-\infty < t < 0$.


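If the differential operator law (4c) is used and there is a jump condition
at $t = 0$, then the initial conditions may assume the form of specific
assigned values of $e'_{ij}(0+)$, $(\partial e'_{ij}/\partial t)(0+)$, ..., $(\partial^{n-1}e'_{ij}/\partial t^{n-1})(0+)$, and of $\sigma'_{ij}(0+)$,
$(\partial\sigma'_{ij}/\partial t)(0+)$, ..., $(\partial^{n-1}\sigma'_{ij}/\partial t^{n-1})(0+)$, which must be connected by the necessary
conditions [Eq. (15.2:12)]:

(7) $\sum_{k=r}^{n} a_k \frac{\partial^{k-r}\sigma'_{ij}}{\partial t^{k-r}}(0+) = \sum_{k=r}^{n} b_k \frac{\partial^{k-r}e'_{ij}}{\partial t^{k-r}}(0+)$, $\quad r = 1, 2, \ldots, n$,

where $n$ is the larger of the degrees of $P_1(D)$ and $Q_1(D)$. Similar statements
hold for initial values of mean stress and mean strain.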

The boundary conditions may take the form of either assigned traction
over a surface $S_\sigma$ with an outward-pointing unit normal vector $\nu_i$,

(8) $T_i = \sigma_{ij}\nu_j = f_i$ over $S_\sigma$,

or of assigned displacement over a surface $S_u$,

(9) $u_i = g_i$ over $S_u$,

where $f_i$, $g_i$ are prescribed functions of position and time, and $S_\sigma + S_u = S$,
the total surface of the body.

The problems in the theory of linear viscoelasticity usually consist of
determining $u_i$, $e_{ij}$, and $\sigma_{ij}$ for prescribed $X_i$, $f_i$, $g_i$ and initial conditions.
Except for the stress-strain law, these same equations occur in the linearized
theory of elasticity.

In the linearized theory of elasticity we have applied the Laplace transformation
to solve dynamic problems. The transformation translates an
original problem involving derivatives and convolution integrals with respect
to the time $t$ into an algebraic problem with respect to a parameter
$s$. After the algebraic problem is solved, the solution is translated back to
the time domain. The last step, of course, may not be easy.

It appears natural to apply the Laplace transformation also to problems 
in linear viscoelasticity. In fact, the transformed problem, involving the 
parameter s, can be put into the same form as that in the theory of linear 
elasticity. If the latter problem can be solved, the viscoelasticity solution 
can be obtained by an inverse transformation from s back to the time 
domain. The last step, again, may not be easy. 

The Fourier transform may be used in place of the Laplace transform.
The selection of the appropriate transform depends on the nature of the
functions involved. A function which has a Laplace transform may not
necessarily have a Fourier transform, and vice versa. For example, the
unit-step function $1(t)$, or a power $t^n$, has no Fourier transform in the
interval $(0, \infty)$.

The identification of a problem in linear elasticity with one in viscoelasticity
in the transformed plane is called the correspondence principle. Applications
of the correspondence principle will become apparent by examining
some examples in Secs. 15.4 and 15.5.

Problem 15.3. Assuming that the functions $u_i$, $G_1$, $G_2$ are continuous and
can be differentiated as many times as may be desired, show that the equation of
motion may be written (in the form of Navier's equation) as

$u_{i,jj} * dG_1 + (u_{j,j})_{,i} * dK + 2X_i = 2\rho\frac{\partial^2 u_i}{\partial t^2}$, $\quad i = 1, 2, 3$,

where $3K = G_1 + 2G_2$. (Gurtin and Sternberg 1962^{15.1})

Problem 15.4. From the strain equations of compatibility

$e_{ij,kk} + e_{kk,ij} - e_{ik,jk} - e_{jk,ik} = 0$,

the stress-strain relations

$e_{ij} = \sigma_{ij} * dJ_1 + \frac{1}{3}\delta_{ij}\sigma_{kk} * d(J_2 - J_1)$,

and the equations of equilibrium

$\sigma_{ik,k} = -X_i$,

deduce the stress equations of compatibility

$\sigma_{ij,kk} * dJ_1 + \sigma_{kk,ij} * d\Lambda = \Theta_{ij}$,

where

$\Theta_{ij} = \delta_{ij}X_{k,k} * d\Omega - (X_{i,j} + X_{j,i}) * dJ_1$,

$3\Lambda = 2J_1 + J_2$,

$\Omega = J_1 * d(J_2 - J_1) * d(J_1 + 2J_2)^{-1}$.

(Gurtin and Sternberg 1962^{15.1})

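Problem 15.5. Let $\mathbf{u}(u_i)$, $\mathbf{e}(e_{ij})$, $\boldsymbol{\sigma}(\sigma_{ij})$ satisfy the equations of viscoelasticity,
and $G_1 \ne 0$, $2G_1 + G_2 \ne 0$. Show that, when the body force $X_i$ is absent,

$\nabla^2(\nabla \cdot \mathbf{u}) = 0$, $\quad \nabla^2(\nabla \times \mathbf{u}) = 0$, $\quad \nabla^2 e_{kk} = 0$, $\quad \nabla^2 \sigma_{kk} = 0$,

$\nabla^4 \mathbf{u} = 0$, $\quad \nabla^4 \mathbf{e} = 0$, $\quad \nabla^4 \boldsymbol{\sigma} = 0$.

(Gurtin and Sternberg 1962^{15.1})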

15.4. WAVES IN AN INFINITE MEDIUM 

Let us consider a viscoelastic medium of infinite extent and look for
solutions of Eqs. (15.3:1)-(15.3:6), in which all dependent variables, including
body forces, vary sinusoidally with time. We shall use the complex
representation for sinusoidal oscillations. Since $\cos\omega t$ is equal to the real
part of $e^{i\omega t}$, a real-valued function $f(x, t)$ that varies like $\cos\omega t$ at every
point $x$ may be represented as $\mathrm{Rl}\,[F(x)e^{i\omega t}]$, where $F(x)$ is real. A real-valued
function $f(x, t)$ that varies like $\cos(\omega t + \phi)$ may be represented as
$\mathrm{Rl}\,[F(x)e^{i(\omega t + \phi)}]$ or as $\mathrm{Rl}\,[\mathcal{F}(x)e^{i\omega t}]$, where $\mathcal{F}(x)$ now stands for the complex
number $F(x)e^{i\phi}$. It is elementary to show that

$\mathrm{Rl}\,\mathcal{F}(x) = F(x)\cos\phi$, $\quad \mathrm{Im}\,\mathcal{F}(x) = F(x)\sin\phi$,

$\mathrm{Rl}\,[\mathcal{F}(x)e^{i\omega t}] = \mathrm{Rl}\,\mathcal{F}\;\mathrm{Rl}\,e^{i\omega t} - \mathrm{Im}\,\mathcal{F}\;\mathrm{Im}\,e^{i\omega t}$

$\quad = F\cos\phi\cos\omega t - F\sin\phi\sin\omega t$

$\quad = F\cos(\omega t + \phi)$.
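The identity can be confirmed with complex arithmetic. The short sketch below is only an illustration with arbitrarily assumed values of the amplitude, phase, and frequency; it also shows that multiplying the complex amplitude by $i$ advances the phase by $\pi/2$.

```python
import cmath
import math

amplitude, phase, omega = 2.0, 0.6, 5.0          # assumed F, phi, omega
F_complex = amplitude * cmath.exp(1j * phase)    # the complex number F e^{i phi}


def signal(t):
    """Real part of F_complex * exp(i*omega*t)."""
    return (F_complex * cmath.exp(1j * omega * t)).real


def advanced(t):
    """Same signal after the complex amplitude is multiplied by i."""
    return (1j * F_complex * cmath.exp(1j * omega * t)).real


t = 0.37
print(signal(t), amplitude * math.cos(omega * t + phase))
print(advanced(t), amplitude * math.cos(omega * t + phase + math.pi / 2.0))
```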

Thus it is clear that the real part of $\mathcal{F}(x)$ is in phase with $\cos\omega t$ and
that the imaginary part of $\mathcal{F}(x)$ leads $\cos\omega t$ by a phase angle $\phi = \pi/2$.
In general, $\phi$ is a function of $x$. In this way a sinusoidal oscillation of a
solid medium whose amplitude and phase angle vary from point to point
may be represented by a complex function $\mathcal{F}(x)$. A multiplication of $\mathcal{F}(x)$
by the imaginary number $i$ means an advance of phase angle by $\pi/2$; a
multiplication by $-i$ means a lag by $\pi/2$.

With these interpretations of complex representation we let

(1) $e_{ij} = \mathrm{Rl}\,(\bar e_{ij}e^{i\omega t})$, $\quad \sigma_{ij} = \mathrm{Rl}\,(\bar\sigma_{ij}e^{i\omega t})$, etc.,

where $\bar e_{ij}$, etc., are complex-valued functions of the spatial coordinates
only. The basic Eqs. (15.3:1)-(15.3:4) now read

(2) $\bar e_{ij} = \frac{1}{2}(\bar u_{i,j} + \bar u_{j,i})$,

(3) $\bar\sigma_{ij,j} + \bar X_i + \rho\omega^2\bar u_i = 0$,

(4) $\bar\sigma'_{ij} = i\omega\bar G_1(\omega)\,\bar e'_{ij}$, $\quad \bar\sigma_{kk} = i\omega\bar G_2(\omega)\,\bar e_{kk}$,

where $\bar G_1(\omega)$, $\bar G_2(\omega)$ are the deviatoric and the dilatational complex moduli,
and are the Fourier transforms of the relaxation functions $G_1(t)$ and $G_2(t)$,
respectively. If we write

(5) $\bar\lambda(\omega) = \frac{1}{3}i\omega[\bar G_2(\omega) - \bar G_1(\omega)]$, $\quad \bar G(\omega) = \frac{1}{2}i\omega\bar G_1(\omega)$,

and call $\bar\lambda(\omega)$ and $\bar G(\omega)$ the complex Lamé constants for a viscoelastic
material, we see that Eqs. (2)-(5) are identical with those governing linear
elasticity theory (Sec. 7.1), except that the Lamé constants are replaced by
complex moduli. Hence, without much ado, we can write down the Navier
equation [cf. Eq. (7.1:9)]

(6) $\bar G(\omega)\bar u_{i,jj} + [\bar\lambda(\omega) + \bar G(\omega)]\bar u_{j,ji} + \bar X_i + \rho\omega^2\bar u_i = 0$


and the solutions, in case Xi = 0, 



where $A$ is a complex constant, $n_j$ is an arbitrary vector, and $C_j$ are components
of a vector perpendicular to $n_j$, i.e., $C_j n_j = 0$.

Equation (7) represents a plane dilatational wave. Equation (8)
represents a plane shear wave. The exponential factors in Eqs. (7) and
(8) are complex. A little reflection shows that the real parts of the factors
in front of $n_k x_k$ are the inverse of wave velocities, $v_D$, $v_R$; whereas the
imaginary parts are attenuation factors $\alpha_D$, $\alpha_R$.

(9) Dilatational waves:

(10) Rotational waves:

If the material is elastic, $\bar\lambda(\omega)$, $\bar G(\omega)$ become real numbers, independent of
$\omega$, in which case the attenuation factors $\alpha_D$ and $\alpha_R$ vanish and the wave
speeds $v_D$, $v_R$ are independent of frequency.
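A rough numerical sketch may help fix ideas about phase velocity and attenuation. Everything in it is an assumption made for illustration: the plane shear wave is taken in the form $\exp[i\omega(t - S\,n_k x_k)]$ with complex slowness $S = \sqrt{\rho/\bar G(\omega)}$ and unit vector $n_k$, the attenuation is defined as $-\omega\,\mathrm{Im}\,S$, and the material is a Maxwell solid in shear with $G_1(t) = 2\mu e^{-t/\tau}$, so that $\bar G(\omega) = \mu\, i\omega\tau/(1 + i\omega\tau)$ by Eq. (5). The sign conventions of the book's own formulas may differ.

```python
import cmath
import math


def maxwell_shear_modulus(omega, mu, tau):
    """Complex shear modulus (1/2)*i*omega*G1_bar for G1(t) = 2*mu*exp(-t/tau)."""
    return mu * 1j * omega * tau / (1.0 + 1j * omega * tau)


def shear_wave(omega, rho, modulus):
    """Phase velocity and attenuation for exp[i*omega*(t - S*x)], S = sqrt(rho/modulus)."""
    slowness = cmath.sqrt(rho / modulus)       # principal branch
    velocity = 1.0 / slowness.real
    attenuation = -omega * slowness.imag       # amplitude decays like exp(-attenuation*x)
    return velocity, attenuation


rho, mu, tau = 1000.0, 1.0e6, 0.01             # assumed density, modulus, relaxation time
omega = 1.0e6                                  # high frequency: omega * tau >> 1

v_hi, a_hi = shear_wave(omega, rho, maxwell_shear_modulus(omega, mu, tau))
v_el, a_el = shear_wave(omega, rho, complex(mu, 0.0))   # purely elastic comparison
print(v_hi, a_hi, v_el, a_el)
```

For the elastic comparison the attenuation vanishes and the speed is $\sqrt{\mu/\rho}$; for the assumed Maxwell solid at high frequency the speed approaches the same value while the attenuation tends to $1/(2\tau\sqrt{\mu/\rho})$.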

Any formal solution of Navier's equation in the classical theory of linear
elasticity, of the form $f = \mathrm{Rl}\,[\bar f(x_1, x_2, x_3)e^{i\omega t}]$, offers a corresponding
solution for a linear viscoelastic body, if the elastic moduli that occur in
$\bar f(x_1, x_2, x_3)$ are replaced by the corresponding complex moduli of the material.
If the boundary conditions for the elastic and viscoelastic problems
are identical, then a solution in viscoelasticity can be obtained by this correspondence
principle. Note, however, that the operation of separating
a complex modulus into its real and imaginary parts is one that has no
counterpart in the theory of linear elasticity. Hence, such an operation is
excluded from the correspondence principle. For example, if we wish to
find the maximum of $|f|$ with respect to $\omega$, we must separate the real and
imaginary parts, and no direct analogy will be available.

Problem 15.6. Work out the details of the derivation of Eqs. (7) and (8) 
by reference to Sec. 7.8. 

Problem 15.7. Find the speed and attenuation factor for Rayleigh surface
waves propagating over a viscoelastic half-space (see Sec. 7.9). Reference: Bland,
Linear Viscoelasticity,^{15.1} pp. 73-75.

Problem 15.8. Consider a uniform cantilever beam the clamped end of
which is forced to oscillate at a constant amplitude $w_0$ (the clamping wall moves,
while the other end is free), Fig. P15.8.

Fig. P15.8. (Label in figure: $w_0\cos\omega t$.)

For an elastic beam, the equation of motion of the beam is

$EI\frac{\partial^4 w}{\partial x^4} + m\frac{\partial^2 w}{\partial t^2} = 0$,

where $w$ is the beam deflection, $m$ is the mass per unit length, $E$ is Young's
modulus, and $I$ the moment of inertia of the cross section. The boundary
conditions are

at $x = 0$: $\quad w = w_0\cos\omega t$, $\quad \frac{\partial w}{\partial x} = 0$;

at $x = L$: $\quad \frac{\partial^2 w}{\partial x^2} = \frac{\partial^3 w}{\partial x^3} = 0$.

Determine a solution $w(x, t)$ in the form $\bar w(x)e^{i\omega t}$.

If the beam material is viscoelastic, determine (a) the corresponding complex
bending rigidity $\overline{EI}(\omega)$ from the stress-strain relationship of the material and (b)
the solution $w(x, t)$.

The amplitude ratio and phase lag of the oscillations at the free and the
clamped ends can be used for experimental determination of the complex modulus.
Reference: Bland and Lee, J. Appl. Phys., 26 (1955), p. 1497.


15.5. QUASI-STATIC PROBLEMS 

If all functions of concern vanish for $t < 0$ and have Laplace transformations,
the transforms of the basic Eqs. (15.3:1)-(15.3:9) assume exactly
the same form as in the theory of linear elasticity. In quasi-static problems,
the loading is assumed to be so slow that inertia forces may be neglected.
In this case, the corresponding elastic problems are static.

Let the Laplace transform of a variable be denoted by a bar over it. The
basic equations for quasi-static problems of a viscoelastic body occupying
a region $V$ with boundary $B = B_\sigma + B_u$ are

(1) $\bar e_{ij} = \frac{1}{2}(\bar u_{i,j} + \bar u_{j,i})$ in $V$,

(2) $\bar\sigma_{ij,j} + \bar X_i = 0$ in $V$,

(3a) $\bar\sigma'_{ij} = s\bar G_1(s)\bar e'_{ij}$, $\quad \bar\sigma_{kk} = s\bar G_2(s)\bar e_{kk}$ in $V$,

or

(3b) $\bar e'_{ij} = s\bar J_1(s)\bar\sigma'_{ij}$, $\quad \bar e_{kk} = s\bar J_2(s)\bar\sigma_{kk}$ in $V$,

or

(3c) $P_1(s)\bar\sigma'_{ij} = Q_1(s)\bar e'_{ij}$, $\quad P_2(s)\bar\sigma_{kk} = Q_2(s)\bar e_{kk}$ in $V$,

and

(4) $\bar\sigma_{ij}\nu_j = \bar f_i$ on $B_\sigma$,

(5) $\bar u_i = \bar g_i$ on $B_u$.

The boundary surface enclosing $V$ is $B = B_\sigma + B_u$. The initial
conditions (15.3:6) and (15.3:7) are assumed to be satisfied both in $V$ and
on $B$. The material properties $\bar G_1(s)$, $\bar G_2(s)$; or $\bar J_1(s)$, $\bar J_2(s)$; or $P_1(s)$,
$P_2(s)$, $Q_1(s)$, $Q_2(s)$ are given. The problem is to determine $\bar u_i$, $\bar e_{ij}$, $\bar\sigma_{ij}$ for
assigned forcing functions $\bar X_i$ in $V$ and $\bar f_i$, $\bar g_i$ on the boundary. If the bars
were removed, these would be the same equations that govern the static
equilibrium of an elastic body of the same geometry. If the solution of the
latter were known, the transformed solution would be obtained, and the
solution of the original problem could be obtained by an inversion.

The counterparts of Young's modulus $E$, the shear modulus $G$, the bulk
modulus $K$, Poisson's ratio $\nu$, and one of the Lamé constants $\lambda$ (the other
Lamé constant is $\mu = G$) can be derived from Eqs. (3a), (3b), and (3c),
and are

(6) $\bar E(s) = \frac{3Q_1(s)Q_2(s)}{2P_1(s)Q_2(s) + Q_1(s)P_2(s)} = \frac{3s\bar G_1(s)\bar G_2(s)}{2\bar G_2(s) + \bar G_1(s)} = \frac{3}{s[2\bar J_1(s) + \bar J_2(s)]}$,

(7) $\bar G(s) = \frac{Q_1(s)}{2P_1(s)} = \frac{1}{2}s\bar G_1(s) = \frac{1}{2s\bar J_1(s)}$,

(8) $\bar K(s) = \frac{Q_2(s)}{3P_2(s)} = \frac{1}{3}s\bar G_2(s) = \frac{1}{3s\bar J_2(s)}$,

(9) $\bar\nu(s) = \frac{P_1(s)Q_2(s) - Q_1(s)P_2(s)}{Q_1(s)P_2(s) + 2P_1(s)Q_2(s)} = \frac{\bar G_2(s) - \bar G_1(s)}{2\bar G_2(s) + \bar G_1(s)} = \frac{\bar J_1(s) - \bar J_2(s)}{2\bar J_1(s) + \bar J_2(s)}$,

(10) $\bar\lambda(s) = \bar K(s) - \frac{2}{3}\bar G(s) = \frac{1}{3}s[\bar G_2(s) - \bar G_1(s)]$.

Fig. 15.5:1. Boussinesq problem over a viscoelastic material.
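The equivalence of the three expressions for the transformed Poisson's ratio in Eq. (9) can be spot-checked numerically. The sketch below assumes, purely for illustration, a material that is elastic in dilatation and of Voigt type in shear, with $P_1 = 1$, $Q_1 = 2\eta s + 2G$, $P_2 = 1$, $Q_2 = 3K$; the transformed relaxation and creep functions are taken from Eqs. (15.2:11) and (15.2:14). All names and numbers are hypothetical.

```python
def nu_from_operators(p1, q1, p2, q2):
    """Transformed Poisson's ratio from the operator values P1, Q1, P2, Q2 at s."""
    return (p1 * q2 - q1 * p2) / (q1 * p2 + 2.0 * p1 * q2)


def nu_from_relaxation(g1_bar, g2_bar):
    """Same quantity from the transformed relaxation functions."""
    return (g2_bar - g1_bar) / (2.0 * g2_bar + g1_bar)


def nu_from_creep(j1_bar, j2_bar):
    """Same quantity from the transformed creep functions."""
    return (j1_bar - j2_bar) / (2.0 * j1_bar + j2_bar)


G, ETA, K = 3.0, 0.4, 5.0                 # assumed shear modulus, viscosity, bulk modulus
s = 1.7 + 0.9j                            # sample point in the half-plane of convergence

p1, q1 = 1.0, 2.0 * ETA * s + 2.0 * G     # deviatoric operators
p2, q2 = 1.0, 3.0 * K                     # dilatational operators
g1_bar, g2_bar = q1 / (s * p1), q2 / (s * p2)
j1_bar, j2_bar = p1 / (s * q1), p2 / (s * q2)

nu = nu_from_operators(p1, q1, p2, q2)
print(nu, nu_from_relaxation(g1_bar, g2_bar), nu_from_creep(j1_bar, j2_bar))
```

For this assumed material, $1 - 2\bar\nu(s)$ reduces to $3(\eta s + G)/(3K + \eta s + G)$, which is the combination that enters the Boussinesq example.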


Example 1 . Boussinesq Problem 

Consider the problem of a concentrated load $Z$ on a half-space
(Fig. 15.5:1). The elastic solution is given in Sec. 8.10. For example, the
stress component $\sigma_{rr}$ is given by Eq. (8.10:7) as

(11) $\sigma_{rr}(r, z) = \frac{Z}{2\pi}\left\{ (1 - 2\nu)\left[ \frac{1}{r^2} - \frac{z}{r^2(r^2 + z^2)^{1/2}} \right] - \frac{3r^2 z}{(r^2 + z^2)^{5/2}} \right\}$.

On applying the correspondence principle, the solution for $\bar\sigma_{rr}$ is

(12) $\bar\sigma_{rr}(r, z; s) = \frac{\bar Z(s)}{2\pi}\left\{ [1 - 2\bar\nu(s)]\left[ \frac{1}{r^2} - \frac{z}{r^2(r^2 + z^2)^{1/2}} \right] - \frac{3r^2 z}{(r^2 + z^2)^{5/2}} \right\}$,

where $\bar\nu(s)$ is given by Eq. (9). An inverse Laplace transformation gives,
therefore,

(13) $\sigma_{rr}(r, z; t) = \frac{1}{2\pi}\left\{ \left[ \frac{1}{r^2} - \frac{z}{r^2(r^2 + z^2)^{1/2}} \right] \int_0^t Z(t - \tau)\Phi(\tau)\,d\tau - \frac{3r^2 z}{(r^2 + z^2)^{5/2}}\,Z(t) \right\}$,

where

(14) $\Phi(t) = \frac{1}{2\pi i}\int_{c - i\infty}^{c + i\infty} [1 - 2\bar\nu(s)]\,e^{st}\,ds$

is the inverse Laplace transformation of $1 - 2\bar\nu(s)$, the constant $c$ being any
number greater than the abscissa of convergence.

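Problem 15.9. A concentrated load $Z(t) = Z_0 1(t)$ ($Z_0$ a constant, $1(t)$
a unit-step function) acts normal to the free surface of a half-space of Voigt
material, for which

$\sigma'_{ij} = (2\eta D + 2G)e'_{ij}$, $\quad \sigma_{jj} = 3Ke_{jj}$.

Show that

$1 - 2\bar\nu(s) = \frac{3(\eta s + G)}{3K + \eta s + G}$

and

$\sigma_{rr} = \frac{Z_0}{2\pi}\left\{ \frac{3}{3K + G}\left[ G + 3K\exp\left( -\frac{3K + G}{\eta}\,t \right) \right]\left[ \frac{1}{r^2} - \frac{z}{r^2(r^2 + z^2)^{1/2}} \right] - \frac{3r^2 z}{(r^2 + z^2)^{5/2}} \right\} 1(t)$.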
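The closed form stated in Problem 15.9 can be checked against two limits of the elastic Boussinesq formula (15.5:11). The sketch below is an illustration with assumed parameter values: at large $t$ the Voigt solid behaves like an elastic solid with $\nu = (3K - 2G)/[2(3K + G)]$, and at $t = 0+$ the dashpot makes the material rigid in shear, which corresponds formally to $\nu = -1$. The function names are hypothetical.

```python
import math


def shape_terms(r, z):
    """The two spatial factors that appear in the Boussinesq sigma_rr."""
    rad = math.sqrt(r * r + z * z)
    term_a = 1.0 / (r * r) - z / (r * r * rad)
    term_b = 3.0 * r * r * z / rad ** 5
    return term_a, term_b


def sigma_rr_voigt(r, z, t, z0, g, k, eta):
    """sigma_rr for a step load z0*1(t) on a Voigt half-space (closed form)."""
    if t < 0.0:
        return 0.0
    term_a, term_b = shape_terms(r, z)
    decay = math.exp(-(3.0 * k + g) * t / eta)
    factor = 3.0 * (g + 3.0 * k * decay) / (3.0 * k + g)
    return z0 / (2.0 * math.pi) * (factor * term_a - term_b)


def sigma_rr_elastic(r, z, z0, nu):
    """Elastic value of sigma_rr for Poisson's ratio nu."""
    term_a, term_b = shape_terms(r, z)
    return z0 / (2.0 * math.pi) * ((1.0 - 2.0 * nu) * term_a - term_b)


G, K, ETA, Z0 = 3.0, 5.0, 0.4, 1.0        # assumed material constants and load
r, z = 1.0, 0.5
nu_long = (3.0 * K - 2.0 * G) / (2.0 * (3.0 * K + G))

late = sigma_rr_voigt(r, z, 1.0e3, Z0, G, K, ETA)
early = sigma_rr_voigt(r, z, 0.0, Z0, G, K, ETA)
print(late, sigma_rr_elastic(r, z, Z0, nu_long))
print(early, sigma_rr_elastic(r, z, Z0, -1.0))
```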


Problem 15.10. Consider a long, thick-walled circular cylinder subjected
to a uniform internal pressure $p_i$ and uniform external pressure $p_o$
(Fig. P15.10). Regarding this as a plane strain problem, show that the stress
distribution in the cylinder is the same whether the cylinder material is linear
elastic or linear viscoelastic. Derive the history of deformation $u_i(t)$ if the cylinder
is viscoelastic.

Fig. P15.10.

Note: In the theory of elasticity, the biharmonic equation governing the Airy
stress function in plane stress or plane strain problems is independent of the elastic
constants if the body force vanishes. Note, however, that if the boundary
conditions of a two-dimensional problem involve displacements, then the associated
elastic problem will involve elastic moduli and the correspondence principle
applies.


15.6. RECIPROCITY RELATIONS 

In the beginning of Sec. 1.1, we derived the Maxwell and Betti-Rayleigh 
reciprocity relations. These relations can be generalized to an elastic or 
viscoelastic continuum and, with proper interpretations, they are valid not 
only for statics but also for dynamics. 

The most general reciprocal theorem in dynamics was asserted by
Horace Lamb^{15.3} to be derivable from a remarkable formula established
by Lagrange in the Mécanique Analytique (1809) by way of a prelude to
Lagrange's theory of the variation of arbitrary constants. Lamb showed
how the reciprocal theorems of von Helmholtz, in the theory of least action
in acoustics and optics, and of Lord Rayleigh, in acoustics, can be derived
from Lagrange's formula. Rayleigh^{15.3} extended the reciprocal theorem to
include the action of dissipative forces, and Lamb^{15.3} showed the complete
reciprocity relationships in a moving fluid with reversed-flow conditions.

In the theory of elasticity, a generalization of the reciprocity theorem
to dynamic problems was given by Graffi,^{15.3} and certain applications of
Graffi's results to the problem of elastic wave propagation were pointed out
by Di Maggio and Bleich.^{15.3}

We shall derive the dynamic reciprocity relationship for a viscoelastic
solid, which includes the linear elastic solid as a special case. We assume
the stress-strain law to be given by one of the equations (15.3:4). The
equations of motion and boundary conditions are given by Eqs. (15.3:3) and
(15.3:8) or (15.3:9), respectively. Let us consider the problems in which the
body force $X_i(x_1, x_2, x_3; t)$, the specified surface tractions $f_i(x_1, x_2, x_3; t)$
and the specified displacements $g_i(x_1, x_2, x_3; t)$ are given functions of space
and time, which start their action at $t > 0$, under the initial conditions

(1) $u_i = \frac{\partial u_i}{\partial t} = \cdots = \frac{\partial^N u_i}{\partial t^N} = 0$ when $t < 0$,

where $N$ is either unity or equal to the highest derivative that occurs in
the stress-strain law (4c). We apply the Laplace transform with respect to
the time $t$ to every dependent variable, assuming that the transforms exist.
Let the Laplace transform of $u$ be written as $\bar u$,

(2) $\bar u = \int_0^\infty e^{-st}u(t)\,dt$.

Similarly, a bar over any other variable denotes the Laplace transform of that
variable. Then, we have, due to the initial conditions indicated above,

(3) $s^2\rho\bar u_i = \bar X_i + \bar\sigma_{ij,j}$ in $V$,

$\quad\;\; \bar\sigma_{ij} = \bar\lambda(s)\,\bar u_{k,k}\delta_{ij} + \bar G(s)(\bar u_{i,j} + \bar u_{j,i})$ in $V$,

$\quad\;\; \bar\sigma_{ij}\nu_j = \bar f_i$ on $S_\sigma$,

$\quad\;\; \bar u_i = \bar g_i$ on $S_u$,

where $\bar\lambda(s)$ and $\bar G(s)$ are given by Eqs. (15.5:10) and (15.5:7), respectively.

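Now consider two problems where the applied body force and the surface
tractions and displacements are specified differently. Let the variables
involved in these two problems be distinguished by superscripts in parentheses.
Then,

(4a) In $V$: $\quad s^2\rho\bar u_i^{(1)} = \bar X_i^{(1)} + \bar\sigma_{ij,j}^{(1)}$,

(4b) $\qquad\quad s^2\rho\bar u_i^{(2)} = \bar X_i^{(2)} + \bar\sigma_{ij,j}^{(2)}$,

(5a) On $S_\sigma$: $\quad \bar\sigma_{ij}^{(1)}\nu_j = \bar f_i^{(1)}$,

(5b) $\qquad\quad\; \bar\sigma_{ij}^{(2)}\nu_j = \bar f_i^{(2)}$,

(6a) On $S_u$: $\quad \bar u_i^{(1)} = \bar g_i^{(1)}$,

(6b) $\qquad\quad\; \bar u_i^{(2)} = \bar g_i^{(2)}$.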

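Multiplying Eq. (4a) by $\bar u_i^{(2)}$ and Eq. (4b) by $\bar u_i^{(1)}$, subtracting, and integrating
over the region $V$, we obtain

(7) $\int_V \bar X_i^{(1)}\bar u_i^{(2)}\,dv + \int_V \bar\sigma_{ij,j}^{(1)}\bar u_i^{(2)}\,dv = \int_V \bar X_i^{(2)}\bar u_i^{(1)}\,dv + \int_V \bar\sigma_{ij,j}^{(2)}\bar u_i^{(1)}\,dv$.

By the stress-strain law in Eq. (3) and Gauss' theorem,

(8) $\int_V \bar\sigma_{ij,j}^{(1)}\bar u_i^{(2)}\,dv = \int_V \left[ (\bar\lambda\bar u_{k,k}^{(1)}\delta_{ij})_{,j}\,\bar u_i^{(2)} + (\bar G\bar u_{i,j}^{(1)} + \bar G\bar u_{j,i}^{(1)})_{,j}\,\bar u_i^{(2)} \right] dv$

$\quad = \int_S \bar\lambda\bar u_{k,k}^{(1)}\bar u_i^{(2)}\nu_i\,dS - \int_V \bar\lambda\bar u_{k,k}^{(1)}\bar u_{i,i}^{(2)}\,dv$

$\quad + \int_S \bar G\bar u_{i,j}^{(1)}\bar u_i^{(2)}\nu_j\,dS + \int_S \bar G\bar u_{j,i}^{(1)}\bar u_i^{(2)}\nu_j\,dS$

$\quad - \int_V \bar G\bar u_{i,j}^{(1)}\bar u_{i,j}^{(2)}\,dv - \int_V \bar G\bar u_{j,i}^{(1)}\bar u_{i,j}^{(2)}\,dv$.

A similar expression is obtained for the integral

(9) $\int_V \bar\sigma_{ij,j}^{(2)}\bar u_i^{(1)}\,dv$.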

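When these expressions are substituted into Eq. (7), we see that a number
of volume integrals cancel each other. The surface integrals contributed by
the integrals (8) and (9) to Eq. (7) are

(10) $\int_S \bar\lambda\bar u_{k,k}^{(1)}\bar u_i^{(2)}\nu_i\,dS + \int_S \bar G\bar u_{i,j}^{(1)}\bar u_i^{(2)}\nu_j\,dS + \int_S \bar G\bar u_{j,i}^{(1)}\bar u_i^{(2)}\nu_j\,dS$

$\quad - \left\{ \int_S \bar\lambda\bar u_{k,k}^{(2)}\bar u_i^{(1)}\nu_i\,dS + \int_S \bar G\bar u_{i,j}^{(2)}\bar u_i^{(1)}\nu_j\,dS + \int_S \bar G\bar u_{j,i}^{(2)}\bar u_i^{(1)}\nu_j\,dS \right\}$

$\quad = \int_S \bar\sigma_{ij}^{(1)}\bar u_i^{(2)}\nu_j\,dS - \int_S \bar\sigma_{ij}^{(2)}\bar u_i^{(1)}\nu_j\,dS$.

Recall that $S_\sigma$ is the part of $S$ over which surface tractions are specified,
$S_u$ is the part of $S$ over which displacements are specified, $S = S_\sigma + S_u$.
Now, substituting Eqs. (8), (9), (10) and the boundary conditions (5a),
(5b), (6a), (6b) into Eq. (7), we obtain finally:

(11) $\int_V \bar X_i^{(1)}\bar u_i^{(2)}\,dv + \int_{S_\sigma} \bar f_i^{(1)}\bar u_i^{(2)}\,dS + \int_{S_u} \bar\sigma_{ij}^{(1)}\bar g_i^{(2)}\nu_j\,dS$

$\quad = \int_V \bar X_i^{(2)}\bar u_i^{(1)}\,dv + \int_{S_\sigma} \bar f_i^{(2)}\bar u_i^{(1)}\,dS + \int_{S_u} \bar\sigma_{ij}^{(2)}\bar g_i^{(1)}\nu_j\,dS$.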

This is the general reciprocal relation in the Laplace transformation form.
If we remove the bars and consider the variables to be in the real-time
domain, then it becomes Betti's reciprocal relation in elastostatics.

Since the inverse transform of the product of two functions is the convolution
of the inverses, we obtain:

(12) $\int_V \int_0^t X_i^{(1)}(x, t - \tau)\,u_i^{(2)}(x, \tau)\,d\tau\,dv$

$\quad + \int_{S_\sigma} \int_0^t f_i^{(1)}(x, t - \tau)\,u_i^{(2)}(x, \tau)\,d\tau\,dS$

$\quad + \int_{S_u} \int_0^t \sigma_{ij}^{(1)}(x, t - \tau)\,g_i^{(2)}(x, \tau)\,\nu_j\,d\tau\,dS$

$\quad = \int_V \int_0^t X_i^{(2)}(x, t - \tau)\,u_i^{(1)}(x, \tau)\,d\tau\,dv$

$\quad + \int_{S_\sigma} \int_0^t f_i^{(2)}(x, t - \tau)\,u_i^{(1)}(x, \tau)\,d\tau\,dS$

$\quad + \int_{S_u} \int_0^t \sigma_{ij}^{(2)}(x, t - \tau)\,g_i^{(1)}(x, \tau)\,\nu_j\,d\tau\,dS$.

This is the general reciprocal relation for elasto-kinetics. Whether the material
is viscoelastic or purely elastic makes no difference in the final result.
Note that this result holds for variable density $\rho(x_1, x_2, x_3)$ and nonhomogeneous
material properties.

Generalization to Infinite Region. A generalization of the above result
to an infinite or semi-infinite region is possible, since with a finite wave
speed there always exists a finite boundary surface, at any finite $t > 0$,
which is yet uninfluenced by the loading initiated at $t = 0$. Let $S_u$ be such
a surface. Then $g_i^{(1)} = g_i^{(2)} = 0$ on $S_u$, and the remainder of the equation holds
without question.



Examples of Applications 

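(a) Space-time-separable body forces, surface tractions and displacements. If

$X_i^{(1)} = \Xi_i^{(1)}(x)h(t)$, $\quad X_i^{(2)} = \Xi_i^{(2)}(x)h(t)$,

$f_i^{(1)} = P_i^{(1)}(x)h(t)$, $\quad f_i^{(2)} = P_i^{(2)}(x)h(t)$,

$g_i^{(1)} = W_i^{(1)}(x)h(t)$, $\quad g_i^{(2)} = W_i^{(2)}(x)h(t)$,

then Eq. (11) can be written, on cancelling $\bar h(s)$ from every term, as

$\int_V \Xi_i^{(1)}\bar u_i^{(2)}\,dv + \int_{S_\sigma} P_i^{(1)}\bar u_i^{(2)}\,dS + \int_{S_u} W_i^{(2)}\bar\sigma_{ij}^{(1)}\nu_j\,dS$

$\quad = \int_V \Xi_i^{(2)}\bar u_i^{(1)}\,dv + \int_{S_\sigma} P_i^{(2)}\bar u_i^{(1)}\,dS + \int_{S_u} W_i^{(1)}\bar\sigma_{ij}^{(2)}\nu_j\,dS$.

The inverse transformation gives

$\int_V \Xi_i^{(1)}u_i^{(2)}(x, t)\,dv + \int_{S_\sigma} P_i^{(1)}u_i^{(2)}(x, t)\,dS + \int_{S_u} \sigma_{ij}^{(1)}(x, t)\,W_i^{(2)}\nu_j\,dS$

$\quad = \int_V \Xi_i^{(2)}u_i^{(1)}(x, t)\,dv + \int_{S_\sigma} P_i^{(2)}u_i^{(1)}(x, t)\,dS + \int_{S_u} \sigma_{ij}^{(2)}(x, t)\,W_i^{(1)}\nu_j\,dS$.

Graffi's well-known formula results if $g_i^{(1)} = g_i^{(2)} = 0$ on $S_u$.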

(b) Forces applied at different times. If

$X_i^{(1)} = \Xi_i^{(1)}(x)h(t - T_1)$, $\quad X_i^{(2)} = \Xi_i^{(2)}(x)h(t - T_2)$,

$f_i^{(1)} = P_i^{(1)}(x)h(t - T_1)$, $\quad f_i^{(2)} = P_i^{(2)}(x)h(t - T_2)$,

$g_i^{(1)} = 0$, $\quad g_i^{(2)} = 0$,

where $h(t) = 0$ for $t < 0$, then on substituting the above into Eq. (11)
and taking the inverse Laplace transform, one obtains

$\int_V \Xi_i^{(1)}(x)\,u_i^{(2)}(x, t - T_1)\,dv + \int_{S_\sigma} P_i^{(1)}(x)\,u_i^{(2)}(x, t - T_1)\,dS$

$\quad = \int_V \Xi_i^{(2)}(x)\,u_i^{(1)}(x, t - T_2)\,dv + \int_{S_\sigma} P_i^{(2)}(x)\,u_i^{(1)}(x, t - T_2)\,dS$.

(c) Concentrated forces. If the loading consists of concentrated loads $F_i^{(1)}$
and $F_i^{(2)}$ acting at points $p_1$, $p_2$ respectively, we may consider $\Xi_i$ or $P_i$
of the Cases (a) and (b) above as suitable delta functions and obtain
at once

$F_i^{(1)}(p_1)\,u_i^{(2)}(p_1, t - T_1) = F_i^{(2)}(p_2)\,u_i^{(1)}(p_2, t - T_2),$

or, if $T_1 = T_2$,

$F_i^{(1)}(p_1)\,u_i^{(2)}(p_1, t) = F_i^{(2)}(p_2)\,u_i^{(1)}(p_2, t).$


This is the extension of the conventional elastostatic Betti-Rayleigh
reciprocal relation to kinetics.
(d) Impulsive and traveling concentrated forces. Let an impulsive
concentrated force act at a point $p_1$,

$X_i^{(1)} = F_i^{(1)}\,\delta(p_1)\,\delta(t),$

where $\delta(t)$ is a unit-impulse or delta function, and let a concentrated
force be applied at the origin at t = 0, and thereafter moved along
the $x_1$ axis at uniform speed U:

$X_i^{(2)} = F_i^{(2)}\,\delta\!\left(t - \frac{x_1}{U}\right)\delta(x_2)\,\delta(x_3).$

No other surface loading or displacement is imposed. Then Eq. (12)
gives

$F_i^{(1)}\,u_i^{(2)}(p_1, t) = \iiint F_i^{(2)}\,\delta(x_2)\,\delta(x_3)\,dx_1\,dx_2\,dx_3 \int_0^t \delta\!\left(t - \frac{x_1}{U} - \tau\right) u_i^{(1)}(x_1, x_2, x_3, \tau)\,d\tau$

and therefore,

$F_i^{(1)}\,u_i^{(2)}(p_1, t) = F_i^{(2)} \int_0^{Ut} u_i^{(1)}\!\left(x_1, 0, 0, t - \frac{x_1}{U}\right) dx_1 .$

If $u_i^{(1)}(x_1, 0, 0, t - x_1/U)$ is known, then $u_i^{(2)}(p_1, t)$ can be found from
the above equation.

(e) Suddenly started line load over an elastic half-space. Ang$^{9.2}$ considered
the problem of a suddenly started line load acting on the surface of a
half-space. Now, according to the reciprocal theorem, Ang's problem
can be solved by one integration of the solution of Lamb's problem: the
impulsive loading at one point (not traveling) inside a two-dimensional
half-space. Only the surface displacement due to the point loading
needs to be known.
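
The concentrated-force relation of case (c), with $T_1 = T_2$, can be checked numerically on a discrete model. The following is only a minimal sketch under stated assumptions: the two-degree-of-freedom spring-mass chain with unit masses, the stiffness values, the impulse magnitudes and the helper name `impulse_response` are all hypothetical and are not taken from the text. Each load system is an impulse applied at one node at t = 0, and the two mutual work terms are compared at the same later time.

```python
# Discrete check of the reciprocal relation for concentrated impulsive loads.
# The 2-DOF spring-mass chain (unit masses) and all numbers are assumptions.

def impulse_response(K, impulse, dt, n_steps):
    """Central-difference solution of u'' + K u = 0 with u(0) = 0 and
    u'(0) = impulse, i.e. the response to an impulsive load at t = 0."""
    n = len(impulse)
    u_prev = [0.0] * n
    u_curr = [dt * p for p in impulse]          # first step from the velocity jump
    for _ in range(n_steps - 1):
        Ku = [sum(K[i][j] * u_curr[j] for j in range(n)) for i in range(n)]
        u_next = [2.0 * u_curr[i] - u_prev[i] - dt * dt * Ku[i] for i in range(n)]
        u_prev, u_curr = u_curr, u_next
    return u_curr                               # displacement at t = n_steps * dt

K = [[8.0, -5.0],
     [-5.0, 5.0]]                               # symmetric stiffness (assumed)
F1 = [2.0, 0.0]                                 # system 1: impulse at node 0
F2 = [0.0, 7.0]                                 # system 2: impulse at node 1

u1 = impulse_response(K, F1, dt=0.01, n_steps=300)
u2 = impulse_response(K, F2, dt=0.01, n_steps=300)

work_12 = sum(f * u for f, u in zip(F1, u2))    # F^(1) acting through u^(2)
work_21 = sum(f * u for f, u in zip(F2, u1))    # F^(2) acting through u^(1)
print(work_12, work_21)
```

The two printed numbers agree to round-off because the stiffness matrix is symmetric; an unsymmetric K would break the equality.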


Problem 15.11. The equation of transverse motion of a string stretched
between two points is

$\frac{\partial^2 w}{\partial x^2} - \frac{\partial^2 w}{\partial t^2} = F(x,t), \qquad 0 < x < L.$

Let $F^{(1)}(x,t)$ be a dynamic loading corresponding to a solution
$w^{(1)}(x,t)$; $F^{(2)}(x,t)$ be a dynamic loading corresponding to a solution $w^{(2)}(x,t)$.
Generalize the dynamic reciprocity relationship to the present problem by
integrating over the length 0 < x < L as well as over the time (0, t).
Next consider the specific case in which $F^{(2)}(x,t)$ is an impulsive concentrated
load acting at x: $F^{(2)}(x,t) = \delta(x)\,\delta(t)$; and $F^{(1)}(x,t)$ is a traveling load $F^{(1)}(x,t) =
\delta(x - Vt)$, where V is a constant. Show that $w^{(1)}(x,t)$ can be derived by the
reciprocity relation from the solution $w^{(2)}(x,t)$ corresponding to $F^{(2)}(x,t)$.



16 


LARGE DEFORMATION 


In Chapter 4, we discussed briefly the different strain measures for large 
deformation. In this chapter we shall examine further the consequences of 
the finiteness of strain fields. The generalization from infinitesimal strains 
to the nonlinear field theory of finite strains opens up a tremendous vista 
of the theory. The nonlinear field theory associated with finite strain is 
difficult and extensive. In this book, we cover only the theory needed for 
the development of numerical solutions of three-dimensional elastic solid 
bodies including plates and shells. In this chapter, the strain measures are 
developed further, the stresses conjugate to various finite strain measures 
are defined, the equations of motion are derived, some constitutive equations
are discussed, and the variational principles for large deformation are
presented. 

16.1. COORDINATE SYSTEMS AND TENSOR NOTATION 

In this chapter, rectangular Cartesian coordinates in three dimensions
will be employed to formulate the large deformation theory unless otherwise
specified. Occasionally we use orthogonal curvilinear coordinates such as
cylindrical and spherical coordinates in examples. In that case, it will be
clearly specified. We shall use a combination of the matrix and indicial
notations. Normally, the Roman indices have a range from 1 to 3, and the
Greek indices from 1 to 2. Boldface characters represent vectors, tensors or
matrices. To simplify the discussion in this section only, a lower-case
boldface character denotes a vector (a tensor of rank 1) and an upper-case
boldface character a tensor of rank 2, unless specifically noted otherwise.
Vectors and tensors may be expressed in component form:

(1) $\mathbf{u} = u_I\,\boldsymbol{\varepsilon}_I , \qquad \mathbf{A} = A_{IJ}\,\boldsymbol{\varepsilon}_I \boldsymbol{\varepsilon}_J ,$

in which the vector u and the second rank tensor A in 3-dimensions are
decomposed into components measured with respect to the unit base vectors
$\boldsymbol{\varepsilon}_I$ of a reference Cartesian system of coordinates. If the reference frame is
transformed to another Cartesian coordinate system with unit base vectors
$\mathbf{e}_i$, the vector and the tensor may be written as

(2) $\mathbf{v} = v_i\,\mathbf{e}_i , \qquad \mathbf{B} = B_{ij}\,\mathbf{e}_i \mathbf{e}_j ,$

to show their decomposition with respect to the new frame of reference.
One may view $\mathbf{e}_i \mathbf{e}_j$ as a tensor product of two vectors resulting in a tensor
of rank 2. In general, $\mathbf{e}_i \mathbf{e}_j \ne \mathbf{e}_j \mathbf{e}_i$ if $i \ne j$, i.e., the tensor product is not
commutative. The unit base vectors of the two systems are related by

(3) $\boldsymbol{\varepsilon}_I = R_{Ij}\,\mathbf{e}_j ,$

in which the determinant of the matrix $(R_{Ij})$, denoted as det(R), is equal
to 1.

In cylindrical coordinates $(r, \theta, z)$, a vector v can be referred to a set of
orthogonal unit base vectors $(\mathbf{e}_r, \mathbf{e}_\theta, \mathbf{e}_z)$ in the form

(4) $\mathbf{v} = v_r\,\mathbf{e}_r + v_\theta\,\mathbf{e}_\theta + v_z\,\mathbf{e}_z .$

Here $(v_r, v_\theta, v_z)$ are the physical, not tensorial, components of v (see
Sec. 2.14) because $(\mathbf{e}_r, \mathbf{e}_\theta, \mathbf{e}_z)$ are unit vectors. The base vectors of cylindrical
coordinates are related to the base vectors $(\mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3)$ of the Cartesian
frame of reference through

$\mathbf{e}_r = \mathbf{e}_1\cos\theta + \mathbf{e}_2\sin\theta, \qquad \mathbf{e}_\theta = -\mathbf{e}_1\sin\theta + \mathbf{e}_2\cos\theta, \qquad \mathbf{e}_z = \mathbf{e}_3 .$

According to the notation of Eq. (3), we have

$R_{r1} = R_{\theta 2} = \cos\theta, \qquad R_{r2} = -R_{\theta 1} = \sin\theta,$

$R_{r3} = R_{\theta 3} = R_{z1} = R_{z2} = 0, \qquad R_{z3} = 1 .$

The coordinates of the two systems are related by

$x_1 = x = r\cos\theta, \qquad x_2 = y = r\sin\theta, \qquad x_3 = z .$

In spherical coordinates, the unit base vectors $(\mathbf{e}_r, \mathbf{e}_\phi, \mathbf{e}_\theta)$ are related to
the base vectors $(\mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3)$ of the Cartesian frame by:

$\mathbf{e}_r = (\mathbf{e}_1\cos\theta + \mathbf{e}_2\sin\theta)\sin\phi + \mathbf{e}_3\cos\phi ,$

$\mathbf{e}_\phi = (\mathbf{e}_1\cos\theta + \mathbf{e}_2\sin\theta)\cos\phi - \mathbf{e}_3\sin\phi ,$

$\mathbf{e}_\theta = -\mathbf{e}_1\sin\theta + \mathbf{e}_2\cos\theta .$

The coordinates $(r, \phi, \theta)$ and $(x_1, x_2, x_3)$ are related by

$x_1 = x = r\sin\phi\cos\theta, \qquad x_2 = y = r\sin\phi\sin\theta, \qquad x_3 = z = r\cos\phi .$

In Cartesian coordinates, the physical and tensorial components are the
same. Similarly one can find the decomposition of tensors in cylindrical
and spherical coordinates. The components will be physical rather than
tensorial, since the base vectors are unit vectors.
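
The statement that the transformation matrix of Eq. (3) has unit determinant can be verified directly for the cylindrical and spherical base vectors given above. The sketch below is illustrative only; the function names and the sample angles are assumptions. Each row of the returned matrix holds the Cartesian components of one curvilinear unit base vector, so the matrix plays the role of $R_{Ij}$.

```python
import math

def cylindrical_basis(theta):
    """Rows: e_r, e_theta, e_z in Cartesian components."""
    st, ct = math.sin(theta), math.cos(theta)
    return [[ct, st, 0.0],
            [-st, ct, 0.0],
            [0.0, 0.0, 1.0]]

def spherical_basis(phi, theta):
    """Rows: e_r, e_phi, e_theta in Cartesian components (phi measured from x_3)."""
    sp, cp = math.sin(phi), math.cos(phi)
    st, ct = math.sin(theta), math.cos(theta)
    return [[ct * sp, st * sp, cp],
            [ct * cp, st * cp, -sp],
            [-st, ct, 0.0]]

def det3(M):
    return (M[0][0] * (M[1][1] * M[2][2] - M[1][2] * M[2][1])
            - M[0][1] * (M[1][0] * M[2][2] - M[1][2] * M[2][0])
            + M[0][2] * (M[1][0] * M[2][1] - M[1][1] * M[2][0]))

def gram(M):
    """Dot products between the rows of M; the identity for an orthonormal set."""
    return [[sum(M[i][k] * M[j][k] for k in range(3)) for j in range(3)]
            for i in range(3)]

R_cyl = cylindrical_basis(1.9)          # sample angle in radians (assumed)
R_sph = spherical_basis(0.7, 1.9)       # sample angles in radians (assumed)
print(det3(R_cyl), det3(R_sph))
```

Both determinants come out equal to 1 to round-off, and `gram` returns the identity, which confirms that each set of base vectors is orthonormal and right-handed in the order listed.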

The two indices of A may be referred to different coordinate systems,
e.g., $\mathbf{A} = A_{iJ}\,\mathbf{e}_i \boldsymbol{\varepsilon}_J$. In this case, A is called a mixed or two-point tensor
of rank 2 as opposed to a uniform tensor for which all indices refer to
the same coordinate frame. One may similarly define mixed tensors of
higher rank, for which each index of a tensor is associated with a different
coordinate system.

We summarize the basic vector and tensor notations and operations 
as follows: 

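• If $\mathbf{B}\,(= B_{ij}\,\mathbf{e}_i\mathbf{e}_j)$ is a tensor of rank 2, then $\mathbf{B}^T\,(= B_{ji}\,\mathbf{e}_i\mathbf{e}_j)$ is the transpose
of B. If $B_{ij} = B_{ji}$, $\mathbf{B}^T = \mathbf{B}$ and B is symmetric. If $\mathbf{B}\,(= B_{iJ}\,\mathbf{e}_i\boldsymbol{\varepsilon}_J)$ is a
two-point tensor, then $\mathbf{B}^T\,(= B_{iJ}\,\boldsymbol{\varepsilon}_J\mathbf{e}_i)$.

• If $\mathbf{A}\,(= A_{IJ}\,\boldsymbol{\varepsilon}_I\boldsymbol{\varepsilon}_J)$ and $\mathbf{B}\,(= B_{ij}\,\mathbf{e}_i\mathbf{e}_j)$ are tensors of rank 2, then $\mathbf{AB}\,(=
A_{IJ}B_{ij}\,\boldsymbol{\varepsilon}_I\boldsymbol{\varepsilon}_J\mathbf{e}_i\mathbf{e}_j)$ denotes a tensor of rank 4.

• $\mathbf{e}_i\cdot\mathbf{e}_j = \delta_{ij}$ is the dot product of two orthogonal unit base vectors, where
$\delta_{ij}$ is the Kronecker delta.

• With $\mathbf{u} = u_I\boldsymbol{\varepsilon}_I$, $\mathbf{v} = v_j\mathbf{e}_j$, and $\boldsymbol{\varepsilon}_I = R_{Ij}\mathbf{e}_j$, the dot product of the
two vectors u and v, which involves the dot products of base vectors, is
$\mathbf{u}\cdot\mathbf{v} = u_I v_j(\boldsymbol{\varepsilon}_I\cdot\mathbf{e}_j) = u_I R_{Ij} v_j$. Note that $\mathbf{u}\cdot\mathbf{v} = \mathbf{v}\cdot\mathbf{u}$. If $\boldsymbol{\varepsilon}_I = \mathbf{e}_i$ for
I = i, one has $R_{Ij} = \delta_{Ij}$, then $\mathbf{u}\cdot\mathbf{v} = u_j v_j$. For $\mathbf{u} = \mathbf{e}_i$, one has $\mathbf{e}_i\cdot\mathbf{v} = v_i$,
which is called the projection or component of v along $\mathbf{e}_i$.

• If $\mathbf{B} = B_{ij}\mathbf{e}_i\mathbf{e}_j$ and $\mathbf{v} = v_i\mathbf{e}_i$, then the dot product of the vector and the
second-rank tensor is a vector defined by $\mathbf{B}\cdot\mathbf{v} = B_{ik}v_k\mathbf{e}_i$ and $\mathbf{v}\cdot\mathbf{B} =
v_k B_{ki}\mathbf{e}_i$. Thus, $\mathbf{B}\cdot\mathbf{v}$ and $\mathbf{v}\cdot\mathbf{B}$ are vectors with components $B_{ik}v_k$
and $v_k B_{ki}$, respectively. In general, $B_{ik}v_k \ne v_k B_{ki}$, i.e. $\mathbf{B}\cdot\mathbf{v} \ne \mathbf{v}\cdot\mathbf{B}$.
One can similarly obtain the dot products of vectors and tensors whose
components refer to different coordinate systems.

• If $\mathbf{A} = A_{IJ}\boldsymbol{\varepsilon}_I\boldsymbol{\varepsilon}_J$, $\mathbf{B} = B_{ij}\mathbf{e}_i\mathbf{e}_j$, and $\boldsymbol{\varepsilon}_I = R_{Ij}\mathbf{e}_j$, then the dot product of
the two second rank tensors is defined as $\mathbf{A}\cdot\mathbf{B} = A_{IJ}B_{ij}\,\boldsymbol{\varepsilon}_I(\boldsymbol{\varepsilon}_J\cdot\mathbf{e}_i)\mathbf{e}_j =
A_{IJ}R_{Ji}B_{ij}\,\boldsymbol{\varepsilon}_I\mathbf{e}_j$. $\mathbf{A}\cdot\mathbf{B}$ is a tensor of rank 2. In general, $\mathbf{A}\cdot\mathbf{B} \ne \mathbf{B}\cdot\mathbf{A}$.
Let $\mathbf{C} = \mathbf{A}\cdot\mathbf{B} = C_{Ij}\,\boldsymbol{\varepsilon}_I\mathbf{e}_j$, where $C_{Ij} = A_{IJ}R_{Ji}B_{ij}$. C is a two-point
tensor whose first index refers to a coordinate frame with base vectors $\boldsymbol{\varepsilon}_I$
and the second index refers to $\mathbf{e}_j$. If $\boldsymbol{\varepsilon}_J = \mathbf{e}_i$ for J = i, one has $R_{Ji} = \delta_{Ji}$
and $\mathbf{C} = \mathbf{A}\cdot\mathbf{B} = A_{ik}B_{kj}\,\mathbf{e}_i\mathbf{e}_j$. Then C is a uniform tensor of rank 2. One
can generalize the dot product for two tensors of arbitrary rank: $\mathbf{A}\cdot\mathbf{B}$
involves the contraction of the last index of A and the first index of B.
The resulting product $\mathbf{A}\cdot\mathbf{B}$ is a tensor of rank $r_A + r_B - 2$, where $r_A$,
$r_B$ are the ranks of A and B, respectively. Note that a vector is a tensor
of rank 1 and a scalar is a zero rank tensor.

• If A is an arbitrary tensor of rank 2, then $\mathbf{A}^2 = \mathbf{A}\cdot\mathbf{A}$.

• The trace of a uniform tensor A of rank 2 is a scalar defined as $\mathrm{tr}(\mathbf{A}) =
A_{ii}$.

• The inner product of two tensors A, B of rank 2 is a scalar defined as
$\mathbf{A} : \mathbf{B} = \mathrm{tr}(\mathbf{A}\cdot\mathbf{B}^T)$. For example, if $\mathbf{A} = A_{ij}\mathbf{e}_i\mathbf{e}_j$, $\mathbf{B} = B_{ij}\mathbf{e}_i\mathbf{e}_j$ are
two uniform tensors of the same reference frame, then $\mathbf{A} : \mathbf{B} = A_{ij}B_{ij}$.
If $\mathbf{A} = A_{IJ}\boldsymbol{\varepsilon}_I\boldsymbol{\varepsilon}_J$, $\mathbf{B} = B_{ij}\mathbf{e}_i\mathbf{e}_j$, then $\mathbf{A} : \mathbf{B} = A_{IJ}B_{ij}(\boldsymbol{\varepsilon}_I\cdot\mathbf{e}_i)(\boldsymbol{\varepsilon}_J\cdot\mathbf{e}_j) =
A_{IJ}R_{Ii}R_{Jj}B_{ij}$. If $\mathbf{A} = A_{iJ}\mathbf{e}_i\boldsymbol{\varepsilon}_J$ and $\mathbf{B} = B_{iJ}\mathbf{e}_i\boldsymbol{\varepsilon}_J$, then $\mathbf{A} : \mathbf{B} = A_{iJ}B_{iJ}$.
Since $\mathrm{tr}(\mathbf{A}\cdot\mathbf{B}^T) = \mathrm{tr}(\mathbf{A}^T\cdot\mathbf{B})$, one has $\mathbf{A} : \mathbf{B} = \mathbf{B} : \mathbf{A}$ for second rank
tensors.

• If A and B are tensors of unequal rank, then the inner product $\mathbf{A} : \mathbf{B}$
involves the contraction of the last index pair of A and the first index
pair of B, which results in a tensor of rank $r_A + r_B - 4$. For example,
$\mathbf{A} : \mathbf{B} = A_{IJ}B_{IJkl}\,\mathbf{e}_k\mathbf{e}_l = \mathbf{C}$ for $\mathbf{A} = A_{IJ}\boldsymbol{\varepsilon}_I\boldsymbol{\varepsilon}_J$, $\mathbf{B} = B_{IJkl}\,\boldsymbol{\varepsilon}_I\boldsymbol{\varepsilon}_J\mathbf{e}_k\mathbf{e}_l$,
where $\mathbf{C} = C_{kl}\mathbf{e}_k\mathbf{e}_l$, $C_{kl} = A_{IJ}B_{IJkl}$, $r_A = r_C = 2$ and $r_B = 4$. Note
that $\mathbf{B} : \mathbf{A} = B_{IJij}A_{KL}R_{Ki}R_{Lj}\,\boldsymbol{\varepsilon}_I\boldsymbol{\varepsilon}_J$. In general $\mathbf{A} : \mathbf{B} \ne \mathbf{B} : \mathbf{A}$.

• If $\mathbf{C} = C_{IJij}\,\boldsymbol{\varepsilon}_I\boldsymbol{\varepsilon}_J\mathbf{e}_i\mathbf{e}_j$ is a mixed tensor of rank 4, and $\mathbf{A}\,(= A_{IJ}\boldsymbol{\varepsilon}_I\boldsymbol{\varepsilon}_J)$
and $\mathbf{B}\,(= B_{ij}\mathbf{e}_i\mathbf{e}_j)$ are tensors of rank 2, then the inner product $\mathbf{A} : \mathbf{C} :
\mathbf{B} = A_{IJ}C_{IJij}B_{ij}$ is a scalar.

• For base vectors $\mathbf{e}_i$ of a rectangular Cartesian frame, the cross product
$\mathbf{e}_j\times\mathbf{e}_k = e_{ijk}\mathbf{e}_i$ is a vector, where $e_{ijk}$ is the permutation tensor of rank 3.
If Eq. (3) holds, then $\boldsymbol{\varepsilon}_J\times\mathbf{e}_k = R_{Jj}\,\mathbf{e}_j\times\mathbf{e}_k = R_{Jj}e_{ijk}\mathbf{e}_i$. One can easily
determine $\mathbf{u}\times\mathbf{v}$ for u and v defined in Eqs. (1) and (2). If $\boldsymbol{\varepsilon}_I = \mathbf{e}_i$ for
I = i, then $\mathbf{u}\times\mathbf{v} = e_{ijk}u_jv_k\mathbf{e}_i$. Note that $(\mathbf{u}\times\mathbf{v})\times\mathbf{r} = (\mathbf{r}\cdot\mathbf{u})\mathbf{v} - (\mathbf{r}\cdot\mathbf{v})\mathbf{u}$.

• If $\mathbf{u} = u_I\boldsymbol{\varepsilon}_I$, $\mathbf{v} = v_j\mathbf{e}_j$, then $\frac{\partial\mathbf{u}}{\partial\mathbf{v}} = \frac{\partial u_I}{\partial v_j}\,\boldsymbol{\varepsilon}_I\mathbf{e}_j$ is a second rank tensor.

• If $\mathbf{A} = A_{IJ}\boldsymbol{\varepsilon}_I\boldsymbol{\varepsilon}_J$, $\mathbf{B} = B_{ij}\mathbf{e}_i\mathbf{e}_j$, then $\frac{\partial\mathbf{A}}{\partial\mathbf{B}} = \frac{\partial A_{IJ}}{\partial B_{ij}}\,\boldsymbol{\varepsilon}_I\boldsymbol{\varepsilon}_J\mathbf{e}_i\mathbf{e}_j$ is a fourth rank
tensor.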

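• The del operator is

$\nabla = \mathbf{e}_i\,\frac{\partial}{\partial x_i}$ (Cartesian coordinates)

$\;\;= \mathbf{e}_r\,\frac{\partial}{\partial r} + \frac{\mathbf{e}_\theta}{r}\,\frac{\partial}{\partial\theta} + \mathbf{e}_z\,\frac{\partial}{\partial z}$ (cylindrical coordinates)

$\;\;= \mathbf{e}_r\,\frac{\partial}{\partial r} + \frac{\mathbf{e}_\theta}{r\sin\phi}\,\frac{\partial}{\partial\theta} + \frac{\mathbf{e}_\phi}{r}\,\frac{\partial}{\partial\phi}$ (spherical coordinates).

• The gradient of a vector u is defined as

$\nabla\mathbf{u} = \frac{\partial u_J}{\partial x_i}\,\boldsymbol{\varepsilon}_J\mathbf{e}_i = \frac{\partial\mathbf{u}}{\partial\mathbf{x}} ,$

which is a tensor of rank 2. In cylindrical coordinates, the gradient of u
becomes

$\nabla\mathbf{u} = \frac{\partial\mathbf{u}}{\partial r}\,\mathbf{e}_r + \frac{1}{r}\frac{\partial\mathbf{u}}{\partial\theta}\,\mathbf{e}_\theta + \frac{\partial\mathbf{u}}{\partial z}\,\mathbf{e}_z .$

In computing the partial derivatives of u, one must take into account
that the base vectors of the reference frame for u can be functions of r,
$\theta$ and z. For example, expressing u in the form $\mathbf{u} = u_r\mathbf{e}_r + u_\theta\mathbf{e}_\theta + u_z\mathbf{e}_z$
where $\mathbf{e}_r$ and $\mathbf{e}_\theta$ are functions of $\theta$, one has

$\frac{\partial\mathbf{u}}{\partial\theta} = \frac{\partial u_r}{\partial\theta}\,\mathbf{e}_r + u_r\frac{\partial\mathbf{e}_r}{\partial\theta} + \frac{\partial u_\theta}{\partial\theta}\,\mathbf{e}_\theta + u_\theta\frac{\partial\mathbf{e}_\theta}{\partial\theta} + \frac{\partial u_z}{\partial\theta}\,\mathbf{e}_z$

$\quad\;\; = \left(\frac{\partial u_r}{\partial\theta} - u_\theta\right)\mathbf{e}_r + \left(\frac{\partial u_\theta}{\partial\theta} + u_r\right)\mathbf{e}_\theta + \frac{\partial u_z}{\partial\theta}\,\mathbf{e}_z .$

In spherical coordinates, the gradient of u is

$\nabla\mathbf{u} = \frac{\partial\mathbf{u}}{\partial r}\,\mathbf{e}_r + \frac{1}{r\sin\phi}\frac{\partial\mathbf{u}}{\partial\theta}\,\mathbf{e}_\theta + \frac{1}{r}\frac{\partial\mathbf{u}}{\partial\phi}\,\mathbf{e}_\phi .$

• The gradient of a tensor is similarly defined as $\nabla\mathbf{A} = \frac{\partial\mathbf{A}}{\partial x_i}\,\mathbf{e}_i$, which is a
tensor of rank n + 1 and n is the rank of A. For cylindrical and spherical
coordinates, one simply replaces $\frac{\partial}{\partial x_i}\mathbf{e}_i$ with the appropriate del operator.
In Cartesian coordinates, $\nabla\mathbf{A} = \frac{\partial A_{ij}}{\partial x_k}\,\mathbf{e}_i\mathbf{e}_j\mathbf{e}_k$ for a second rank tensor A.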

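• The divergence of a vector u is defined as $\nabla\cdot\mathbf{u} = \mathrm{tr}(\nabla\mathbf{u})$ and is given by

$\nabla\cdot\mathbf{u} = \mathbf{e}_i\cdot\frac{\partial\mathbf{u}}{\partial x_i} = \frac{\partial u_i}{\partial x_i}$

in Cartesian coordinates,

$\nabla\cdot\mathbf{u} = \mathbf{e}_r\cdot\frac{\partial\mathbf{u}}{\partial r} + \frac{\mathbf{e}_\theta}{r}\cdot\frac{\partial\mathbf{u}}{\partial\theta} + \mathbf{e}_z\cdot\frac{\partial\mathbf{u}}{\partial z} = \frac{1}{r}\frac{\partial (r u_r)}{\partial r} + \frac{1}{r}\frac{\partial u_\theta}{\partial\theta} + \frac{\partial u_z}{\partial z}$

in cylindrical coordinates, and

$\nabla\cdot\mathbf{u} = \mathbf{e}_r\cdot\frac{\partial\mathbf{u}}{\partial r} + \frac{\mathbf{e}_\theta}{r\sin\phi}\cdot\frac{\partial\mathbf{u}}{\partial\theta} + \frac{\mathbf{e}_\phi}{r}\cdot\frac{\partial\mathbf{u}}{\partial\phi}$

$\qquad\; = \frac{1}{r^2}\frac{\partial (r^2 u_r)}{\partial r} + \frac{1}{r\sin\phi}\frac{\partial u_\theta}{\partial\theta} + \frac{1}{r\sin\phi}\frac{\partial (\sin\phi\,u_\phi)}{\partial\phi}$

in spherical coordinates. The divergence of a vector is a scalar.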
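• If B is a tensor of rank 2 or higher, the divergence of B is

$\nabla\cdot\mathbf{B} = \mathbf{e}_i\cdot\frac{\partial\mathbf{B}}{\partial x_i} .$

The dot operation by the base vectors must be executed from the left
as defined. Note that $\frac{\partial\mathbf{B}}{\partial x_i}\cdot\mathbf{e}_i \ne \mathbf{e}_i\cdot\frac{\partial\mathbf{B}}{\partial x_i}$. However, if B is a vector, no
distinction is needed. For B of rank 2, $\nabla\cdot\mathbf{B} = \frac{\partial B_{ij}}{\partial x_i}\,\mathbf{e}_j$.

• The curl of a vector is defined as

$\nabla\times\mathbf{u} = \mathbf{e}_i\times\frac{\partial\mathbf{u}}{\partial x_i}$ (Cartesian coordinates)

$\qquad\;\; = \mathbf{e}_r\times\frac{\partial\mathbf{u}}{\partial r} + \mathbf{e}_\theta\times\frac{1}{r}\frac{\partial\mathbf{u}}{\partial\theta} + \mathbf{e}_z\times\frac{\partial\mathbf{u}}{\partial z}$ (cylindrical coordinates)

$\qquad\;\; = \mathbf{e}_r\times\frac{\partial\mathbf{u}}{\partial r} + \mathbf{e}_\theta\times\frac{1}{r\sin\phi}\frac{\partial\mathbf{u}}{\partial\theta} + \mathbf{e}_\phi\times\frac{1}{r}\frac{\partial\mathbf{u}}{\partial\phi}$ (spherical coordinates).

Note that $\mathbf{e}_i\times\mathbf{e}_j = e_{ijk}\mathbf{e}_k$.

• The Laplace operation is defined as $\nabla^2\mathbf{v} = \nabla\cdot(\nabla\mathbf{v}) = \mathbf{e}_i\cdot\frac{\partial(\nabla\mathbf{v})}{\partial x_i}$. One
has $\nabla^2\mathbf{v} = \frac{\partial^2\mathbf{v}}{\partial x_i\,\partial x_i}$ in Cartesian coordinates. One can similarly
determine the Laplace operation in other coordinates.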


We shall close the section by identifying some frequently used relations 
of the inner product and properties of matrix operations. The inner product 
of stress and strain rate is the rate at which work is done by the forces acting 
on the surface of a unit cube as the body deforms. It is related to the rate 
at which the internal energy of the material changes by the first law of 
thermodynamics. Hence the inner product defined earlier is a frequently 
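used operation. The following are true:

$\mathbf{A} : \mathbf{B} = \mathbf{B} : \mathbf{A} = \mathbf{A}^T : \mathbf{B}^T ,$

$(\mathbf{A}\cdot\mathbf{B}) : \mathbf{C} = \mathbf{C} : (\mathbf{A}\cdot\mathbf{B}) = (\mathbf{B}\cdot\mathbf{C}^T) : \mathbf{A}^T = (\mathbf{C}\cdot\mathbf{B}^T) : \mathbf{A}$
$\qquad\qquad\quad\; = (\mathbf{A}^T\cdot\mathbf{C}) : \mathbf{B} = (\mathbf{C}^T\cdot\mathbf{A}) : \mathbf{B}^T = \mathbf{B} : (\mathbf{A}^T\cdot\mathbf{C})$
$\qquad\qquad\quad\; = \mathbf{A} : (\mathbf{C}\cdot\mathbf{B}^T) ,$

for any second rank tensors A, B and C. If S is a symmetric tensor, then

$\mathbf{S} : \mathbf{B} = \mathbf{S} : \mathbf{B}^T = \mathbf{S} : (\mathbf{B} + \mathbf{B}^T)/2 = \mathbf{S} : (\mathbf{B})_s , \qquad \mathbf{S} : (\mathbf{B})_a = 0 ,$

where $(\mathbf{B})_s = (\mathbf{B} + \mathbf{B}^T)/2$ and $(\mathbf{B})_a = (\mathbf{B} - \mathbf{B}^T)/2$ are the symmetric and
skew-symmetric parts of B, respectively. If A is skew-symmetric, then

$\mathbf{A} : \mathbf{B} = -\mathbf{A}^T : \mathbf{B} = -\mathbf{A} : \mathbf{B}^T = \mathbf{A} : (\mathbf{B} - \mathbf{B}^T)/2 = \mathbf{A} : (\mathbf{B})_a , \qquad \mathbf{A} : (\mathbf{B})_s = 0 .$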
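
The inner-product identities above are easy to confirm numerically for uniform tensors referred to a single Cartesian frame. The following sketch is illustrative only; the helper names and the sample matrices are assumptions. It evaluates $\mathbf{A} : \mathbf{B}$ as $\mathrm{tr}(\mathbf{A}\cdot\mathbf{B}^T)$ and checks one of the cyclic identities together with the symmetric and skew-symmetric projections.

```python
def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def transpose(A):
    return [[A[j][i] for j in range(3)] for i in range(3)]

def ddot(A, B):
    """Inner product A : B = tr(A . B^T) = A_ij B_ij."""
    P = matmul(A, transpose(B))
    return P[0][0] + P[1][1] + P[2][2]

def sym(B):
    return [[0.5 * (B[i][j] + B[j][i]) for j in range(3)] for i in range(3)]

def skew(B):
    return [[0.5 * (B[i][j] - B[j][i]) for j in range(3)] for i in range(3)]

# Sample tensors (arbitrary values chosen for illustration).
A = [[1.0, 2.0, 0.0], [-1.0, 3.0, 4.0], [2.0, 0.0, 1.0]]
B = [[0.0, 1.0, -2.0], [3.0, 1.0, 0.0], [1.0, -1.0, 2.0]]
C = [[2.0, 0.0, 1.0], [1.0, -1.0, 0.0], [0.0, 3.0, 1.0]]
S = sym(A)      # a symmetric tensor
W = skew(A)     # a skew-symmetric tensor

lhs = ddot(matmul(A, B), C)                 # (A . B) : C
rhs = ddot(A, matmul(C, transpose(B)))      # A : (C . B^T)
print(ddot(A, B), lhs, rhs, ddot(S, skew(B)), ddot(W, sym(B)))
```

The last two printed values vanish, reflecting the fact that a symmetric tensor only "sees" the symmetric part of its partner and a skew-symmetric tensor only the skew-symmetric part.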
From time to time, we will use matrix notations and follow the rules of
matrix operations for scalars, vectors, and tensors of rank 2. In this case, the
vectors and tensors are referred to the same orthogonal coordinate system.
The vector (column matrix) u and tensor A defined in Eq. (1) can be
written in matrix notation as

(6) $\mathbf{u} = \begin{bmatrix} u_1 \\ u_2 \\ u_3 \end{bmatrix} = [u_i] , \qquad \mathbf{A} = \begin{bmatrix} A_{11} & A_{12} & A_{13} \\ A_{21} & A_{22} & A_{23} \\ A_{31} & A_{32} & A_{33} \end{bmatrix} .$

The vector u as defined in the equation above is called a column vector. In
matrix notation, the two subscript indices of A can have different ranges.
For example, if the range of the first index is from 1 to 3, and the second from
1 to 4, we write

$\underset{3\times 4}{\mathbf{A}} = \begin{bmatrix} A_{11} & A_{12} & A_{13} & A_{14} \\ A_{21} & A_{22} & A_{23} & A_{24} \\ A_{31} & A_{32} & A_{33} & A_{34} \end{bmatrix} ,$

where the first and second numbers in the underscore of A denote the
ranges of the first and second indices, respectively. In this case, A is a
3 x 4 matrix, i.e., a matrix with 3 rows and 4 columns. Note that a column
vector of rank p is a "p x 1" matrix and a row vector of rank p is a "1 x p"
matrix (row matrix). When the two ranges of the indices are equal, A is
called a square matrix. For simplicity, we drop the underscore unless it is
needed for clarity. We implicitly assume the ranges of indices are known.
The transpose of a column vector u is a row vector, i.e.,

$\mathbf{u}^T = [\,u_1 \;\; u_2 \;\; u_3\,] ,$

which is different from the tensor notation where the transpose of a vector
is the vector itself. The transpose of A is $\mathbf{A}^T$ and is defined as follows: if
$\mathbf{C} = \mathbf{A}^T$, then the components $C_{ij} = A_{ji}$. If $A_{ij} = A_{ji}$, A is symmetric.

If u is a column vector and A is a matrix, their products are defined as
follows:

$\mathbf{A}\mathbf{u} = [A_{ij}u_j] , \qquad \mathbf{u}^T\mathbf{A} = [u_iA_{ij}] ,$

where repeated indices denote summation over appropriate ranges. In the
first case, the number of columns of A must be the same as the number
of rows of u. In the second case, the number of rows of A must be the
same as the number of rows of u. Note that the components of $\mathbf{A}\mathbf{u}$ (matrix
multiplication) equal those of $\mathbf{A}\cdot\mathbf{u}$ (inner product) and those of $\mathbf{u}^T\mathbf{A}$ equal
$\mathbf{u}\cdot\mathbf{A}$ in the tensor expression. If $\underset{m\times n}{\mathbf{A}}$, $\underset{n\times p}{\mathbf{B}}$ are two nonsquare matrices with
their ranges of indices indicated by the underscores, their product is

$\underset{m\times p}{\mathbf{C}} = \underset{m\times n}{\mathbf{A}}\;\underset{n\times p}{\mathbf{B}} = [A_{ij}B_{jk}] ,$

which is the same as $\mathbf{A}\cdot\mathbf{B}$ in tensor notation. Note that the number of
columns of A must be the same as the number of rows of B. Finally, if A is
a square matrix, the inverse of A is $\mathbf{A}^{-1}$ such that

$\mathbf{A}\mathbf{A}^{-1} = \mathbf{A}^{-1}\mathbf{A} = \mathbf{I} ,$

where I is an identity matrix whose diagonal terms are all 1 and the
off-diagonal terms are zero.


16.2. DEFORMATION GRADIENT 

Consider a body which deforms from a volume V in the original or
undeformed configuration $R_0$ into a volume v in the current or deformed
configuration R. Let x denote the position of a point in R associated with
a material point originally located at X in $R_0$. Then we have

$\mathbf{x} = x_i\,\mathbf{e}_i , \qquad \mathbf{X} = X_I\,\boldsymbol{\varepsilon}_I ,$

where $\mathbf{e}_i$ and $\boldsymbol{\varepsilon}_I$ are the unit base-vectors in R and $R_0$, respectively. The
former, $\mathbf{e}_i$, is called the Eulerian base-vector and the latter, $\boldsymbol{\varepsilon}_I$, the Lagrangian
base-vector. The base vectors may translate in the Cartesian frame, but
are not allowed to rotate. As a rule, we use capital indices referring to the
undeformed coordinates and the lower case indices for the deformed
coordinates. We assume the mechanics to be Newtonian. The motion of the
body is described by the relation

(1) $\mathbf{x} = \mathbf{x}(\mathbf{X}, t) ,$

where t is time and x is continuously differentiable with respect to X and t.
To find the transformation between line elements in R and $R_0$, we consider
a material line element dX at X, which becomes dx after deformation. The
relation between the two elements is then

(2) $d\mathbf{x} = \nabla_X\mathbf{x}\cdot d\mathbf{X} = \mathbf{F}\cdot d\mathbf{X} ,$

where $\nabla_X$ denotes the gradient operation with respect to the X coordinates,
and F is called the deformation gradient tensor defined as

(3) ▲ $\mathbf{F} = \frac{\partial\mathbf{x}}{\partial\mathbf{X}} = F_{iI}\,\mathbf{e}_i\boldsymbol{\varepsilon}_I , \quad\text{or}\quad F_{iI} = \partial x_i/\partial X_I .$

The first index of $F_{iI}$ refers to an Eulerian and the second index to a
Lagrangian basis. F is a second-rank tensor related to the Eulerian and the
Lagrangian coordinates, and is therefore a two-point tensor. Equation (2)
can be inverted to give

(4) $d\mathbf{X} = \mathbf{F}^{-1}\cdot d\mathbf{x} ,$

provided that the Jacobian $J = \det[\mathbf{F}(\mathbf{X}, t)]$ does not vanish, i.e., $J \ne 0$.
The Jacobian is also commonly denoted in the alternative forms

(5) $J = \det[\mathbf{F}] = \det\left[\frac{\partial x_i}{\partial X_I}\right] = e_{ijk}\,\frac{\partial x_i}{\partial X_1}\frac{\partial x_j}{\partial X_2}\frac{\partial x_k}{\partial X_3} .$

The inverse of F can be written as

(6) $\mathbf{F}^{-1} = \frac{\partial\mathbf{X}}{\partial\mathbf{x}} , \quad\text{or}\quad (\mathbf{F}^{-1})_{Ii} = \frac{\partial X_I}{\partial x_i} .$
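
As a numerical illustration of Eqs. (3) to (6), the deformation gradient can be approximated by differencing a given motion, after which the Jacobian and the inverse mapping of line elements follow directly. The sketch below rests on assumptions that are not part of the text: the homogeneous motion in `motion` (a stretch combined with a simple shear), the step size, and all helper names are hypothetical.

```python
# Numerical sketch of F, J and the inversion dX = F^-1 . dx.
# The motion below is a hypothetical homogeneous deformation (stretch + shear).

def motion(X):
    """Current position x of the material point X (assumed example)."""
    X1, X2, X3 = X
    return [1.2 * X1 + 0.5 * X2, 0.8 * X2, 1.1 * X3]

def deformation_gradient(x_of_X, X, h=1e-5):
    """F[i][I] = d x_i / d X_I, approximated by central differences."""
    F = [[0.0] * 3 for _ in range(3)]
    for I in range(3):
        Xp, Xm = list(X), list(X)
        Xp[I] += h
        Xm[I] -= h
        xp, xm = x_of_X(Xp), x_of_X(Xm)
        for i in range(3):
            F[i][I] = (xp[i] - xm[i]) / (2.0 * h)
    return F

def det3(M):
    return (M[0][0] * (M[1][1] * M[2][2] - M[1][2] * M[2][1])
            - M[0][1] * (M[1][0] * M[2][2] - M[1][2] * M[2][0])
            + M[0][2] * (M[1][0] * M[2][1] - M[1][1] * M[2][0]))

def inv3(M):
    """Inverse of a 3x3 matrix from its cofactors (requires det != 0)."""
    d = det3(M)
    cof = [[M[(i + 1) % 3][(j + 1) % 3] * M[(i + 2) % 3][(j + 2) % 3]
            - M[(i + 1) % 3][(j + 2) % 3] * M[(i + 2) % 3][(j + 1) % 3]
            for j in range(3)] for i in range(3)]
    return [[cof[j][i] / d for j in range(3)] for i in range(3)]

def matvec(M, v):
    return [sum(M[i][j] * v[j] for j in range(3)) for i in range(3)]

F = deformation_gradient(motion, [0.3, -0.2, 0.5])
J = det3(F)                               # Jacobian, here 1.2 * 0.8 * 1.1
dX = [1.0e-3, 2.0e-3, -1.0e-3]            # material line element
dx = matvec(F, dX)                        # deformed element, dx = F . dX
dX_back = matvec(inv3(F), dx)             # recovered element, dX = F^-1 . dx
print(J, dX_back)
```

Because the assumed motion is homogeneous, F is the same at every material point; for a general motion the same routine would simply be evaluated at each X of interest.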


Stretches of Line, Area, and Volume. We shall show how line, area,
and volume elements in R are related to their counterparts in $R_0$ through F.
First, the well-known Cauchy and Almansi tensors C and $\mathbf{B}^{-1}$, respectively,
are defined as

(7) ▲ $\mathbf{C} = \mathbf{F}^T\cdot\mathbf{F} , \quad\text{or}\quad C_{IJ} = \frac{\partial x_i}{\partial X_I}\frac{\partial x_i}{\partial X_J} ,$

(8) ▲ $\mathbf{B}^{-1} = (\mathbf{F}^{-1})^T\cdot\mathbf{F}^{-1} , \quad\text{or}\quad B^{-1}_{ij} = \frac{\partial X_K}{\partial x_i}\frac{\partial X_K}{\partial x_j} .$

They both are symmetric and positive definite. Clearly C is a Lagrangian
based tensor because the Eulerian indices i have been contracted; whereas
$\mathbf{B}^{-1}$ is an Eulerian based tensor as a result of the contraction of the
Lagrangian indices K.

Now we can establish the stretching of line elements under deformation.
Consider an infinitesimal line element $d\mathbf{x}\,(= \mathbf{s}\,ds)$ at $(\mathbf{x}, t)$ in R and its
corresponding line element $d\mathbf{X}\,(= \mathbf{S}\,dS)$ at $(\mathbf{X}, t)$ before deformation, where
dS and ds are the lengths of the line elements before and after the
deformation, and s and S are unit vectors along the directions of dx and
dX. ds/dS is called the stretch ratio of the element. Denoting it by $\lambda_s$, we
can show that

(9) ▲ $\lambda_s = \frac{ds}{dS} = \frac{\sqrt{d\mathbf{X}\cdot\mathbf{F}^T\cdot\mathbf{F}\cdot d\mathbf{X}}}{|d\mathbf{X}|} = \sqrt{\mathbf{S}\cdot\mathbf{C}\cdot\mathbf{S}} ,$

for $dS \to 0$. We can also write

(10) $\lambda_s = \frac{ds}{dS} = \frac{|d\mathbf{x}|}{\sqrt{d\mathbf{x}\cdot\mathbf{B}^{-1}\cdot d\mathbf{x}}} = \frac{1}{\sqrt{\mathbf{s}\cdot\mathbf{B}^{-1}\cdot\mathbf{s}}} .$
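
The two expressions (9) and (10) for the stretch ratio can be compared on a simple case. The sketch below assumes a simple shear of amount k (an example chosen for illustration, with its inverse written down in closed form) and a material direction S along the second coordinate axis; the variable names are hypothetical.

```python
import math

k = 0.5                                        # amount of shear (assumed)
F = [[1.0, k, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
F_inv = [[1.0, -k, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]   # exact inverse of F

def matvec(M, v):
    return [sum(M[i][j] * v[j] for j in range(3)) for i in range(3)]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

S = [0.0, 1.0, 0.0]                            # unit vector along dX
FS = matvec(F, S)                              # image of S under F
lam_lagrangian = math.sqrt(dot(FS, FS))        # sqrt(S . C . S), with C = F^T . F

s = [c / math.sqrt(dot(FS, FS)) for c in FS]   # unit vector along dx
Fis = matvec(F_inv, s)
lam_eulerian = 1.0 / math.sqrt(dot(Fis, Fis))  # 1 / sqrt(s . B^-1 . s)

print(lam_lagrangian, lam_eulerian)
```

Both values equal $\sqrt{1 + k^2}$, the stretch of a fibre initially perpendicular to the direction of shear.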


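Next, consider the differential geometry of a surface in the region R.
Let $d\mathbf{x}^{(1)}$ and $d\mathbf{x}^{(2)}$ be two infinitesimal noncollinear line elements lying
in the deformed surface and let $d\mathbf{X}^{(1)}$ and $d\mathbf{X}^{(2)}$ be the corresponding
line segments in the undeformed configuration. The cross product $d\mathbf{x}^{(1)}\times
d\mathbf{x}^{(2)}/|d\mathbf{x}^{(1)}\times d\mathbf{x}^{(2)}|$ is a unit vector normal to $d\mathbf{x}^{(1)}$ and $d\mathbf{x}^{(2)}$. We define

$d\mathbf{a} = d\mathbf{x}^{(1)}\times d\mathbf{x}^{(2)} = \mathbf{n}\,da , \quad\text{or}\quad n_i\,da = e_{ijk}\,dx_j^{(1)}dx_k^{(2)} ,$

so that da is the area of an element in the deformed configuration with n
being a unit normal to the plane of $d\mathbf{x}^{(1)}$ and $d\mathbf{x}^{(2)}$. The corresponding
area element in the undeformed configuration is

(11) $d\mathbf{A} = d\mathbf{X}^{(1)}\times d\mathbf{X}^{(2)} = \mathbf{N}\,dA , \quad\text{or}\quad N_I\,dA = e_{IJK}\,dX_J^{(1)}dX_K^{(2)} ,$

where N is a unit normal to $d\mathbf{X}^{(1)}$ and $d\mathbf{X}^{(2)}$.

We shall establish the relation between da and dA. Expressing the dX's in
terms of the dx's yields

$N_I\,dA = e_{IJK}\,\frac{\partial X_J}{\partial x_j}\frac{\partial X_K}{\partial x_k}\,dx_j^{(1)}dx_k^{(2)} .$

Multiplying both sides of Eq. (11) by $J\mathbf{F}^{-1}$ (or $J\,\partial X_I/\partial x_i$) and simplifying
the result by making use of Eq. (5) and the definition of determinant

$e_{ijk}\,\det\left[\frac{\partial X_L}{\partial x_m}\right] = e_{IJK}\,\frac{\partial X_I}{\partial x_i}\frac{\partial X_J}{\partial x_j}\frac{\partial X_K}{\partial x_k} ,$

we obtain

(12) $J(\mathbf{F}^{-1})^T\cdot d\mathbf{A} = d\mathbf{a}$ or

$J\,\frac{\partial X_M}{\partial x_i}\,N_M\,dA = J\,e_{LMN}\,\frac{\partial X_L}{\partial x_i}\frac{\partial X_M}{\partial x_j}\frac{\partial X_N}{\partial x_k}\,dx_j^{(1)}dx_k^{(2)} = n_i\,da .$

The details of the proof are left as an exercise. Using Eq. (7) yields

$|d\mathbf{a}| = J\sqrt{d\mathbf{A}\cdot\mathbf{F}^{-1}\cdot(\mathbf{F}^{-1})^T\cdot d\mathbf{A}} = J\sqrt{d\mathbf{A}\cdot\mathbf{C}^{-1}\cdot d\mathbf{A}} .$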
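The area stretch ratio is then

(13) ▲ $\lambda_N = \frac{|d\mathbf{a}|}{|d\mathbf{A}|} = \frac{J\sqrt{d\mathbf{A}\cdot\mathbf{C}^{-1}\cdot d\mathbf{A}}}{|d\mathbf{A}|} = J\sqrt{\mathbf{N}\cdot\mathbf{C}^{-1}\cdot\mathbf{N}} .$

Similarly

(14) ▲ $\lambda_n = \frac{|d\mathbf{a}|}{|d\mathbf{A}|} = \frac{J\,|d\mathbf{a}|}{\sqrt{d\mathbf{a}\cdot\mathbf{B}\cdot d\mathbf{a}}} = \frac{J}{\sqrt{\mathbf{n}\cdot\mathbf{B}\cdot\mathbf{n}}} .$

Note that $\lambda_N = \lambda_n$. From Eqs. (12)-(14), one obtains

$\mathbf{n} = \frac{|d\mathbf{A}|}{|d\mathbf{a}|}\,\frac{J(\mathbf{F}^{-1})^T\cdot d\mathbf{A}}{|d\mathbf{A}|} = (\mathbf{F}^{-1})^T\cdot\mathbf{N}\,\sqrt{\mathbf{n}\cdot\mathbf{B}\cdot\mathbf{n}} = \frac{(\mathbf{F}^{-1})^T\cdot\mathbf{N}}{\sqrt{\mathbf{N}\cdot\mathbf{C}^{-1}\cdot\mathbf{N}}} ,$

which is the relation between the normal to da and that to dA.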

Finally, we shall determine the relation between the deformed and
undeformed volumes. The volume of the element tetrahedron dv formed by
the infinitesimal line elements $d\mathbf{x}^{(1)}$, $d\mathbf{x}^{(2)}$ and $d\mathbf{x}^{(3)}$ in R is

(15) $dv = \frac{1}{6}\,(d\mathbf{x}^{(1)}\times d\mathbf{x}^{(2)})\cdot d\mathbf{x}^{(3)} .$

It is related to its corresponding undeformed volume dV by

(16) ▲ $\frac{dv}{dV} = J = \det(\mathbf{F}) .$

Details of the proof are left as an exercise. Let $\rho$ and $\rho_0$ be the mass
density per unit deformed and undeformed volumes, respectively. From
conservation of mass, we have

(17) $J = \rho_0/\rho .$
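
Equations (12) and (16) lend themselves to a direct numerical check, which does not replace the proofs left as exercises. The sketch below is an illustration under assumptions: the deformation gradient F, the three material line elements and all helper names are hypothetical. It maps the line elements with F, forms the area vectors and tetrahedron volumes in both configurations, and compares them with $J(\mathbf{F}^{-1})^T\cdot d\mathbf{A}$ and $J\,dV$.

```python
def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def matvec(M, v):
    return [sum(M[i][j] * v[j] for j in range(3)) for i in range(3)]

def transpose(M):
    return [[M[j][i] for j in range(3)] for i in range(3)]

def det3(M):
    return dot(M[0], cross(M[1], M[2]))        # triple product of the rows

def inv3(M):
    """Inverse of a 3x3 matrix from its cofactors (requires det != 0)."""
    d = det3(M)
    cof = [[M[(i + 1) % 3][(j + 1) % 3] * M[(i + 2) % 3][(j + 2) % 3]
            - M[(i + 1) % 3][(j + 2) % 3] * M[(i + 2) % 3][(j + 1) % 3]
            for j in range(3)] for i in range(3)]
    return [[cof[j][i] / d for j in range(3)] for i in range(3)]

# Assumed homogeneous deformation gradient (det F > 0) and line elements.
F = [[1.2, 0.5, 0.0], [0.1, 0.8, 0.3], [0.0, -0.2, 1.1]]
dX1, dX2, dX3 = [1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.2, 0.1, 1.0]
dx1, dx2, dx3 = matvec(F, dX1), matvec(F, dX2), matvec(F, dX3)

J = det3(F)
dA = cross(dX1, dX2)                           # undeformed area vector N dA
da = cross(dx1, dx2)                           # deformed area vector n da
da_formula = [J * c for c in matvec(transpose(inv3(F)), dA)]   # J F^-T . dA

dV = dot(cross(dX1, dX2), dX3) / 6.0           # undeformed tetrahedron volume
dv = dot(cross(dx1, dx2), dx3) / 6.0           # deformed tetrahedron volume
print(da, da_formula, dv / dV, J)
```

For this F the Jacobian is 1.073, and the ratio dv/dV reproduces it to round-off.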


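Problem 16.1. Consider an area element vector $d\mathbf{a}\,[= d\mathbf{x}^{(1)}\times d\mathbf{x}^{(2)}]$ in the
deformed configuration R. The corresponding area vector in the undeformed
configuration is $d\mathbf{A}\,[= d\mathbf{X}^{(1)}\times d\mathbf{X}^{(2)}]$. Show that

(18) $d\mathbf{a} = J\,d\mathbf{A}\cdot\mathbf{F}^{-1} = J(\mathbf{F}^{-1})^T\cdot d\mathbf{A} .$


Problem 16.2. Show that Eq. (16.2:16) holds for the volume ratio of the
deformed to the undeformed configuration. Hint: To determine the
relationship between area elements and between volume elements, one needs to use the
permutation symbols and identities defined in Chapter 2.

$e_{ijk} = (\mathbf{e}_i\times\mathbf{e}_j)\cdot\mathbf{e}_k , \qquad e_{IJK} = (\boldsymbol{\varepsilon}_I\times\boldsymbol{\varepsilon}_J)\cdot\boldsymbol{\varepsilon}_K , \qquad \mathbf{e}_i\times\mathbf{e}_j = e_{ijk}\mathbf{e}_k ,$

$e_{KLN}\,J = e_{KLN}\det(\mathbf{F}) = e_{ijm}F_{iK}F_{jL}F_{mN} ,$

$e_{KLN}\,(\mathbf{F}^{-1})_{Ni}\,dX_K^{(1)}dX_L^{(2)} = (d\mathbf{X}^{(1)}\times d\mathbf{X}^{(2)})\cdot\mathbf{F}^{-1} .$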
16.3. STRAINS

As discussed in Chapter 4, strain is a measure of deformation. In this
section, we shall discuss various strain measures and the relations among
them. Let $ds^2\,(= d\mathbf{x}\cdot d\mathbf{x})$ be the length square of the line element dx in R
and $dS^2\,(= d\mathbf{X}\cdot d\mathbf{X})$ be that of the corresponding undeformed element dX
in $R_0$. They can be expressed in terms of the deformation gradient:

(1) $ds^2 = d\mathbf{x}\cdot d\mathbf{x} = d\mathbf{X}\cdot\mathbf{F}^T\cdot\mathbf{F}\cdot d\mathbf{X} = d\mathbf{X}\cdot\mathbf{C}\cdot d\mathbf{X} = C_{IJ}\,dX_I\,dX_J ,$

(2) $dS^2 = d\mathbf{X}\cdot d\mathbf{X} = d\mathbf{x}\cdot(\mathbf{F}^{-1})^T\cdot\mathbf{F}^{-1}\cdot d\mathbf{x} = d\mathbf{x}\cdot\mathbf{B}^{-1}\cdot d\mathbf{x} = B^{-1}_{ij}\,dx_i\,dx_j .$

If the Lagrangian coordinates are used to describe positions of material
particles in the undeformed body, then on allowing the metric tensor to
become C, the deformed body is obtained. Hence, the deformation is like a
transformation from a Euclidean geometry into a non-Euclidean geometry
with metric tensor C. On the other hand, if the Eulerian coordinates are
used to describe positions in $R_0$, then $\mathbf{B}^{-1}$ is the metric tensor that turns
the deformed body into the undeformed body of non-Euclidean geometry. A
necessary condition (also locally sufficient) for a symmetric positive definite
metric C to be derivable from a configuration x is the vanishing of a
fourth-rank tensor field called the Riemann-Christoffel tensor [see Problems 2.24,
2.31 and 4.9; and Green and Zerna (1968), Eringen (1962)]. The condition
is called the compatibility condition (see Sec. 4.6).

The Green strain tensor E and the Almansi strain tensor e are
defined by

(3) ▲ $ds^2 - dS^2 = 2\,d\mathbf{X}\cdot\mathbf{E}\cdot d\mathbf{X} = 2E_{IJ}\,dX_I\,dX_J ,$

(4) ▲ $ds^2 - dS^2 = 2\,d\mathbf{x}\cdot\mathbf{e}\cdot d\mathbf{x} = 2e_{ij}\,dx_i\,dx_j ,$

where

(5) ▲ $\mathbf{E} = \tfrac{1}{2}(\mathbf{C} - \mathbf{I}) , \qquad E_{IJ} = \tfrac{1}{2}(C_{IJ} - \delta_{IJ}) ,$

like C, is a Lagrangian tensor referring to the undeformed coordinates;
whereas

(6) ▲ $\mathbf{e} = \tfrac{1}{2}(\mathbf{I} - \mathbf{B}^{-1}) , \qquad e_{ij} = \tfrac{1}{2}(\delta_{ij} - B^{-1}_{ij}) ,$

like B, is an Eulerian tensor referring to the deformed coordinates.
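
As a worked illustration of Eqs. (3) to (6), consider simple shear of amount k. This example is an assumption introduced here for illustration and is not taken from the text. With $x_1 = X_1 + kX_2$, $x_2 = X_2$, $x_3 = X_3$, the two strain tensors follow directly from the definitions, and the element $d\mathbf{X} = (0, dS, 0)$ provides a check that both give the same change of squared length.

```latex
% Simple shear of amount k (assumed example): x_1 = X_1 + k X_2, x_2 = X_2, x_3 = X_3
\[
\mathbf{F} = \begin{bmatrix} 1 & k & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{bmatrix}, \qquad
\mathbf{C} = \mathbf{F}^T\cdot\mathbf{F} = \begin{bmatrix} 1 & k & 0 \\ k & 1+k^2 & 0 \\ 0 & 0 & 1 \end{bmatrix}, \qquad
\mathbf{B}^{-1} = (\mathbf{F}^{-1})^T\cdot\mathbf{F}^{-1} = \begin{bmatrix} 1 & -k & 0 \\ -k & 1+k^2 & 0 \\ 0 & 0 & 1 \end{bmatrix}
\]
\[
\mathbf{E} = \tfrac{1}{2}(\mathbf{C}-\mathbf{I}) = \tfrac{1}{2}\begin{bmatrix} 0 & k & 0 \\ k & k^2 & 0 \\ 0 & 0 & 0 \end{bmatrix}, \qquad
\mathbf{e} = \tfrac{1}{2}(\mathbf{I}-\mathbf{B}^{-1}) = \tfrac{1}{2}\begin{bmatrix} 0 & k & 0 \\ k & -k^2 & 0 \\ 0 & 0 & 0 \end{bmatrix}
\]
% Check with dX = (0, dS, 0), which maps to dx = (k dS, dS, 0):
\[
ds^2 - dS^2 = (1+k^2)\,dS^2 - dS^2 = k^2\,dS^2 = 2E_{22}\,dS^2 ,
\]
\[
2e_{ij}\,dx_i\,dx_j = 2\left[\,2\cdot\tfrac{k}{2}\cdot k + \left(-\tfrac{k^2}{2}\right)\right] dS^2 = k^2\,dS^2 .
\]
```

The two measures agree on the change of squared length, as they must, although their components differ at second order in k: $E_{22} = k^2/2$ while $e_{22} = -k^2/2$.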

We shall examine the characteristics of the various deformation
measures. We first consider C of Eq. (16.2:7). Since C is positive definite, its
eigenvalues can be written in the form $\lambda_i^2$, i = 1, 2, 3. We call the directions
of the eigenvectors the principal directions. Consider a point X + dX in $R_0$
which is displaced to $\mathbf{x}(\mathbf{X} + d\mathbf{X}, t)\,[= \mathbf{x}(\mathbf{X}, t) + d\mathbf{x}]$ in R. Using Eqs. (16.2:2)
and (16.2:7), we obtain

$d\mathbf{x}\cdot d\mathbf{x} = d\mathbf{X}\cdot\mathbf{C}\cdot d\mathbf{X} + O(|d\mathbf{X}|^3) .$

If X + dX is a point on a small ellipsoid centered at X with the semi-axes
$r/\lambda_i$ oriented along the principal directions of C, then

$d\mathbf{x}\cdot d\mathbf{x} = d\mathbf{X}\cdot\mathbf{C}\cdot d\mathbf{X} = r^2 ,$

which indicates that the image of an ellipsoid in $R_0$ is approximately a
sphere of radius r in R. Expressing dX in terms of dx leads to

$d\mathbf{X}\cdot d\mathbf{X} = d\mathbf{x}\cdot\mathbf{B}^{-1}\cdot d\mathbf{x} .$

If $d\mathbf{X}\cdot d\mathbf{X} = r^2$, the equation above is an ellipsoid of the principal semi-axes
$r\lambda_i$ for dx, as the eigenvalues of $\mathbf{B}^{-1}$ are $1/\lambda_i^2$. Thus, the image of a sphere
of radius r in $R_0$ is approximately an ellipsoid of the principal semi-axes
$r\lambda_i$ in R. For a line element dX along the principal direction associated with
eigenvalue $\lambda_i$, we have

$dx = \lambda_i\,dX .$

Therefore, $\lambda_i$, i = 1, 2, 3, are called the principal stretches and the ellipsoid
in R is called the strain ellipsoid.

16.4. RIGHT AND LEFT STRETCH AND ROTATION TENSORS 

Another useful strain measure is U defined as

(1) $\mathbf{U} = \mathbf{C}^{1/2} .$

If $\mathbf{N}^i$ is an eigenvector of C with $\lambda_i^2$ being the corresponding eigenvalue,
then $\mathbf{N}^i$ is also the eigenvector of U corresponding to the eigenvalue $\lambda_i$.
Hence C and U are coaxial. Clearly, like C, U is also symmetric and
positive definite, and satisfies

(2) ▲ $\mathbf{U}\cdot\mathbf{U} = \mathbf{C} = \mathbf{F}^T\cdot\mathbf{F} .$

Let

(3) $\mathbf{R} = \mathbf{F}\cdot\mathbf{U}^{-1} .$

We can easily show that

$\mathbf{R}^T\cdot\mathbf{R} = (\mathbf{U}^{-1})^T\cdot\mathbf{F}^T\cdot\mathbf{F}\cdot\mathbf{U}^{-1} = (\mathbf{U}^{-1})\cdot\mathbf{U}\cdot\mathbf{U}\cdot\mathbf{U}^{-1} = \mathbf{I} ;$

therefore, R is a rotation tensor. Equation (3) gives

(4) ▲ $\mathbf{F} = \mathbf{R}\cdot\mathbf{U} ,$

which is called the right polar decomposition of F, and R and U are called
the rotation tensor and the right stretch tensor, respectively.

From Eq. (4), we can interpret the deformation from dX to dx as a
two-stage transformation and write Eq. (16.2:2) as

(5) $d\mathbf{X}^* = \mathbf{U}\cdot d\mathbf{X} ,$

(6) $d\mathbf{x} = \mathbf{R}\cdot d\mathbf{X}^* .$

In other words, the line element dX is first stretched and rotated by U to
become $d\mathbf{X}^*$. (For line elements along the direction of an eigenvector of
U, the deformation involves stretching only.) Then the line element $d\mathbf{X}^*$
is rotated by R to become dx. The second stage deformation involves no
stretching. Such a description of the deformation of dX is unique only to
within a rigid body translation.

The deformation gradient F can also be decomposed into

(7) ▲ $\mathbf{F} = \mathbf{V}\cdot\mathbf{R} ,$

where V is a symmetric tensor and satisfies

(8) $\mathbf{V}\cdot\mathbf{V} = \mathbf{V}\cdot\mathbf{R}\cdot\mathbf{R}^T\cdot\mathbf{V}^T = \mathbf{F}\cdot\mathbf{F}^T = \mathbf{B} .$

Equation (7) is called the left polar decomposition and V is called the
left stretch tensor. Note that B and V have the same eigenvectors, and
therefore are coaxial. Clearly V is positive definite and is linked to U of
the right polar decomposition by

$\mathbf{V} = \mathbf{F}\cdot\mathbf{R}^T = \mathbf{R}\cdot\mathbf{U}\cdot\mathbf{R}^T .$

Writing Eq. (16.2:2) in the form

(9) $d\mathbf{x}' = \mathbf{R}\cdot d\mathbf{X} ,$

(10) $d\mathbf{x} = \mathbf{V}\cdot d\mathbf{x}' ,$

we can interpret that the deformation first rotates dX as a rigid body by R
to form $d\mathbf{x}'$, then stretches and rotates it by V to form dx. If $d\mathbf{x}'$ is along
a principal direction of V, it is only stretched to become dx.
If $\lambda_i$ is an eigenvalue of U associated with the eigenvector $\mathbf{N}^i$, then

$\mathbf{U}\cdot\mathbf{N}^i = \lambda_i\mathbf{N}^i$ (i not summed),

and

$\lambda_i\,\mathbf{R}\cdot\mathbf{N}^i = \mathbf{R}\cdot\mathbf{U}\cdot\mathbf{R}^T\cdot\mathbf{R}\cdot\mathbf{N}^i = \mathbf{V}\cdot\mathbf{R}\cdot\mathbf{N}^i$ (i not summed),

which shows that $\lambda_i$ is also an eigenvalue of V associated with the
eigenvector

(11) $\mathbf{n}^i = \mathbf{R}\cdot\mathbf{N}^i .$

In other words, V and U have the same eigenvalues, and their eigenvectors
are linked by Eq. (11). Note that $\mathbf{N}^i$ and $\mathbf{n}^i$ are, respectively, also the
eigenvectors of C and B associated with the eigenvalue $\lambda_i^2$.
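
The polar decompositions can be illustrated in closed form for a plane deformation. The sketch below is restricted to a 2 x 2 deformation gradient with positive determinant, for which the rotation is obtained from the sum and difference of the components; this shortcut, the function name `polar_2d` and the sample F are assumptions made for illustration, and a general three-dimensional F would require an eigenvalue or iterative computation instead.

```python
import math

def matmul2(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def transpose2(A):
    return [[A[0][0], A[1][0]], [A[0][1], A[1][1]]]

def polar_2d(F):
    """Return (R, U, V) with F = R.U = V.R for a 2x2 F with det F > 0."""
    p = F[0][0] + F[1][1]
    q = F[0][1] - F[1][0]
    n = math.hypot(p, q)                       # nonzero whenever det F > 0
    R = [[p / n, q / n], [-q / n, p / n]]      # proper rotation
    U = matmul2(transpose2(R), F)              # right stretch, U = R^T . F
    V = matmul2(F, transpose2(R))              # left stretch,  V = F . R^T
    return R, U, V

F = [[1.2, 0.5], [0.1, 0.8]]                   # assumed plane deformation gradient
R, U, V = polar_2d(F)
C = matmul2(transpose2(F), F)                  # C = F^T . F
B = matmul2(F, transpose2(F))                  # B = F . F^T
RURt = matmul2(matmul2(R, U), transpose2(R))   # R . U . R^T, should equal V
print(R, U, V)
```

U and V come out symmetric with equal trace and determinant, hence equal eigenvalues, in agreement with the statement that the right and left stretch tensors share the principal stretches.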


16.5. STRAIN RATES 

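We shall now define the time rates of the strain measures discussed in the
previous section. The Eulerian descriptions of the velocity field $\mathbf{v} = \mathbf{v}(\mathbf{x}, t)$
and the velocity gradient L at time t are defined as

(1) $\mathbf{v}(\mathbf{x}, t) = \dot{\mathbf{x}}(\mathbf{X}, t) = \frac{\partial\mathbf{x}(\mathbf{X}, t)}{\partial t} ,$

(2) $\mathbf{L} = \frac{\partial\mathbf{v}}{\partial\mathbf{x}} , \quad\text{or}\quad L_{ij} = \frac{\partial v_i}{\partial x_j} .$

Then the material time derivative of the deformation gradient is

(3) ▲ $\frac{D\mathbf{F}}{Dt} = \dot{\mathbf{F}} = \frac{\partial\dot{\mathbf{x}}}{\partial\mathbf{X}} = \frac{\partial\dot{\mathbf{x}}}{\partial\mathbf{x}}\cdot\frac{\partial\mathbf{x}}{\partial\mathbf{X}} = \mathbf{L}\cdot\mathbf{F} ,$

or

$\frac{D}{Dt}\left(\frac{\partial x_i}{\partial X_J}\right) = \frac{\partial\dot{x}_i}{\partial X_J} = \frac{\partial v_i}{\partial x_j}\frac{\partial x_j}{\partial X_J} = L_{ij}F_{jJ} .$

Again $\dot{\mathbf{F}}$ is a two-point tensor. Using $\mathbf{F}\cdot\mathbf{F}^{-1} = \mathbf{I}$ and $\dot{\mathbf{F}}\cdot\mathbf{F}^{-1} + \mathbf{F}\cdot\frac{D\mathbf{F}^{-1}}{Dt} = 0$,
we establish that

(3a) $\frac{D\mathbf{F}^{-1}}{Dt} = -\mathbf{F}^{-1}\cdot\dot{\mathbf{F}}\cdot\mathbf{F}^{-1} = -\mathbf{F}^{-1}\cdot\mathbf{L} .$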


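In general, $(\dot{\mathbf{F}})^{-1}\,(= \mathbf{F}^{-1}\cdot\mathbf{L}^{-1}) \ne \frac{D\mathbf{F}^{-1}}{Dt}$. From Eq. (16.2:7), we find

(4) ▲ $\dot{\mathbf{C}} = \dot{\mathbf{F}}^T\cdot\mathbf{F} + \mathbf{F}^T\cdot\dot{\mathbf{F}} = \mathbf{F}^T\cdot(\mathbf{L}^T + \mathbf{L})\cdot\mathbf{F} = 2\,\mathbf{F}^T\cdot\mathbf{D}\cdot\mathbf{F} ,$

where

(5) ▲ $\mathbf{D} = (\mathbf{L} + \mathbf{L}^T)/2$

is the symmetric part of L called the (Eulerian) deformation or stretching
rate tensor. Another useful rate tensor derived from L is

(6) ▲ $\boldsymbol{\Omega} = (\mathbf{L} - \mathbf{L}^T)/2 ,$

which is skew-symmetric and called the spin tensor.

From Eqs. (5) and (16.3:5), we find the Green's strain rate

(7) ▲ $\dot{\mathbf{E}} = \mathbf{F}^T\cdot\mathbf{D}\cdot\mathbf{F} .$

The rate of the right stretch tensor can be derived from Eqs. (7) and (16.4:2)

(8) $\dot{\mathbf{U}}\cdot\mathbf{U} + \mathbf{U}\cdot\dot{\mathbf{U}} = 2\dot{\mathbf{E}} = 2\,\mathbf{F}^T\cdot\mathbf{D}\cdot\mathbf{F} .$

We then obtain

(9) $\frac{D\mathbf{B}^{-1}}{Dt} = -(\mathbf{L}^T\cdot\mathbf{B}^{-1} + \mathbf{B}^{-1}\cdot\mathbf{L}) ,$

(9a) ▲ $\dot{\mathbf{B}} = -\mathbf{B}\cdot\frac{D\mathbf{B}^{-1}}{Dt}\cdot\mathbf{B} = \mathbf{B}\cdot\mathbf{L}^T + \mathbf{L}\cdot\mathbf{B} .$

Again $\frac{D\mathbf{B}^{-1}}{Dt} \ne (\dot{\mathbf{B}})^{-1}$. The time derivative of Eq. (16.3:6) gives the rate of
the Almansi strain tensor

(10) ▲ $\dot{\mathbf{e}} = \mathbf{D} - (\mathbf{e}\cdot\mathbf{L} + \mathbf{L}^T\cdot\mathbf{e}) .$

From Eqs. (3) and (8), $\dot{\mathbf{R}}\cdot\mathbf{R}^T + \mathbf{R}\cdot\dot{\mathbf{R}}^T = 0$ and $\dot{\mathbf{F}} = \dot{\mathbf{R}}\cdot\mathbf{U} + \mathbf{R}\cdot\dot{\mathbf{U}}$, we have

(11) $\dot{\mathbf{R}} = \boldsymbol{\omega}\cdot\mathbf{R} , \qquad \dot{\mathbf{U}} = \mathbf{R}^T\cdot(\mathbf{L} - \boldsymbol{\omega})\cdot\mathbf{F} ,$

where $\boldsymbol{\omega}$ is a skew-symmetric tensor.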
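
The Green strain rate formula $\dot{\mathbf{E}} = \mathbf{F}^T\cdot\mathbf{D}\cdot\mathbf{F}$ can be compared against a finite-difference time derivative of E. The sketch below is an illustration under assumptions: the time-dependent plane motion in `F_of_t`, its exact rate `F_DOT`, the time instant and step, and all helper names are hypothetical.

```python
def F_of_t(t):
    """Assumed plane motion: stretching along both axes plus a growing shear."""
    return [[1.0 + 0.3 * t, 0.4 * t], [0.0, 1.0 - 0.2 * t]]

F_DOT = [[0.3, 0.4], [0.0, -0.2]]              # exact time derivative of F_of_t

def matmul2(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def transpose2(A):
    return [[A[0][0], A[1][0]], [A[0][1], A[1][1]]]

def inv2(A):
    d = A[0][0] * A[1][1] - A[0][1] * A[1][0]
    return [[A[1][1] / d, -A[0][1] / d], [-A[1][0] / d, A[0][0] / d]]

def green_strain(F):
    C = matmul2(transpose2(F), F)
    return [[0.5 * (C[i][j] - (1.0 if i == j else 0.0)) for j in range(2)]
            for i in range(2)]

t, h = 0.5, 1.0e-4
F = F_of_t(t)
L = matmul2(F_DOT, inv2(F))                    # velocity gradient, L = Fdot . F^-1
D = [[0.5 * (L[i][j] + L[j][i]) for j in range(2)] for i in range(2)]
E_dot_formula = matmul2(matmul2(transpose2(F), D), F)      # F^T . D . F

Ep, Em = green_strain(F_of_t(t + h)), green_strain(F_of_t(t - h))
E_dot_numeric = [[(Ep[i][j] - Em[i][j]) / (2.0 * h) for j in range(2)]
                 for i in range(2)]
print(E_dot_formula, E_dot_numeric)
```

Since E is quadratic in t for this motion, the central difference is exact apart from round-off, so the two results coincide closely.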

16.6. MATERIAL DERIVATIVES OF LINE, AREA AND 
VOLUME ELEMENTS 

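In Sec. 16.2, it was shown that dx, da, and dv in R are related to their
undeformed counterparts dX, dA and dV in $R_0$ through F. In the following,
we shall show that the stretching rate tensor D defined in Eq. (16.5:5)
relates the material rate derivatives of dx, da, dv to those of dX, dA, dV.
The time rate of a line element dx is

$d\dot{\mathbf{x}} = \frac{D}{Dt}(d\mathbf{x}) = \dot{\mathbf{F}}\cdot d\mathbf{X} = \mathbf{L}\cdot d\mathbf{x} ,$

in which the first equality is the definition of the dot, the second equality is
true because dX is independent of t, whereas the third equality introduces
the velocity gradient tensor defined in Eq. (16.5:2). The stretch rate of the
element is

(1) $l_s \equiv \frac{1}{|d\mathbf{x}|}\frac{D|d\mathbf{x}|}{Dt} = \frac{1}{|d\mathbf{x}|}\frac{D}{Dt}\sqrt{d\mathbf{x}\cdot d\mathbf{x}} = \frac{d\dot{\mathbf{x}}\cdot d\mathbf{x} + d\mathbf{x}\cdot d\dot{\mathbf{x}}}{2|d\mathbf{x}|^2} = \frac{d\mathbf{x}\cdot\mathbf{D}\cdot d\mathbf{x}}{|d\mathbf{x}|^2} = \mathbf{s}\cdot\mathbf{D}\cdot\mathbf{s} ,$

where $\mathbf{s} = d\mathbf{x}/|d\mathbf{x}|$ is the unit vector along the direction of dx. Since
$ds = \sqrt{d\mathbf{x}\cdot d\mathbf{x}} = |d\mathbf{x}|$ and $ds/dS = \lambda_s$ [Eq. (16.2:10)], it follows that

(2) $l_s = \frac{1}{ds}\frac{D(ds)}{Dt} = \frac{\dot{\lambda}_s}{\lambda_s} = \frac{D}{Dt}\ln\lambda_s ,$

where dS is the undeformed length of ds and is independent of t. The
equation above gives

(3) ▲ $\dot{\lambda}_s = \lambda_s l_s = \lambda_s\,\mathbf{s}\cdot\mathbf{D}\cdot\mathbf{s} = \lambda_s\,\mathbf{s}\cdot\mathbf{L}\cdot\mathbf{s} = \lambda_s\,\mathbf{s}\cdot\mathbf{L}^T\cdot\mathbf{s} .$

Using Eq. (1) we can show that

(4) $\dot{\mathbf{s}} = \frac{D}{Dt}\left(\frac{d\mathbf{x}}{|d\mathbf{x}|}\right) = \mathbf{L}\cdot\mathbf{s} - (\mathbf{s}\cdot\mathbf{D}\cdot\mathbf{s})\,\mathbf{s} = (\mathbf{L} - l_s\mathbf{I})\cdot\mathbf{s} .$

The quantity $\ln\lambda_s$ is often called the natural or logarithmic strain.
Equation (2) establishes that the stretch rate of a deformed element is
equal to the rate of logarithmic strain. In Eq. (1), taking $\mathbf{s} = \mathbf{e}_i$ leads to
$l_s = D_{ii}$ (i not summed). Thus, with respect to Cartesian coordinates, the
diagonal components $D_{11}$, $D_{22}$ and $D_{33}$ are the logarithmic strain rates
along the directions of base vectors $\mathbf{e}_1$, $\mathbf{e}_2$ and $\mathbf{e}_3$, respectively.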

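To determine the meaning of the off-diagonal components of $D$, we
consider two infinitesimal line elements $dx^{(1)}$ and $dx^{(2)}$ along the directions
$s^{(1)}$ and $s^{(2)}$, respectively. The dot product of the two line elements gives

$dx^{(1)} \cdot dx^{(2)} = |dx^{(1)}|\,|dx^{(2)}| \cos\theta,$

where $\theta$ is the angle between the two vector differentials. Then

$\frac{D}{Dt}\left(dx^{(1)} \cdot dx^{(2)}\right) = \frac{D}{Dt}\left(|dx^{(1)}|\,|dx^{(2)}|\right)\cos\theta - |dx^{(1)}|\,|dx^{(2)}| \sin\theta\, \frac{D\theta}{Dt},$

$\frac{D}{Dt}\left(dx^{(1)} \cdot dx^{(2)}\right) = dx^{(1)} \cdot (L^T + L) \cdot dx^{(2)} = 2\, dx^{(1)} \cdot D \cdot dx^{(2)}.$

If $dx^{(1)}$ is perpendicular to $dx^{(2)}$, $\cos\theta = 0$ and $\sin\theta = 1$. Equating the
two equations above gives

(5)  $-\frac{D\theta}{Dt} = \frac{2\, dx^{(1)} \cdot D \cdot dx^{(2)}}{|dx^{(1)}|\,|dx^{(2)}|} = 2\, s^{(1)} \cdot D \cdot s^{(2)}.$

But $-D\theta/Dt$ is the decreasing rate of a right angle, which is precisely the
shearing rate. If $s^{(1)}$ and $s^{(2)}$ are in the directions of two base vectors $e_i$
and $e_j$ where $i \neq j$, then $s^{(1)} \cdot D \cdot s^{(2)} = D_{ij}$.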

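The rate of the volume ratio is determined as follows. Notice that

$\frac{\partial J}{\partial F_{ij}} = \mathrm{cofactor}(F_{ij}) = J (F^{-1})_{ji}, \qquad \text{i.e.,} \qquad \frac{\partial J}{\partial F} = J (F^T)^{-1}.$

It is straightforward to find

(6)  $\dot{J} = \frac{DJ}{Dt} = \frac{\partial J}{\partial F} : \dot{F} = J (F^T)^{-1} : (L \cdot F) = \mathrm{tr}(L)\, J = \mathrm{tr}(D)\, J.$

Then

(7) ▲  $\frac{D}{Dt} \ln(J) = \mathrm{tr}(D) = \frac{\partial v_i}{\partial x_i} = \nabla \cdot v,$

where $\nabla\ (= e_i\, \partial/\partial x_i)$ is the Eulerian gradient operator. The divergence of the
velocity vector is the logarithmic rate of the volume ratio.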

To compute the time rate of the area element $d\mathbf{a}$, whose magnitude is
$da$, we derive from Eqs. (16.2:12), (16.1:15), (3) and (6) that

(8)  $d\dot{\mathbf{a}} = \frac{D}{Dt}\left[J (F^{-1})^T\right] \cdot d\mathbf{A} = [\mathrm{tr}(L)\, I - L^T] \cdot d\mathbf{a}.$

The use of Eq. (16.2:14) and $|d\mathbf{a}|^2 = d\mathbf{a} \cdot d\mathbf{a}$ leads to

$\dot{\lambda}_n = \frac{D}{Dt}\frac{|d\mathbf{a}|}{|d\mathbf{A}|} = \frac{|d\mathbf{a}|}{|d\mathbf{A}|}\frac{1}{|d\mathbf{a}|}\frac{D|d\mathbf{a}|}{Dt} = \lambda_n \frac{d\dot{\mathbf{a}} \cdot d\mathbf{a}}{|d\mathbf{a}|^2}.$

From Eqs. (1) and (8), the equation above becomes

(9) ▲  $\dot{\lambda}_n = \lambda_n[\mathrm{tr}(D) - n \cdot L^T \cdot n] = \lambda_n[\mathrm{tr}(D) - l_s],$

where $n$ is the unit vector normal to the surface and $l_s$ is the logarithmic
stretch rate of the line element along the normal direction. Then

(10)  $\frac{D}{Dt}\ln(\lambda_n) = \mathrm{tr}(D) - l_s.$

Introducing the Jacobian $J = \det[F(X, t)]$, and using Eqs. (2), (6) and
(10), we obtain

$\frac{D}{Dt}\ln(J) = \frac{D}{Dt}\ln(\lambda_s) + \frac{D}{Dt}\ln(\lambda_n),$

which implies

(11)  $J = \lambda_s \lambda_n.$

Equation (11) indicates that the volume ratio equals the product of the
stretch ratio and the area ratio provided that the direction of the stretch
is normal to the area element.

In summary, the rates of the logarithmic line, area and volume ratios of
finite deformation are

$\frac{D}{Dt}\ln(\lambda_s) = s \cdot D \cdot s, \qquad \frac{D}{Dt}\ln(\lambda_n) = \mathrm{tr}(D) - n \cdot D \cdot n, \qquad \frac{D}{Dt}\ln(J) = \mathrm{tr}(D).$
These results reduce to those of the infinitesimal strain in Chapter 4.
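
The three logarithmic rates summarized above can be checked numerically. The sketch below is only an illustration under stated assumptions: the motion `F_of_t`, the material line element `dX` and the material area vector `dA` are hypothetical choices made for the test, not taken from the text. It compares the predictions $s \cdot D \cdot s$, $\mathrm{tr}(D) - n \cdot D \cdot n$ and $\mathrm{tr}(D)$ with central finite differences of $\ln\lambda_s$, $\ln\lambda_n$ and $\ln J$, using $d\mathbf{a} = J (F^{-1})^T \cdot d\mathbf{A}$ for the deformed area vector.

```python
import math

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)] for i in range(3)]

def matvec(A, v):
    return [sum(A[i][k] * v[k] for k in range(3)) for i in range(3)]

def transpose(A):
    return [[A[j][i] for j in range(3)] for i in range(3)]

def det(A):
    return (A[0][0] * (A[1][1] * A[2][2] - A[1][2] * A[2][1])
            - A[0][1] * (A[1][0] * A[2][2] - A[1][2] * A[2][0])
            + A[0][2] * (A[1][0] * A[2][1] - A[1][1] * A[2][0]))

def inverse(A):
    d = det(A)
    inv = [[0.0] * 3 for _ in range(3)]
    for i in range(3):
        for j in range(3):
            r = [k for k in range(3) if k != i]
            c = [k for k in range(3) if k != j]
            minor = A[r[0]][c[0]] * A[r[1]][c[1]] - A[r[0]][c[1]] * A[r[1]][c[0]]
            inv[j][i] = (-1) ** (i + j) * minor / d  # transposed cofactor over det
    return inv

def norm(v):
    return math.sqrt(sum(x * x for x in v))

def F_of_t(t):
    # Hypothetical smooth motion, chosen only for illustration.
    return [[1 + 0.3 * t, 0.2 * t, 0.0],
            [0.1 * t * t, 1 + 0.1 * t, 0.05 * t],
            [0.0, 0.15 * t, 1 - 0.2 * t]]

def log_ratios(t, dX, dA):
    """Return (ln lambda_s, ln lambda_n, ln J) at time t."""
    F = F_of_t(t)
    J = det(F)
    dx = matvec(F, dX)                                        # dx = F . dX
    da = [J * c for c in matvec(transpose(inverse(F)), dA)]   # da = J F^{-T} . dA
    return (math.log(norm(dx) / norm(dX)),
            math.log(norm(da) / norm(dA)),
            math.log(J))

def compare_rates(t=0.7, h=1e-6):
    dX, dA = [1.0, 2.0, 0.5], [0.3, -1.0, 2.0]
    F, Fp, Fm = F_of_t(t), F_of_t(t + h), F_of_t(t - h)
    Fdot = [[(Fp[i][j] - Fm[i][j]) / (2 * h) for j in range(3)] for i in range(3)]
    L = matmul(Fdot, inverse(F))                              # L = Fdot . F^{-1}
    D = [[0.5 * (L[i][j] + L[j][i]) for j in range(3)] for i in range(3)]
    trD = D[0][0] + D[1][1] + D[2][2]
    dx = matvec(F, dX)
    s = [c / norm(dx) for c in dx]                            # unit vector along dx
    da = matvec(transpose(inverse(F)), dA)                    # direction of deformed normal
    n = [c / norm(da) for c in da]
    sDs = sum(s[i] * D[i][j] * s[j] for i in range(3) for j in range(3))
    nDn = sum(n[i] * D[i][j] * n[j] for i in range(3) for j in range(3))
    predicted = (sDs, trD - nDn, trD)
    plus, minus = log_ratios(t + h, dX, dA), log_ratios(t - h, dX, dA)
    numeric = tuple((p - m) / (2 * h) for p, m in zip(plus, minus))
    return predicted, numeric

if __name__ == "__main__":
    predicted, numeric = compare_rates()
    for name, p, q in zip(("line", "area", "volume"), predicted, numeric):
        print(f"{name:6s} predicted {p:+.8f}  finite difference {q:+.8f}")
```

The finite-difference step `h` is a trade-off between truncation and rounding error, so agreement should be expected only to several digits.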


Problem 16.3. Show that

$\frac{\partial J}{\partial F} = J (F^T)^{-1} \qquad \text{and} \qquad \frac{\partial J}{\partial E} = J\, C^{-1},$

where $J = \det(F)$.



16.7. STRESSES 

The stress tensor introduced in Chapter 3 is valid under large deformation,
provided that all quantities are Eulerian, referred to the current
configuration. This stress tensor is called the Cauchy stress and denoted
by $\tau$, which is an Eulerian tensor with components $\tau_{ij}$. The rate of work
done per unit volume by the Cauchy stress acting on the surface of an
infinitesimal volume element is $\tau_{ij} D_{ij}$, which is the inner product of the stress
tensor and the deformation rate tensor [see Sec. 12.3, Eqs. (12.3:7,8)]. Let
$\rho$ be the density of the material at the deformed state. The rate of work
done per unit mass by the stress is $\dot{W} = \tau_{ij} D_{ij}/\rho$. Then

(1)  $\rho \dot{W} = \tau : D = \tau_{ij} D_{ij} = \tau_{ij} V_{ij},$

where $D$ and $V_{ij}$ are given in Eqs. (16.5:5) and (12.2:8), respectively. We
define a stress to be conjugate to (or the work conjugate variable of) a
strain rate, and vice versa, if their inner product is the work rate per unit
deformed or undeformed volume. Thus the Cauchy stress $\tau$ is the stress
measure conjugate to, or the work conjugate variable of, the deformation
rate tensor $D$ when the work rate is referred to the deformed volume.

Since the mass does not change during deformation, we see that the
work rate referred to a unit volume of the material in the undeformed
state is

(2)  $\rho_0 \dot{W} = \frac{\rho_0}{\rho}\, \rho \dot{W} = J \tau : D = J \tau_{ij} D_{ij},$

where $J\ [= \det(F)]$ is the Jacobian of the deformation gradient tensor $F$.

Chapter 4 and Secs. 16.3 and 16.4 show that there are other strain
measures, each of which has a claim to advantage in certain circumstances.
Corresponding to each strain measure is a stress measure whose inner product
with the time rate of the strain measure gives the rate of work done per
unit deformed or undeformed volume. Then the strain rate and the
corresponding stress measure are said to be conjugate to each other. The stress
measures are defined as follows.

Kirchhoff Stress. Kirchhoff defined

(3)  $\sigma = J \tau,$

now known as the Kirchhoff stress tensor. Obviously $\sigma$ is symmetric and
$\sigma : D$ equals $\rho_0 \dot{W}$, the time rate of strain energy density per unit
undeformed volume. Therefore, $\sigma$ is conjugate to $D$ when the work rate is
referred to the undeformed configuration.

First Piola-Kirchhoff Stress. The first Piola-Kirchhoff stress tensor,
also called the Lagrangian stress tensor, is defined as

(4)  $T = F^{-1} \cdot \sigma = J F^{-1} \cdot \tau,$

which is a nonsymmetric two-point tensor with $F^{-1}$ based on the
undeformed state and $\tau$ referred to the deformed state. Since $\sigma$ is symmetric,
using Eq. (16.5:3), one can show that

(5)  $T : \dot{F}^T = (F^{-1} \cdot \sigma) : \dot{F}^T = (\dot{F} \cdot F^{-1})^T : \sigma = \sigma : L = \sigma : D.$

Thus $T : \dot{F}^T$ equals $\rho_0 \dot{W}$ and, therefore, $T$ is conjugate to $\dot{F}^T$.

Second Piola-Kirchhoff Stress. A widely used stress measure is

(6)  $S = T \cdot (F^{-1})^T = F^{-1} \cdot \sigma \cdot (F^{-1})^T = J F^{-1} \cdot \tau \cdot (F^{-1})^T,$

known as the second Piola-Kirchhoff stress tensor. Clearly $S$ is symmetric
and is defined in Lagrangian description. It can be shown that

(7)  $S : \dot{E} = [F^{-1} \cdot \sigma \cdot (F^{-1})^T] : \dot{E} = (F^{-1} \cdot \sigma) : (\dot{E} \cdot F^{-1}) = \sigma : [(F^{-1})^T \cdot \dot{E} \cdot F^{-1}] = \sigma : D = \rho_0 \dot{W},$

which establishes that $S$ is conjugate to Green’s strain rate $\dot{E}$ defined in
Eq. (16.5:7).
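
The conjugacy statements in Eqs. (5) and (7) can be illustrated numerically. The following sketch assumes the double-dot product $A : B = A_{ij} B_{ij}$ and uses arbitrary sample values for $F$, $\dot{F}$ and $\tau$ (hypothetical numbers, chosen only so that $\det F > 0$ and $\tau$ is symmetric). It evaluates $\sigma : D$, $T : \dot{F}^T$ and $S : \dot{E}$, which should agree to rounding error.

```python
def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)] for i in range(3)]

def transpose(A):
    return [[A[j][i] for j in range(3)] for i in range(3)]

def ddot(A, B):
    # Double-dot product A : B = A_ij B_ij
    return sum(A[i][j] * B[i][j] for i in range(3) for j in range(3))

def det(A):
    return (A[0][0] * (A[1][1] * A[2][2] - A[1][2] * A[2][1])
            - A[0][1] * (A[1][0] * A[2][2] - A[1][2] * A[2][0])
            + A[0][2] * (A[1][0] * A[2][1] - A[1][1] * A[2][0]))

def inverse(A):
    d = det(A)
    inv = [[0.0] * 3 for _ in range(3)]
    for i in range(3):
        for j in range(3):
            r = [k for k in range(3) if k != i]
            c = [k for k in range(3) if k != j]
            minor = A[r[0]][c[0]] * A[r[1]][c[1]] - A[r[0]][c[1]] * A[r[1]][c[0]]
            inv[j][i] = (-1) ** (i + j) * minor / d
    return inv

def stress_power(F, Fdot, tau):
    """Work rate per unit undeformed volume from three conjugate pairs."""
    J = det(F)
    Finv = inverse(F)
    sigma = [[J * tau[i][j] for j in range(3)] for i in range(3)]   # Kirchhoff stress, Eq. (3)
    L = matmul(Fdot, Finv)                                          # velocity gradient
    D = [[0.5 * (L[i][j] + L[j][i]) for j in range(3)] for i in range(3)]
    T = matmul(Finv, sigma)                                         # first Piola-Kirchhoff, Eq. (4)
    S = matmul(T, transpose(Finv))                                  # second Piola-Kirchhoff, Eq. (6)
    Edot = matmul(matmul(transpose(F), D), F)                       # Green strain rate, Eq. (16.5:7)
    return ddot(sigma, D), ddot(T, transpose(Fdot)), ddot(S, Edot)

def example():
    # Hypothetical sample state, for illustration only.
    F = [[1.2, 0.3, 0.0], [0.1, 0.9, 0.2], [0.0, 0.1, 1.1]]
    Fdot = [[0.05, -0.02, 0.01], [0.03, 0.04, 0.0], [-0.01, 0.02, 0.06]]
    tau = [[10.0, 2.0, -1.0], [2.0, 5.0, 3.0], [-1.0, 3.0, -4.0]]
    return stress_power(F, Fdot, tau)

if __name__ == "__main__":
    print(example())
```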

Biot-Lur’e Stress. Another stress measure is the Biot-Lur’e stress tensor
which is defined as

(8)  $r^* = F^{-1} \cdot \sigma \cdot R = T \cdot R = S \cdot U.$

One can show that

(8a)  $S : \dot{E} = S \cdot U : \dot{U} = r^* : \dot{U} = r : \dot{U},$

where

$r = (r^* + r^{*T})/2$

is the symmetric part of $r^*$. Then the conjugate stress measure of $\dot{U}$ is $r^*$
since $r^* : \dot{U} = \rho_0 \dot{W}$.

Corotational Cauchy Stress. Green and Naghdi (1965) introduced

(9)  $\sigma_r = R^T \cdot \tau \cdot R\, J = U \cdot S \cdot U$

as the symmetric corotational Cauchy stress. This stress is obtained by
rotating the Cauchy stress tensor back to the undeformed body. The
conjugate strain measure is $R^T \cdot D \cdot R$, so that the strain energy rate per unit
undeformed volume is

(9a)  $\sigma_r : (R^T \cdot D \cdot R) = \sigma : D = S : \dot{E}.$

A summary of the relations among the different stress measures is as
follows:

(10) ▲  $\sigma = J\tau = F \cdot T = F \cdot S \cdot F^T = F \cdot r^* \cdot R^T = R \cdot \sigma_r \cdot R^T.$

The hydrostatic pressure is usually defined as the mean Cauchy stress

$p = -(\tau : I)/3.$

It follows from Eq. (10)

(11) ▲  $p = -\frac{\tau : I}{3} = -\frac{\sigma : I}{3J} = -\frac{\sigma_r : I}{3J} = -\frac{T : F^T}{3J} = -\frac{S : C}{3J} = -\frac{r^* : U}{3J} = -\frac{r : U}{3J},$

which gives the consistent definition of hydrostatic pressure based on
different stress measures. Similarly one can consistently define the deviatoric
stresses for different stress measures, namely

$\tau'' = \tau + pI = \tau - \tfrac{1}{3}(\tau : I) I,$
$\sigma'' = \sigma + pJI = \sigma - \tfrac{1}{3}(\sigma : I) I,$
$T'' = T + pJF^{-1} = T - \tfrac{1}{3}(T : F^T) F^{-1},$
(12)  $S'' = S + pJC^{-1} = S - \tfrac{1}{3}(S : C) C^{-1},$
$r^{*\prime\prime} = r^* + pJU^{-1} = r^* - \tfrac{1}{3}(r^* : U) U^{-1},$
$r'' = r + pJU^{-1} = r - \tfrac{1}{3}(r^* : U) U^{-1},$
$\sigma_r'' = \sigma_r + JpI = \sigma_r - \tfrac{1}{3}(\sigma_r : I) I.$

Note that $F^{-1} : F^T = 3$. We have

(13)  $\tau'' : I = \sigma'' : I = T'' : F^T = S'' : C = r^{*\prime\prime} : U = r'' : U = \sigma_r'' : I = 0.$
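
As a quick numerical illustration of Eqs. (11) to (13), the sketch below evaluates the pressure from the Cauchy, Kirchhoff, first and second Piola-Kirchhoff stresses and checks that the deviatoric parts $T'' = T + pJF^{-1}$ and $S'' = S + pJC^{-1}$ satisfy $T'' : F^T = S'' : C = 0$. The sample $F$ and $\tau$ are hypothetical values, and the double-dot product is taken as $A : B = A_{ij} B_{ij}$.

```python
def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)] for i in range(3)]

def transpose(A):
    return [[A[j][i] for j in range(3)] for i in range(3)]

def ddot(A, B):
    return sum(A[i][j] * B[i][j] for i in range(3) for j in range(3))

def det(A):
    return (A[0][0] * (A[1][1] * A[2][2] - A[1][2] * A[2][1])
            - A[0][1] * (A[1][0] * A[2][2] - A[1][2] * A[2][0])
            + A[0][2] * (A[1][0] * A[2][1] - A[1][1] * A[2][0]))

def inverse(A):
    d = det(A)
    inv = [[0.0] * 3 for _ in range(3)]
    for i in range(3):
        for j in range(3):
            r = [k for k in range(3) if k != i]
            c = [k for k in range(3) if k != j]
            minor = A[r[0]][c[0]] * A[r[1]][c[1]] - A[r[0]][c[1]] * A[r[1]][c[0]]
            inv[j][i] = (-1) ** (i + j) * minor / d
    return inv

IDENTITY = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]

def pressure_measures(F, tau):
    """Return pressures from four stress measures and two deviatoric checks."""
    J = det(F)
    Finv = inverse(F)
    sigma = [[J * tau[i][j] for j in range(3)] for i in range(3)]
    T = matmul(Finv, sigma)               # first Piola-Kirchhoff
    S = matmul(T, transpose(Finv))        # second Piola-Kirchhoff
    C = matmul(transpose(F), F)           # right Cauchy-Green tensor
    Cinv = inverse(C)
    p = {
        "cauchy": -ddot(tau, IDENTITY) / 3,
        "kirchhoff": -ddot(sigma, IDENTITY) / (3 * J),
        "first_PK": -ddot(T, transpose(F)) / (3 * J),
        "second_PK": -ddot(S, C) / (3 * J),
    }
    pJ = p["cauchy"] * J
    T_dev = [[T[i][j] + pJ * Finv[i][j] for j in range(3)] for i in range(3)]
    S_dev = [[S[i][j] + pJ * Cinv[i][j] for j in range(3)] for i in range(3)]
    return p, ddot(T_dev, transpose(F)), ddot(S_dev, C)

def example():
    # Hypothetical sample state, for illustration only.
    F = [[1.2, 0.3, 0.0], [0.1, 0.9, 0.2], [0.0, 0.1, 1.1]]
    tau = [[10.0, 2.0, -1.0], [2.0, 5.0, 3.0], [-1.0, 3.0, -4.0]]
    return pressure_measures(F, tau)

if __name__ == "__main__":
    print(example())
```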

We have established in Eqs. (5), (7), (8a) and (9a) that the work
conjugate variables of $\sigma$, $T$, $S$, $r^*$ and $\sigma_r$ are $D$, $\dot{F}^T$, $\dot{E}$, $\dot{U}$ and $R^T \cdot D \cdot R$,
respectively. Let us define

$P_d = \int_{R_0} \rho_0 \dot{W}\, dV = \int_{R} \rho \dot{W}\, dv$

as the deformation power, where $R_0$ is the undeformed body. We can
express $P_d$ in alternative forms based on different stress measures, that is,

(14) ▲  $P_d = \int_{R} \tau : D\, dv = \int_{R_0} \sigma : D\, dV = \int_{R_0} T : \dot{F}^T\, dV = \int_{R_0} S : \dot{E}\, dV$
$\qquad = \int_{R_0} r^* : \dot{U}\, dV = \int_{R_0} r : \dot{U}\, dV = \int_{R_0} \sigma_r : (R^T \cdot D \cdot R)\, dV.$

Using Eqs. (10) and (16.2:12), one can derive the following relations among
the stress measures:

$\tau \cdot da = T^T \cdot dA = F \cdot S \cdot dA = R \cdot r^{*T} \cdot dA = R \cdot \sigma_r \cdot U^{-1} \cdot dA = R \cdot \sigma_r \cdot R^T \cdot da / J,$

which means that the transformation of the force vector $\tau \cdot da$ by $I$, $F^{-1}$,
$R^T$ gives the force vectors $T^T \cdot dA$, $S \cdot dA$, $r^{*T} \cdot dA$, respectively, on the
undeformed surface. We also have the rotated force vector
$R^T \cdot (\sigma \cdot da) = \sigma_r \cdot da^*$, where $da^* = R^T \cdot da$ is the rotated surface element of $da$. The
geometric interpretation of the different force vectors is explained in more
detail below.

Graphic Description of Stress Measures. Because of the importance
of the stress tensors, we would like to offer a graphic description of the
Lagrangian, the second Piola-Kirchhoff and Biot-Lur’e stresses below. This
is to supplement the graphic details presented in Chapter 3 for the Cauchy
stress, and to do the derivation once more in the indicial notation, which
was used in Chapters 3-15. Consider an element of a strained solid as
shown on the right-hand side of Fig. 16.7:1. Assume that in the original
(undeformed) state this element has the configuration as shown on the
left side of Fig. 16.7:1. A force vector $d\overset{\nu}{T}$ acts on the surface $PQRS$. A
corresponding force vector $d\overset{\nu}{T}_0$ acts on the surface $P_0Q_0R_0S_0$. If we assign
a rule of correspondence between $d\overset{\nu}{T}$ and $d\overset{\nu}{T}_0$ for every corresponding
pair of surfaces, and define stress vectors in each case as the limiting ratios
$d\overset{\nu}{T}/da$, $d\overset{\nu}{T}_0/dA$, where $da$ and $dA$ are the areas of $PQRS$, $P_0Q_0R_0S_0$,
respectively, then by the method of Chapter 3 we can define stress tensors
in both configurations. The assignment of a correspondence rule between
$d\overset{\nu}{T}$ and $d\overset{\nu}{T}_0$ is based on the equations derived above.

Fig. 16.7:1. The corresponding tractions in the original and deformed state of body.

Fig. 16.7:2. The correspondence of force vectors in defining (a) Lagrange’s and (b) the
second Piola-Kirchhoff stress, illustrated in a two-dimensional case.

The following alternative rules are known as the Lagrangian, the second
Piola-Kirchhoff and the Biot-Lur’e rules, respectively (see Fig. 16.7:2)

(15)  $dT_{0i}^{(L)} = dT_i$  or  $d\overset{\nu}{T}{}_0^{(L)} = d\overset{\nu}{T}$  (Lagrangian rule),

(16)  $dT_{0i}^{(K)} = \frac{\partial X_i}{\partial x_j}\, dT_j$  or  $d\overset{\nu}{T}{}_0^{(K)} = F^{-1} \cdot d\overset{\nu}{T}$  (Piola-Kirchhoff rule),

(17)  $dT_{0i}^{(B)} = R_{ij}\, dT_j$  or  $d\overset{\nu}{T}{}_0^{(B)} = R \cdot d\overset{\nu}{T}$  (Biot-Lur’e rule).

In other words, the Lagrangian rule assigns the force vector $d\overset{\nu}{T}{}_0^{(L)}$ acting
on the surface element $dA$ in the undeformed configuration to be equal to
the force vector $d\overset{\nu}{T}$ acting on the corresponding surface element $da$ in the
current configuration; the second Piola-Kirchhoff rule specifies $d\overset{\nu}{T}{}_0^{(K)}$, the
force vector acting on the surface element $dA$, to be related to $d\overset{\nu}{T}$ by the
same rule as the transformation $dX_i = (\partial X_i/\partial x_j)\, dx_j$; whereas the Biot-Lur’e
rule specifies $d\overset{\nu}{T}{}_0^{(B)}$ to be $d\overset{\nu}{T}$ rotated by $R$. Note that, in the present
case, both the deformed and undeformed configurations have the same base
vectors. There is no need to use capital subscripts to distinguish indices
referred to the undeformed configuration.

Let $n$ and $N$ be the unit vectors normal to $da$ and $dA$, respectively. If
$\tau_{ij}$ is the Cauchy stress tensor referred to the strained state, from Cauchy’s
relation, the vector $d\overset{\nu}{T}$, with components $dT_i$, can be written as

(18)  $dT_i = \tau_{ji} n_j\, da$,  or  $d\overset{\nu}{T} = \tau \cdot da.$

We now define stress components referred to the original state by a law
similar to Eq. (18). If Eq. (15) is used, we write

(19)  $dT_{0i}^{(L)} = T_{ji} N_j\, dA = dT_i, \qquad d\overset{\nu}{T}{}_0^{(L)} = T^T \cdot dA = d\overset{\nu}{T},$

according to the Lagrangian rule. Note that $T_{ij} \neq T_{ji}$. If Eq. (16) is used,
we write

(20)  $dT_{0i}^{(K)} = S_{ji} N_j\, dA = \frac{\partial X_i}{\partial x_j}\, dT_j, \qquad d\overset{\nu}{T}{}_0^{(K)} = S \cdot dA = F^{-1} \cdot d\overset{\nu}{T},$

according to the second Piola-Kirchhoff rule. Note that $S_{ij} = S_{ji}$ and that
$\tau_{ij}$, $T_{ij}$, $S_{ij}$ are the Cauchy, the Lagrangian, and the second Piola-Kirchhoff
stress tensors, respectively.

The relationship among $\tau_{ij}$, $T_{ij}$ and $S_{ij}$ can now be derived. Using the
relation between $n_i\, da$ and $N_i\, dA$ given in Eq. (16.2:12), from Eqs. (18) and
(19), we obtain

$T_{ji} N_j\, dA = \tau_{ji} n_j\, da = J \tau_{mi} \frac{\partial X_j}{\partial x_m} N_j\, dA$,  or

$N \cdot T\, dA = n \cdot \tau\, da = J N \cdot F^{-1} \cdot \tau\, dA,$

which implies

(21)  $T_{ji} = J \frac{\partial X_j}{\partial x_m} \tau_{mi}$,  or  $T = J F^{-1} \cdot \tau = F^{-1} \cdot \sigma,$

which is the same as that defined in Eq. (4), where $J$ is the Jacobian of the
transformation from the undeformed to the deformed configuration defined
in Eqs. (16.2:5), (16.2:16), or (16.2:17). Similarly, from Eqs. (18) and (20),

(22)  $S_{ji} = J \frac{\partial X_i}{\partial x_m} \frac{\partial X_j}{\partial x_n} \tau_{nm} = \frac{\partial X_i}{\partial x_m} \frac{\partial X_j}{\partial x_n} \sigma_{nm}$,  or  $S = J F^{-1} \cdot \tau \cdot (F^{-1})^T = F^{-1} \cdot \sigma \cdot (F^{-1})^T,$

as given in Eq. (6). Hence, from the two equations above, we find the
relation between $S$ and $T$

$S_{ji} = T_{jn} \frac{\partial X_i}{\partial x_n}$,  or  $S = T \cdot (F^{-1})^T.$

Similarly, we can show the relation between $r^*$ and $\sigma_r$ using Eq. (17).

The Eulerian (Cauchy) stress tensor $\tau_{ij}$ is symmetric as shown in Chapter 3.
From Eq. (21), we see that, in general, the Lagrangian stress tensor
$T_{ji}$ is not symmetric, whereas from Eq. (22), we see that the second
Piola-Kirchhoff stress tensor $S_{ji}$ is. The Lagrangian tensor is inconvenient to
use in a stress-strain law in which the strain tensor is always symmetric, but
is often convenient in laboratory work. The second Piola-Kirchhoff stress
tensor is more suitable for the purpose.

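From the identities

$\delta_{ij} = \frac{\partial X_i}{\partial x_m} \frac{\partial x_m}{\partial X_j}, \qquad \delta_{ij} = \frac{\partial x_i}{\partial X_m} \frac{\partial X_m}{\partial x_j},$

we find at once the relations

(23)  $\tau_{ji} = \frac{1}{J} \frac{\partial x_j}{\partial X_m} T_{mi} = \frac{1}{J} \frac{\partial x_j}{\partial X_m} \frac{\partial x_i}{\partial X_n} S_{nm}$,  or  $\tau = \frac{F \cdot T}{J} = \frac{F \cdot S \cdot F^T}{J},$

(24)  $T_{ij} = S_{im} \frac{\partial x_j}{\partial X_m}$,  or  $T = S \cdot F^T.$
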

16.8. EXAMPLE: COMBINED TENSION AND TORSION LOADS 

Consider a thin-walled circular cylinder subjected to combined tension
and torsion loads (Khan and Huang 1995). The geometry and deformation
are described in terms of polar coordinates. The geometry of the tube is

$R_m - h/2 \le R \le R_m + h/2, \qquad 0 \le \Theta \le 2\pi, \qquad 0 \le Z \le L,$

where $h/R_m,\ h/L \ll 1$. The point at $(R, \Theta, Z)$ before deformation moves
to $(r, \theta, z)$. In polar coordinates, the deformation can be written as

$r = \alpha R, \qquad \theta = \Theta + \omega Z, \qquad z = \lambda Z.$

Here $\omega$ and $\lambda$ are the angle of twist and the axial stretch ratio per
unit undeformed length, respectively. The base vectors for the deformed
and undeformed coordinates are, respectively, $(e_r, e_\theta, e_z)$ and $(e_R, e_\Theta, e_Z)$.
They are related to the fixed Cartesian base vectors $(e_X, e_Y, e_Z)$ by

(1)  $e_r = \cos\theta\, e_X + \sin\theta\, e_Y = \cos(\Theta + \omega Z)\, e_X + \sin(\Theta + \omega Z)\, e_Y,$

(2)  $e_\theta = -\sin\theta\, e_X + \cos\theta\, e_Y = -\sin(\Theta + \omega Z)\, e_X + \cos(\Theta + \omega Z)\, e_Y,$
and

$e_z = e_Z, \qquad e_R = \cos\Theta\, e_X + \sin\Theta\, e_Y, \qquad e_\Theta = -\sin\Theta\, e_X + \cos\Theta\, e_Y.$

The base vectors $e_r$, $e_\theta$ are functions of time and the undeformed
coordinates. The deformed position and the velocity vector are, respectively,

(3)  $x = r e_r + z e_z = R\alpha\, e_r + Z\lambda\, e_z,$

(4)  $\dot{x} = R\dot{\alpha}\, e_r + RZ\alpha\dot{\omega}\, e_\theta + Z\dot{\lambda}\, e_z.$

The second term on the right-hand side of the last equation arises from the
time derivative of $e_r$. The deformation gradient tensor $F$ and its material
time derivative $\dot{F}$ can be derived by taking the spatial derivatives of Eqs. (3)
and (4) with respect to the undeformed coordinates. In doing so, one must
also account for the fact that both $e_r$ and $e_\theta$ are functions of $\Theta$ and $Z$.

As an example, consider the derivation of $\dot{F}$ from $\dot{x}$ through the spatial
derivative with respect to the undeformed coordinates:

$\dot{F} = \frac{\partial \dot{x}}{\partial X} = \frac{\partial \dot{x}}{\partial R}\, e_R + \frac{1}{R}\frac{\partial \dot{x}}{\partial \Theta}\, e_\Theta + \frac{\partial \dot{x}}{\partial Z}\, e_Z,$

where

(5)
$\frac{\partial \dot{x}}{\partial R} = e_r \dot{F}_{rR} + e_\theta \dot{F}_{\theta R} + e_z \dot{F}_{zR} = e_r \dot{\alpha} + e_\theta Z\alpha\dot{\omega},$

$\frac{1}{R}\frac{\partial \dot{x}}{\partial \Theta} = e_r \dot{F}_{r\Theta} + e_\theta \dot{F}_{\theta\Theta} + e_z \dot{F}_{z\Theta} = -e_r Z\alpha\dot{\omega} + e_\theta \dot{\alpha},$

$\frac{\partial \dot{x}}{\partial Z} = e_r \dot{F}_{rZ} + e_\theta \dot{F}_{\theta Z} + e_z \dot{F}_{zZ} = -e_r RZ\alpha\omega\dot{\omega} + e_\theta R(\dot{\alpha}\omega + \alpha\dot{\omega}) + \dot{\lambda}\, e_z,$

where $\dot{F}_{rR}, \dot{F}_{r\Theta}, \ldots$ are the components of the two-point tensor $\dot{F}$. The
lower-case subscripts denote the association with the current coordinate
frame and the upper-case subscripts denote the association with the
reference frame. A comparison of the components on both sides of Eq. (5)
yields $\dot{F}_{rR}, \dot{F}_{r\Theta}, \ldots$, etc. Following this procedure, one obtains


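(6)  $F = \begin{pmatrix} \dfrac{\partial r}{\partial R} & \dfrac{1}{R}\dfrac{\partial r}{\partial \Theta} & \dfrac{\partial r}{\partial Z} \\ r\dfrac{\partial \theta}{\partial R} & \dfrac{r}{R}\dfrac{\partial \theta}{\partial \Theta} & r\dfrac{\partial \theta}{\partial Z} \\ \dfrac{\partial z}{\partial R} & \dfrac{1}{R}\dfrac{\partial z}{\partial \Theta} & \dfrac{\partial z}{\partial Z} \end{pmatrix} = \begin{pmatrix} \alpha & 0 & 0 \\ 0 & \alpha & \omega\alpha R_m \\ 0 & 0 & \lambda \end{pmatrix},$

(7)  $\dot{F} = \begin{pmatrix} \dot{\alpha} & -Z\alpha\dot{\omega} & -R_m Z\alpha\omega\dot{\omega} \\ Z\alpha\dot{\omega} & \dot{\alpha} & R_m(\dot{\alpha}\omega + \alpha\dot{\omega}) \\ 0 & 0 & \dot{\lambda} \end{pmatrix},$

in which the mean undeformed radius $R_m$ has been used to replace $R$. From
Eq. (6), it follows that

$F^{-1} = \begin{pmatrix} 1/\alpha & 0 & 0 \\ 0 & 1/\alpha & -\omega R_m/\lambda \\ 0 & 0 & 1/\lambda \end{pmatrix}, \qquad J = \det(F) = \alpha^2 \lambda.$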


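To obtain $R$, $U$ and $V$ in the polar decomposition, one writes the rotational
tensor in the form

$R = \begin{pmatrix} 1 & 0 & 0 \\ 0 & \cos\phi & \sin\phi \\ 0 & -\sin\phi & \cos\phi \end{pmatrix}.$

The solution of the equations $F = R \cdot U = V \cdot R$ gives the right and left
stretch tensors

$U = \alpha \begin{pmatrix} 1 & 0 & 0 \\ 0 & \cos\phi & \sin\phi \\ 0 & \sin\phi & \omega R_m \sin\phi + \dfrac{\lambda}{\alpha}\cos\phi \end{pmatrix}, \qquad V = \begin{pmatrix} \alpha & 0 & 0 \\ 0 & \alpha\omega R_m \sin\phi + \alpha\cos\phi & \lambda\sin\phi \\ 0 & \lambda\sin\phi & \lambda\cos\phi \end{pmatrix},$

where

$\cos\phi = \frac{\lambda + \alpha}{d}, \qquad \sin\phi = \frac{\alpha\omega R_m}{d}, \qquad d = \sqrt{(\lambda + \alpha)^2 + (\alpha\omega R_m)^2}.$

The other quantities can be determined in a straightforward manner:

$C = F^T \cdot F = \begin{pmatrix} \alpha^2 & 0 & 0 \\ 0 & \alpha^2 & \omega\alpha^2 R_m \\ 0 & \omega\alpha^2 R_m & \lambda^2 + \omega^2\alpha^2 R_m^2 \end{pmatrix},$

$B^{-1} = (F^{-1})^T \cdot F^{-1} = \begin{pmatrix} \dfrac{1}{\alpha^2} & 0 & 0 \\ 0 & \dfrac{1}{\alpha^2} & -\dfrac{\omega R_m}{\alpha\lambda} \\ 0 & -\dfrac{\omega R_m}{\alpha\lambda} & \dfrac{1}{\lambda^2} + \dfrac{\omega^2 R_m^2}{\lambda^2} \end{pmatrix}.$
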

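The Lagrangian and the Eulerian strains can be obtained by using $E =
(C - I)/2$ and $e = (I - B^{-1})/2$. The velocity gradient tensor is simply

$L = \dot{F} \cdot F^{-1} = \begin{pmatrix} \dot{\alpha}/\alpha & -\dot{\omega} Z & 0 \\ \dot{\omega} Z & \dot{\alpha}/\alpha & \dot{\omega}\alpha R_m/\lambda \\ 0 & 0 & \dot{\lambda}/\lambda \end{pmatrix}.$

Hence the stretch and spin rates are

$D = \begin{pmatrix} \dot{\alpha}/\alpha & 0 & 0 \\ 0 & \dot{\alpha}/\alpha & \dot{\omega}\alpha R_m/(2\lambda) \\ 0 & \dot{\omega}\alpha R_m/(2\lambda) & \dot{\lambda}/\lambda \end{pmatrix}, \qquad \Omega = \begin{pmatrix} 0 & -\dot{\omega} Z & 0 \\ \dot{\omega} Z & 0 & \dot{\omega}\alpha R_m/(2\lambda) \\ 0 & -\dot{\omega}\alpha R_m/(2\lambda) & 0 \end{pmatrix}.$

One can obtain $L$ directly by expressing $\dot{x}$ in terms of the base vectors of
the current configuration

$\dot{x} = \frac{r\dot{\alpha}}{\alpha}\, e_r + \frac{rz\dot{\omega}}{\lambda}\, e_\theta + \frac{z\dot{\lambda}}{\lambda}\, e_z,$

and taking the spatial derivatives with respect to the deformed coordinates,
i.e., $L = \partial \dot{x}/\partial x$.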

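The Cauchy, first Piola-Kirchhoff and second Piola-Kirchhoff stresses
are given by:

$\tau = \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & \tau \\ 0 & \tau & \sigma \end{pmatrix},$

$T = J F^{-1} \cdot \tau = \begin{pmatrix} 0 & 0 & 0 \\ 0 & -\tau\alpha^2\omega R_m & \tau\alpha\lambda - \sigma\alpha^2\omega R_m \\ 0 & \tau\alpha^2 & \sigma\alpha^2 \end{pmatrix},$

$S = T \cdot (F^{-1})^T = \begin{pmatrix} 0 & 0 & 0 \\ 0 & -2\tau\alpha\omega R_m + \dfrac{\sigma\alpha^2\omega^2 R_m^2}{\lambda} & \tau\alpha - \dfrac{\sigma\alpha^2\omega R_m}{\lambda} \\ 0 & \tau\alpha - \dfrac{\sigma\alpha^2\omega R_m}{\lambda} & \dfrac{\sigma\alpha^2}{\lambda} \end{pmatrix}.$

Here $\tau$ is deduced from the applied axial and torsion loads. A constitutive
law is needed to determine the relation between $\tau$, $\sigma$ and $\lambda$, $\omega$.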
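
The closed-form results of this example can be verified numerically. The sketch below is an illustration only: the parameter values for $\alpha$, $\lambda$, $\omega$, $R_m$, $\tau$ and $\sigma$ are hypothetical. It builds $F$ and $R$ from Eq. (6) and the expressions for $\cos\phi$ and $\sin\phi$, forms $U = R^T \cdot F$ and $V = F \cdot R^T$, and reports how far $U$ and $V$ are from being symmetric, how far $U^2$ and $V^2$ are from $F^T \cdot F$ and $F \cdot F^T$, and how far the computed $S_{\theta\theta}$ is from its closed form.

```python
import math

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)] for i in range(3)]

def transpose(A):
    return [[A[j][i] for j in range(3)] for i in range(3)]

def max_abs_diff(A, B):
    return max(abs(A[i][j] - B[i][j]) for i in range(3) for j in range(3))

def tension_torsion(alpha, lam, omega, Rm):
    """Kinematic tensors of the thin-walled tube, Eq. (6) and the polar decomposition."""
    F = [[alpha, 0.0, 0.0],
         [0.0, alpha, omega * alpha * Rm],
         [0.0, 0.0, lam]]
    Finv = [[1 / alpha, 0.0, 0.0],
            [0.0, 1 / alpha, -omega * Rm / lam],
            [0.0, 0.0, 1 / lam]]
    d = math.hypot(lam + alpha, alpha * omega * Rm)
    c, s = (lam + alpha) / d, alpha * omega * Rm / d      # cos(phi), sin(phi)
    R = [[1.0, 0.0, 0.0], [0.0, c, s], [0.0, -s, c]]
    U = matmul(transpose(R), F)                           # right stretch, F = R . U
    V = matmul(F, transpose(R))                           # left stretch,  F = V . R
    return F, Finv, R, U, V

def checks(alpha=0.95, lam=1.2, omega=0.4, Rm=1.5, tau=30.0, sig=100.0):
    # All parameter values are hypothetical and serve only as a test case.
    F, Finv, R, U, V = tension_torsion(alpha, lam, omega, Rm)
    identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
    J = alpha ** 2 * lam
    cauchy = [[0.0, 0.0, 0.0], [0.0, 0.0, tau], [0.0, tau, sig]]
    T = [[J * x for x in row] for row in matmul(Finv, cauchy)]
    S = matmul(T, transpose(Finv))
    S22_closed = -2 * tau * alpha * omega * Rm + sig * alpha ** 2 * omega ** 2 * Rm ** 2 / lam
    return {
        "F_Finv": max_abs_diff(matmul(F, Finv), identity),
        "U_sym": max_abs_diff(U, transpose(U)),
        "V_sym": max_abs_diff(V, transpose(V)),
        "U2_C": max_abs_diff(matmul(U, U), matmul(transpose(F), F)),
        "V2_B": max_abs_diff(matmul(V, V), matmul(F, transpose(F))),
        "S22": abs(S[1][1] - S22_closed),
        "S_sym": max_abs_diff(S, transpose(S)),
    }

if __name__ == "__main__":
    for name, err in checks().items():
        print(f"{name:7s} {err:.2e}")
```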


Problem 16.4. Show that $E = F^T \cdot e \cdot F$, where $E$ is the Green strain and
$e$ is the Almansi strain.

Problem 16.5. In a simple shear experiment, the motion is described by
$x = X + \gamma Y, \quad y = Y, \quad z = Z.$

Find $F$, $C$, $B$, $R$, $U$, $V$, the principal stretches and the eigenvectors of $C$ and $B$.


16.9. OBJECTIVITY 

Truesdell and Noll (1965) discussed the concept of frame indifference
of world events and the significance of the concept in the formulation of
mechanics. Eringen (1967) introduced the term axiom of objectivity. Simply
stated, the constitutive equation must be invariant under changes of
reference frame. A quantity or an equation is frame indifferent or
objective if it is invariant under changes of reference frame. We give below
the mathematical definition of the term and examine what quantities are
objective and what are not. To appreciate the beauty and significance of
these concepts, one must read the original papers.

Consider two motions $\bar{x}(X, t)$ and $x(X, t)$ of a body that differ by a
rigid-body motion, that is,

(1)  $\bar{x}(X, t) = Q(t) \cdot x(X, t) + c(t),$

where $Q(t)$ is a rotational tensor and $c(t)$ is a vector representing a rigid
body translation. All quantities associated with the two deformations will
have different values in general. In the subsequent discussion, we shall
use the notation $\overline{(\cdot)}$ and $(\cdot)$ to denote the same relevant quantity in the
barred ($\bar{x}$) and the unbarred ($x$) configurations. For instance, $\bar{F}$ and $F$ will
denote the deformation gradients in the $\bar{x}$ and $x$ configurations, respectively.
Truesdell and Noll, and Eringen defined a vector or a tensor of rank 2
to be frame indifferent or objective if its corresponding quantities in the two
configurations are related by the following transformation rules:

(2) ▲  $\bar{v} = Q \cdot v,$

if $v$ is a vector, and

(3) ▲  $\bar{G} = Q \cdot G \cdot Q^T,$

if $G$ is a tensor of rank 2.

We shall now examine the objectivity of various quantities associated
with the two deformations. First consider a line element $d\bar{x}$ in the barred
configuration and its corresponding line element $dx$ in the unbarred
configuration. From Eq. (1), one finds

(4)  $d\bar{x} = Q \cdot dx,$

which satisfies the transformation rule Eq. (2) for a vector. Hence the line
element $dx$ is an objective vector. One can also show that all area elements
and all volume elements are objective. In fact a volume element is always
objective because it is a scalar, which is always so.

Now consider the deformation gradients $\bar{F}$, $F$ in the two configurations.
Since

$\bar{F} \cdot dX = d\bar{x} = Q \cdot dx = Q \cdot F \cdot dX,$

it implies

(5)  $\bar{F} = Q \cdot F.$

This shows that $F$ is not objective because $F$, being a tensor of rank 2, does
not follow the transformation rule Eq. (3) for tensors of rank 2. Note that
$F$ is a two-point tensor. The components associated with the deformed
coordinates change with the reference frame while those associated with
the undeformed coordinates do not. This is because the undeformed
coordinates are fixed whereas the reference frame for the deformed coordinates
changes. Following Eq. (5), one finds

(6)  $\bar{C} = \bar{F}^T \cdot \bar{F} = F^T \cdot Q^T \cdot Q \cdot F = F^T \cdot F = C,$

which indicates that $C$ is also not objective because the transformation does
not follow the rule given in Eq. (3). We should expect $C$ not to change as
shown in Eq. (6) for different reference deformed coordinates because $C$ is
a Lagrangian tensor referred to the original frame, which does not depend
on the current reference frame. In the meantime,

$\bar{B} = \bar{F} \cdot \bar{F}^T = Q \cdot B \cdot Q^T,$

which transforms like Eq. (3). Therefore, $B$ is an objective tensor. This
is also expected because $B$ is an Eulerian tensor referred to the reference
frame. Similarly, the Lagrangian strain tensor $E$ is not objective, because

$\bar{E} = E,$

which does not satisfy Eq. (3). On the other hand the Almansi strain tensor
$e$, an Eulerian tensor, transforms as

$\bar{e} = Q \cdot e \cdot Q^T,$

and is, therefore, objective. Using the polar decomposition and Eq. (5),
one derives

$\bar{F} = \bar{R} \cdot \bar{U} = Q \cdot F = Q \cdot R \cdot U.$

Since $Q \cdot R$ is still a rotational tensor, and $\bar{U}$ and $U$ are symmetric, and
since the polar decomposition is unique, the equation above implies

(7)  $\bar{R} = Q \cdot R,$

(8)  $\bar{U} = U,$

which shows that neither $R$ nor $U$ is objective. However, for the left polar
decomposition,

$\bar{F} = \bar{V} \cdot \bar{R} = Q \cdot V \cdot R,$

one obtains

(9)  $\bar{V} = Q \cdot V \cdot R \cdot \bar{R}^T = Q \cdot V \cdot Q^T,$

which means the left stretch tensor $V$ is objective. Thus $B$, $e$, and $V$, which
are all Eulerian tensors, are objective while $C$, $E$, and $U$, all Lagrangian,
are not.
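
The transformation rules for $C$ and $B$ can be illustrated with a small numerical experiment. In the sketch below the deformation gradient `F` and the superposed rotation `Q` (a rotation about the $z$ axis followed by one about the $x$ axis) are hypothetical sample values. It confirms that $\bar{C} = C$ and $\bar{B} = Q \cdot B \cdot Q^T$, and that $\bar{B}$ itself differs from $B$.

```python
import math

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)] for i in range(3)]

def transpose(A):
    return [[A[j][i] for j in range(3)] for i in range(3)]

def max_abs_diff(A, B):
    return max(abs(A[i][j] - B[i][j]) for i in range(3) for j in range(3))

def rot_z(angle):
    c, s = math.cos(angle), math.sin(angle)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

def rot_x(angle):
    c, s = math.cos(angle), math.sin(angle)
    return [[1.0, 0.0, 0.0], [0.0, c, -s], [0.0, s, c]]

def objectivity_check(F, Q):
    """Compare Cauchy-Green tensors of the motions F and Fbar = Q . F."""
    Fbar = matmul(Q, F)                                   # Eq. (5)
    C = matmul(transpose(F), F)
    Cbar = matmul(transpose(Fbar), Fbar)
    B = matmul(F, transpose(F))
    Bbar = matmul(Fbar, transpose(Fbar))
    QBQt = matmul(matmul(Q, B), transpose(Q))
    return {
        "Cbar_vs_C": max_abs_diff(Cbar, C),               # expected ~0: C is invariant
        "Bbar_vs_QBQt": max_abs_diff(Bbar, QBQt),         # expected ~0: B is objective
        "Bbar_vs_B": max_abs_diff(Bbar, B),               # expected nonzero
    }

def example():
    # Hypothetical sample values, for illustration only.
    F = [[1.2, 0.3, 0.0], [0.1, 0.9, 0.2], [0.0, 0.1, 1.1]]
    Q = matmul(rot_z(0.6), rot_x(0.3))
    return objectivity_check(F, Q)

if __name__ == "__main__":
    print(example())
```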

Vectors and tensors related to the velocity field are in general not
objective. To show this result, we first establish the properties of the rotation
rate tensor. Let us define $\Psi = Q^T \cdot \dot{Q}$. Since $Q^T \cdot Q = I$, it follows that

$\frac{D}{Dt}(Q^T \cdot Q) = \dot{Q}^T \cdot Q + Q^T \cdot \dot{Q} = \Psi^T + \Psi = 0,$

which indicates that $\Psi\ (= -\Psi^T)$ is antisymmetric and that

$\dot{Q} = -Q \cdot \dot{Q}^T \cdot Q.$

For an objective vector $v$, i.e., $\bar{v} = Q \cdot v$, the following is true

(10)  $\dot{\bar{v}} = (Q \cdot v)^{\cdot} = Q \cdot (\dot{v} + \Psi \cdot v),$

which means that $\dot{v}$ is in general not objective. Similarly, for an objective
tensor $G$ of rank 2, i.e., $\bar{G} = Q \cdot G \cdot Q^T$, one can show that

(11)  $\dot{\bar{G}} = Q \cdot (\dot{G} + \Psi \cdot G - G \cdot \Psi) \cdot Q^T.$

Now consider the deformation rate. It can be shown that

$\dot{\bar{F}} = Q \cdot (\dot{F} + \Psi \cdot F) = \bar{L} \cdot \bar{F},$

which leads to

(12)  $\bar{L} = Q \cdot (\dot{F} + \Psi \cdot F) \cdot F^{-1} \cdot Q^T = Q \cdot (L + \Psi) \cdot Q^T = Q \cdot (D + \Omega + \Psi) \cdot Q^T,$

where $L = D + \Omega$, and $D$ and $\Omega$ are the deformation and spin rate tensors.
Splitting $\bar{L}$ into the symmetric part $\bar{D}$ and the antisymmetric part $\bar{\Omega}$, i.e.,
$\bar{L} = \bar{D} + \bar{\Omega}$, we obtain from Eq. (12)

(13)  $\bar{D} = Q \cdot D \cdot Q^T,$

(14)  $\bar{\Omega} = Q \cdot (\Omega + \Psi) \cdot Q^T = Q \cdot \Omega \cdot Q^T + \dot{Q} \cdot Q^T.$

One sees that only $D$ satisfies the transformation law of objectivity while
both the velocity gradient tensor $L$ and the spin tensor $\Omega$ do not.

We shall now examine the objectivity of stress measures. The objectivity
of the surface traction $\overset{\nu}{T}$ is based on physical grounds. Then

$\bar{\tau} \cdot \bar{n} = \overset{\nu}{\bar{T}} = Q \cdot \overset{\nu}{T} = Q \cdot \tau \cdot Q^T \cdot Q \cdot n = Q \cdot \tau \cdot Q^T \cdot \bar{n},$
where $n$ is the unit normal to the element. Therefore

(15)  $\bar{\tau} = Q \cdot \tau \cdot Q^T,$

which indicates that the Cauchy stress is objective. This is expected because,
similar to the case of strain measures, the Cauchy stress is an Eulerian
tensor and, therefore, is objective. For the Lagrangian stress $T$ and
the second Piola-Kirchhoff stress $S$, it is easy to show that

(16)  $\bar{T} = T \cdot Q^T, \qquad \bar{S} = S, \qquad \bar{r}^* = r^* \cdot Q^T, \qquad \bar{\sigma}_r = \sigma_r,$

all of which are not objective. This can be expected because $T$ and $r^*$ are
two-point tensors and $S$ and $\sigma_r$ are Lagrangian tensors.

Finally, we shall discuss the transformation of stress rates. Stress rates
are especially needed in plasticity theory where the constitutive equations
are often expressed in terms of stress increments (rates). They are also
needed in the incremental approach for solving large deformation problems.
From Eq. (16.7:10), using Eqs. (16.5:3a) and (16.6:6) for the rates of
deformation, one finds the rates of various stress measures:

(17)  $\dot{T} = J F^{-1} \cdot [\dot{\tau} + \mathrm{tr}(D)\tau - L \cdot \tau],$

(18)  $\dot{S} = J F^{-1} \cdot [\dot{\tau} + \mathrm{tr}(D)\tau - L \cdot \tau - \tau \cdot L^T] \cdot (F^{-1})^T,$

(19)  $\dot{r}^* = J F^{-1} \cdot [\dot{\tau} + \mathrm{tr}(D)\tau - L \cdot \tau + \tau \cdot \Omega] \cdot R,$

(20)  $\dot{\sigma}_r = J R^T \cdot [\dot{\tau} - \Omega \cdot \tau + \tau \cdot \Omega + \mathrm{tr}(D)\tau] \cdot R.$

Using Eq. (14) in the form $\dot{Q} = \bar{\Omega} \cdot Q - Q \cdot \Omega$ and differentiating Eqs. (15) and
(16), one can establish that

(21)  $\dot{\bar{\tau}} - \bar{\Omega} \cdot \bar{\tau} + \bar{\tau} \cdot \bar{\Omega} = Q \cdot (\dot{\tau} - \Omega \cdot \tau + \tau \cdot \Omega) \cdot Q^T,$

(22)  $\dot{\bar{T}} + \bar{T} \cdot \bar{\Omega} = (\dot{T} + T \cdot \Omega) \cdot Q^T,$

(23)  $\dot{\bar{S}} = \dot{S},$

(24)  $\dot{\bar{r}}^* + \bar{r}^* \cdot \bar{\Omega} = (\dot{r}^* + r^* \cdot \Omega) \cdot Q^T,$

(25)  $\dot{\bar{\sigma}}_r = \dot{\sigma}_r.$

None of the material rates $\dot{\tau}$, $\dot{T}$, $\dot{S}$, $\dot{r}^*$ and $\dot{\sigma}_r$ is objective. However, from
Eq. (21) we see that

(26)  $\overset{\nabla}{\tau} = \dot{\tau} - \Omega \cdot \tau + \tau \cdot \Omega$

is objective and called the Jaumann (corotational) stress rate.
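
Equation (21) can be checked numerically for a simple case. The sketch below is a hedged illustration: the stress history `tau_of_t`, the spin `OMEGA` and the superposed rotation `Q_of_t` (a rotation about the $z$ axis at a constant rate) are all hypothetical. The barred stress rate is obtained by central finite differences, and the barred spin follows Eq. (14). The first returned number measures the violation of Eq. (21), the second shows that the plain material rate does not transform objectively.

```python
import math

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)] for i in range(3)]

def transpose(A):
    return [[A[j][i] for j in range(3)] for i in range(3)]

def add(A, B, sign=1.0):
    return [[A[i][j] + sign * B[i][j] for j in range(3)] for i in range(3)]

def max_abs_diff(A, B):
    return max(abs(A[i][j] - B[i][j]) for i in range(3) for j in range(3))

RATE = 0.8  # hypothetical angular rate of the superposed rigid rotation

def Q_of_t(t):
    c, s = math.cos(RATE * t), math.sin(RATE * t)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

def Qdot_of_t(t):
    c, s = math.cos(RATE * t), math.sin(RATE * t)
    return [[-RATE * s, -RATE * c, 0.0], [RATE * c, -RATE * s, 0.0], [0.0, 0.0, 0.0]]

# Hypothetical stress history tau(t) = TAU0 + t * TAU1 and constant spin OMEGA.
TAU0 = [[10.0, 2.0, -1.0], [2.0, 5.0, 3.0], [-1.0, 3.0, -4.0]]
TAU1 = [[1.0, 0.5, 0.0], [0.5, -2.0, 0.3], [0.0, 0.3, 0.7]]
OMEGA = [[0.0, -0.4, 0.1], [0.4, 0.0, -0.2], [-0.1, 0.2, 0.0]]

def tau_of_t(t):
    return [[TAU0[i][j] + t * TAU1[i][j] for j in range(3)] for i in range(3)]

def tau_bar(t):
    Q = Q_of_t(t)
    return matmul(matmul(Q, tau_of_t(t)), transpose(Q))   # Eq. (15)

def jaumann(tau_dot, tau, omega):
    # Eq. (26): tau_dot - omega . tau + tau . omega
    return add(add(tau_dot, matmul(omega, tau), -1.0), matmul(tau, omega))

def check(t=0.5, h=1e-5):
    Q, Qd = Q_of_t(t), Qdot_of_t(t)
    Qt = transpose(Q)
    tb_plus, tb_minus = tau_bar(t + h), tau_bar(t - h)
    tau_bar_dot = [[(tb_plus[i][j] - tb_minus[i][j]) / (2 * h) for j in range(3)]
                   for i in range(3)]
    omega_bar = add(matmul(matmul(Q, OMEGA), Qt), matmul(Qd, Qt))   # Eq. (14)
    lhs = jaumann(tau_bar_dot, tau_bar(t), omega_bar)
    rhs = matmul(matmul(Q, jaumann(TAU1, tau_of_t(t), OMEGA)), Qt)
    naive = matmul(matmul(Q, TAU1), Qt)   # what an objective material rate would give
    return max_abs_diff(lhs, rhs), max_abs_diff(tau_bar_dot, naive)

if __name__ == "__main__":
    print(check())
```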

In general, for an objective tensor $G$, rates such as

$\overset{\nabla}{G} = \dot{G} - \Omega \cdot G + G \cdot \Omega,$  (Jaumann rate)

$\overset{\vee}{G} = \dot{G} - L \cdot G - G \cdot L^T,$  (Oldroyd rate)

$\overset{\circ}{G} = \dot{G} + L^T \cdot G + G \cdot L$  (Cotter-Rivlin rate)

are all objective. They are, respectively, called the Jaumann, Oldroyd, and
Cotter-Rivlin rates. It can also be shown that the rate
$\dot{G} + \mathrm{tr}(D) G - L \cdot G - G \cdot L^T$ introduced by Truesdell is objective if $G$ is so.

To conclude this section, we mention two useful properties of rates: if
$G$ is a symmetric tensor, then

$\frac{D}{Dt}(G : G) = 2 G : \overset{\nabla}{G};$

and if $G$ is a deviatoric tensor, so is $\overset{\nabla}{G}$.

Problem 16.6. Let $G^*$ be a Lagrangian tensor and $G$ an Eulerian tensor.
Show that if $G$ is related to $G^*$ by $G = (F^{-1})^T \cdot G^* \cdot F^{-1}$, the Cotter-Rivlin rate of $G$ is linearly
related to $\dot{G}^*$; and if $G = F \cdot G^* \cdot F^T / J$, then the Truesdell rate is linearly related
to $\dot{G}^*$.


16.10. EQUATIONS OF MOTION 


Consider the body of a continuum occupying a region $v$ with a boundary
surface $\partial v$ in the deformed state. The corresponding region in the original
(natural or zero-stress) state is $V$ with a boundary surface $\partial V$. The body is
subjected to external loads, which consist of the inertia force, the body force
$b$ (with components $b_1, b_2, b_3$) per unit mass acting on a volume element
$dv$, and the surface traction $\overset{\nu}{T}$ acting on a surface element $da$, whose unit
outer normal is $n$. All these variables are in the Eulerian description.
The corresponding variables for the body force and surface traction are
$b_0$ in $dV$ and $\overset{\nu}{T}_0$ on $dA$ with outward unit normal $N$ in the Lagrangian
description. The original density is $\rho_0$ and the corresponding density in the
deformed state is $\rho$. The notations to be used are given in the accompanying
table below.


                           In Deformed                    In Undeformed
                           Configuration                  Configuration

Region                     $v$                            $V$
Boundary surface           $\partial v$                   $\partial V$
Volume element             $dv = dx_1 dx_2 dx_3$          $dV = dX_1 dX_2 dX_3$
Surface element            $da$                           $dA$
Unit outer normal          $n$, $(n_1, n_2, n_3)$         $N$, $(N_1, N_2, N_3)$
Particle coordinates       $x$, $(x_1, x_2, x_3)$         $X$, $(X_1, X_2, X_3)$
Body force per unit mass   $b$, $(b_1, b_2, b_3)$         $b_0$, $(b_{01}, b_{02}, b_{03})$
Surface traction           $\overset{\nu}{T}$, $(T_1, T_2, T_3)$   $\overset{\nu}{T}_0$, $(T_{01}, T_{02}, T_{03})$
Density                    $\rho$                         $\rho_0$
Stresses                   $\tau$, $(\tau_{ij})$ (Cauchy)         $T$, $(T_{ji})$ (1st Piola-Kirchhoff)
                           $\sigma$, $(\sigma_{ij})$ (Kirchhoff)  $S$, $(S_{ij})$ (2nd Piola-Kirchhoff)

Note that the traction $\overset{\nu}{T}$ is a vector (tensor of rank 1) and the first
Piola-Kirchhoff stress $T$ is a tensor of rank 2.

Consider the motion of the body. Conservation of mass requires that
$\rho\, dv = \rho_0\, dV$ and $b = b_0$, i.e., the mass of a volume element $dV$ and the body
force per unit mass do not change as the body deforms. The conservation
of momentum gives the Eulerian equation of motion of the continuum

(1)  $\nabla_x \cdot \tau + \rho b = \rho \frac{Dv}{Dt}$

[Eq. (5.5:7) in the present notation]. This is the equation of motion in the
current configuration. We can write the equation of motion in a different
reference frame and express the variables in terms of coordinates in that
configuration. One way of doing this is to write the equation of motion in
the integral form as follows: the sum of the total body force and inertia
force acting on the region $v$ ($V$) is

(2)  $\int_v \left(b - \frac{Dv}{Dt}\right) \rho\, dv = \int_V \left(b_0 - \frac{Dv}{Dt}\right) \rho_0\, dV$,  or
$\int_v \left(b_i - \frac{Dv_i}{Dt}\right) \rho\, dv = \int_V \left(b_{0i} - \frac{Dv_i}{Dt}\right) \rho_0\, dV.$

The resultant of the surface traction $\overset{\nu}{T}$ acting on the surface $\partial v$ (or $\partial V$) is

(3)  $\int_{\partial v} \overset{\nu}{T}\, da = \int_{\partial v} n \cdot \tau\, da = \int_{\partial V} N \cdot T\, dA$,  or
$\int_{\partial v} T_i\, da = \int_{\partial v} \tau_{ji} n_j\, da = \int_{\partial V} N_j T_{ji}\, dA,$

in which Eq. (16.2:12) has been used and $T$ is the Lagrangian or the first
Piola-Kirchhoff stress tensor. The use of Gauss’ theorem leads to

(4)  $\int_{\partial v} \overset{\nu}{T}\, da = \int_V \nabla \cdot T\, dV$,  or  $\int_{\partial v} T_i\, da = \int_V \frac{\partial T_{ji}}{\partial X_j}\, dV,$

where $\nabla$ is the gradient operator with respect to the $X$-coordinates. Hence,
by Eqs. (2) and (4), the equation of motion can be written as

(5)  $\int_V \left(\rho_0 b_0 - \rho_0 \frac{Dv}{Dt} + \nabla \cdot T\right) dV = 0.$

Since this equation must be valid for an arbitrary region $V$, the integrand
must vanish. Hence

(6) ▲  $\nabla \cdot T + \rho_0 b_0 = \rho_0 \frac{Dv}{Dt}, \qquad T_{Ki,K} + \rho_0 b_{0i} = \rho_0 \frac{Dv_i}{Dt}$

is the equation of motion in terms of the first Piola-Kirchhoff stress. The
divergence operation is with respect to the undeformed coordinates. A
substitution of Eq. (16.7:6) into Eq. (6) yields the equation of motion in terms
of the second Piola-Kirchhoff stress:

(7) ▲  $\nabla \cdot (S \cdot F^T) + \rho_0 b_0 = \rho_0 \frac{Dv}{Dt}, \qquad \frac{\partial}{\partial X_J}\left(S_{JK} \frac{\partial x_i}{\partial X_K}\right) + \rho_0 b_{0i} = \rho_0 \frac{Dv_i}{Dt}.$

Since $x = X + u$, where $u$ is the displacement vector, Eq. (7) can be
written as

(8) ▲  $\nabla \cdot \left[S \cdot \left(I + \frac{\partial u}{\partial X}\right)^T\right] + \rho_0 b_0 = \rho_0 \frac{Dv}{Dt},$

or in indicial notation

(8a) ▲  $\frac{\partial}{\partial X_k}\left[S_{kj}\left(\delta_{ij} + \frac{\partial u_i}{\partial X_j}\right)\right] + \rho_0 b_{0i} = \rho_0 \frac{Dv_i}{Dt},$

if the deformed and undeformed configurations have the same base vectors.
In this case, there is no distinction between the upper and lower case indices.
Equation (8) exhibits the geometric effect of finite deformation through the
displacement gradient $\partial u_i/\partial X_j$.


16.11. CONSTITUTIVE EQUATIONS OF THERMOELASTIC 
BODIES 

Equations that describe the mechanical properties of materials are called 
the constitutive equations. Materials may be classified according to their 
constitutive equations. The constitutive equations do have to be restricted 
by the laws of thermodynamics. The case of small deformation is discussed 
in Chapters 12 and 13. Large deformation does make the mathematical 
theory more complex. In this section, the thermo-mechanically-simple case 
of an elastic material will be discussed. An elastic material is defined as one
in which the Cauchy stress $\boldsymbol{\tau}$ depends only on the strain, the internal energy,
the temperature and the entropy; it does not depend on the strain rate,
stress rate, electromagnetism, chemical reactions, or biological processes.
The material has no energy dissipation or energy creation mechanism and
is subjected to reversible processes. There is an extensive literature on the
subject (Jones and Treloar 1975, Ling et al. 1993, Ogden 1986, Valanis
and Landel 1967, Wang et al. 1995). A review of hyperelasticity of rubber,
elastomers and biological tissues is given by Beatty (1987).

The constitutive equations must be restricted by the laws of thermodynamics.
The first law of thermodynamics states that, for a closed system,
the rate of work done on the system by all external agencies must equal
the rate of increase of the energy of the system. The rate of mechanical
work done on a volume v by the external forces is

(1) $\dot{W}_m = \int_{\partial v} \overset{\nu}{\mathbf{T}} \cdot \mathbf{v}\, da + \int_v \rho \mathbf{b} \cdot \mathbf{v}\, dv$,

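where $\overset{\nu}{\mathbf{T}}$ is the surface traction and b is the body force per unit mass. The
use of the equation $\overset{\nu}{\mathbf{T}} = \mathbf{n} \cdot \boldsymbol{\tau}$, Newton’s law and Gauss’ theorem in Eq. (1)
leads to

(2) $\dot{W}_m = \frac{D}{Dt} \int_v \frac{1}{2} \rho \mathbf{v} \cdot \mathbf{v}\, dv + \int_v \boldsymbol{\tau} : \mathbf{D}\, dv = \frac{D}{Dt} \int_v \frac{1}{2} \rho v_i v_i\, dv + \int_v \tau_{ij} D_{ij}\, dv$,

where $D_{ij}$ are the components of D. The heat flow into the body is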


$\dot{Q} = -\int_{\partial v} \mathbf{h} \cdot \mathbf{n}\, da + \int_v \rho q\, dv$,


where h is the heat flux vector, n is the unit outward normal to $\partial v$, and q is the
heat source per unit mass. The total energy of the system consists of the
kinetic energy and the internal energy. The first law of thermodynamics or
principle of energy balance requires that the rate of increase of the total
energy of the system is equal to the sum of the mechanical work rate and
the heat flow. Hence

(3) $\frac{D}{Dt} \int_v \frac{1}{2} \rho v_i v_i\, dv + \frac{D}{Dt} \int_v \rho \mathcal{E}\, dv = \dot{W}_m + \dot{Q}$,

where $\mathcal{E}$ is the internal energy per unit mass. Since the volume v is arbitrary,
Eq. (3) implies

(4) $\rho \dot{\mathcal{E}} = \boldsymbol{\tau} : \mathbf{D} + \rho q - \nabla \cdot \mathbf{h}$, $\rho \dot{\mathcal{E}} = \tau_{ij} D_{ij} + \rho q - h_{i,i}$.


The physical implication of reversibility of processes in an elastic body has
been discussed in Chapter 12. It is shown in Sec. 12.3 that in an isentropic
process, the internal energy can serve as the strain energy function so that


In isothermal processes, the Helmholtz free energy per unit undeformed
volume $W_0$ can serve as the strain energy function, so that
1 dWo 1 dW 0 
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Alternatively 

(7) $\mathbf{S} = \frac{\partial W_0}{\partial \mathbf{E}}$.



Hence, from Eq. (16.7:7) the Cauchy stress can be expressed as 

(8) $\boldsymbol{\tau} = \frac{1}{J}\, \mathbf{F} \cdot \frac{\partial W_0}{\partial \mathbf{E}} \cdot \mathbf{F}^T$.


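If the material is isotropic, $W_0$ must be a function of the invariants of E or
C (= I + 2E). A set of such invariants is

$I_1 = \mathrm{tr}(\mathbf{C}) = 3 + 2E_{KK}$, $I_2 = (I_1^2 - \mathbf{C} : \mathbf{C})/2$, $I_3 = \det(\mathbf{C}) = J^2$.

One can then write $W_0$ in the form $W_0(I_1, I_2, I_3)$. It can be shown that

$\frac{\partial I_1}{\partial \mathbf{C}} = \mathbf{I}$, $\frac{\partial I_2}{\partial \mathbf{C}} = I_1 \mathbf{I} - \mathbf{C}$, $\frac{\partial I_3}{\partial \mathbf{C}} = I_3 \mathbf{C}^{-1}$.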


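Then the constitutive law becomes

(9) ▲ $\mathbf{S} = 2 \frac{\partial W_0}{\partial \mathbf{C}} = C_1 \mathbf{I} + C_2 (I_1 \mathbf{I} - \mathbf{C}) + C_3 I_3 \mathbf{C}^{-1}$,

where

(10) $C_1 = 2 \frac{\partial W_0}{\partial I_1}$, $C_2 = 2 \frac{\partial W_0}{\partial I_2}$, $C_3 = 2 \frac{\partial W_0}{\partial I_3}$.

Since S = 0 for zero deformation, we require that

(11) $C_1 + 2C_2 + C_3 = 0$ or $\frac{\partial W_0}{\partial I_1} + 2 \frac{\partial W_0}{\partial I_2} + \frac{\partial W_0}{\partial I_3} = 0$

as $I_1 = 3$, $I_2 = 3$, $I_3 = 1$ and C = I. From Eqs. (9) and (16.7:6), one
obtains the constitutive law for the Cauchy stress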


(12) ▲ $\boldsymbol{\tau} = [C_1 \mathbf{B} + C_2 (I_1 \mathbf{B} - \mathbf{B}^2) + C_3 I_3 \mathbf{I}] / J$.


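From Eq. (16.7:11), $p = -(\boldsymbol{\tau} : \mathbf{I})/3 = -(\mathbf{S} : \mathbf{C})/(3J)$, where p is the
hydrostatic pressure, it follows

(13) $p = -\frac{2}{3J} \left[ I_1 \frac{\partial W_0}{\partial I_1} + 2 I_2 \frac{\partial W_0}{\partial I_2} + 3 I_3 \frac{\partial W_0}{\partial I_3} \right] = -\frac{1}{3J} [C_1 I_1 + 2 C_2 I_2 + 3 C_3 I_3]$.

One can split the stresses as follows into a hydrostatic part, which involves
the hydrostatic pressure p, and a deviatoric part, which has a zero mean as
defined in Eq. (16.7:13):

(14) $\mathbf{S} = -pJ \mathbf{C}^{-1} + C_1 \left( \mathbf{I} - \frac{I_1}{3} \mathbf{C}^{-1} \right) + C_2 \left( I_1 \mathbf{I} - \mathbf{C} - \frac{2 I_2}{3} \mathbf{C}^{-1} \right)$,

(15) $\boldsymbol{\tau} = -p \mathbf{I} + \frac{1}{J} \left[ C_1 \left( \mathbf{B} - \frac{I_1}{3} \mathbf{I} \right) + C_2 \left( I_1 \mathbf{B} - \mathbf{B}^2 - \frac{2 I_2}{3} \mathbf{I} \right) \right]$.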
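
As a numerical sanity check of Eqs. (9), (12) and (13), the following minimal sketch evaluates S and the Cauchy stress for one arbitrary deformation gradient and confirms that pushing S forward reproduces Eq. (12) and that Eq. (13) gives minus one third of the trace of the Cauchy stress. The deformation gradient and the values of C1, C2, C3 are arbitrary illustrative numbers (assumptions, not taken from the text), and the function names are hypothetical.

```python
import numpy as np

def second_pk_stress(F, C1, C2, C3):
    """Eq. (9): S = C1 I + C2 (I1 I - C) + C3 I3 C^{-1}."""
    I = np.eye(3)
    C = F.T @ F
    I1 = np.trace(C)
    I3 = np.linalg.det(C)
    return C1 * I + C2 * (I1 * I - C) + C3 * I3 * np.linalg.inv(C)

def cauchy_stress(F, C1, C2, C3):
    """Eq. (12): tau = [C1 B + C2 (I1 B - B^2) + C3 I3 I] / J."""
    I = np.eye(3)
    B = F @ F.T
    I1 = np.trace(B)          # tr(B) = tr(C)
    J = np.linalg.det(F)
    return (C1 * B + C2 * (I1 * B - B @ B) + C3 * J**2 * I) / J

def pressure(F, C1, C2, C3):
    """Eq. (13): p = -(C1 I1 + 2 C2 I2 + 3 C3 I3) / (3 J)."""
    C = F.T @ F
    I1 = np.trace(C)
    I2 = 0.5 * (I1**2 - np.sum(C * C))
    I3 = np.linalg.det(C)
    return -(C1 * I1 + 2.0 * C2 * I2 + 3.0 * C3 * I3) / (3.0 * np.sqrt(I3))

# Arbitrary illustrative data (det F > 0).
F = np.array([[1.2, 0.3, 0.0], [-0.1, 0.9, 0.2], [0.05, 0.0, 1.1]])
C1, C2, C3 = 0.8, 0.3, -1.4

S = second_pk_stress(F, C1, C2, C3)
tau = cauchy_stress(F, C1, C2, C3)
push_forward = F @ S @ F.T / np.linalg.det(F)   # tau = F S F^T / J
p = pressure(F, C1, C2, C3)
```

Here `push_forward` should agree with `tau` to rounding error, and `p` with `-np.trace(tau) / 3`.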

A special form of the energy density is

(16a) $W_0 = [C_1 (I_1 - 3) + C_2 (I_2 - 3) + b_3 (I_3 - 1) + b_4 (I_3 - 1)^2]/2$,

with $C_1$, $C_2$, $b_3$, $b_4$ being constants and $C_3 = b_3 + 2 b_4 (I_3 - 1)$. Equation (16a)
represents an expansion of $W_0$ in a power series of $I_1 - 3$, $I_2 - 3$, and $I_3 - 1$. If
the last two terms of Eq. (16a) are dropped, it reduces to the Mooney-Rivlin
model for certain incompressible rubber-like materials.

For infinitesimal deformation, we can approximate

$I_2 = 3 + 4E_{KK}$, $I_3 = 1 + 2E_{KK}$, $J = 1 + E_{KK}$, $\mathbf{C}^{-1} = \mathbf{I} - 2\mathbf{E}$,

where E's are strains. The energy density becomes

$W_0 = (C_1 + 2C_2 + b_3) E_{KK} - (b_3 + C_2) E_{IJ} E_{IJ} + (b_3 + 2b_4 + C_2)(E_{KK})^2$.

To assure that $W_0$ is consistent with Hooke’s law, we require

$2C_1 + 4C_2 + 2b_3 = 0$, $-b_3 - C_2 = G$, $2b_3 + 4b_4 + 2C_2 = \lambda = \frac{2G\nu}{1 - 2\nu}$,

i.e.,

$C_2 = G - C_1$, $b_3 = C_1 - 2G$, $b_4 = \frac{\lambda + 2G}{4} = \frac{G(1 - \nu)}{2(1 - 2\nu)}$,

where G is the shear modulus, $\lambda$ is the Lamé constant and $\nu$ is Poisson’s
ratio. From Eqs. (9) and (12) or Eqs. (14) and (15), we obtain Hooke’s law

$\boldsymbol{\tau} = \mathbf{S} = 2G\mathbf{E} + \lambda E_{KK} \mathbf{I} = 2G \left( \mathbf{E} - \frac{1}{3} E_{KK} \mathbf{I} \right) - p\mathbf{I}$.
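
The small-strain reduction can be checked numerically. The sketch below, under the assumption that the constants of Eq. (16a) are chosen as $C_2 = G - C_1$, $b_3 = C_1 - 2G$, $b_4 = (\lambda + 2G)/4$, evaluates S from Eq. (16.11:9) for a deformation gradient I + εH and compares it with Hooke's law; the mismatch should shrink like ε². The values of G, ν, C1 and the matrix H are arbitrary illustrative choices, and the function names are hypothetical.

```python
import numpy as np

def constants_16a(G, nu, C1):
    """Constants of Eq. (16a) that make W0 consistent with Hooke's law."""
    lam = 2.0 * G * nu / (1.0 - 2.0 * nu)
    C2 = G - C1
    b3 = C1 - 2.0 * G
    b4 = (lam + 2.0 * G) / 4.0
    return lam, C2, b3, b4

def second_pk_16a(F, C1, C2, b3, b4):
    """S from Eq. (9) with C3 = b3 + 2 b4 (I3 - 1) for the energy (16a)."""
    I = np.eye(3)
    C = F.T @ F
    I1, I3 = np.trace(C), np.linalg.det(C)
    C3 = b3 + 2.0 * b4 * (I3 - 1.0)
    return C1 * I + C2 * (I1 * I - C) + C3 * I3 * np.linalg.inv(C)

def hooke_mismatch(eps, G=1.0, nu=0.3, C1=0.7):
    """Largest entry of |S - (2 G E + lambda tr(E) I)| for F = I + eps * H."""
    lam, C2, b3, b4 = constants_16a(G, nu, C1)
    H = np.array([[0.5, 0.2, -0.1], [0.0, -0.3, 0.4], [0.1, 0.2, 0.2]])
    F = np.eye(3) + eps * H
    E = 0.5 * (F.T @ F - np.eye(3))          # Green strain
    S = second_pk_16a(F, C1, C2, b3, b4)
    hooke = 2.0 * G * E + lam * np.trace(E) * np.eye(3)
    return np.max(np.abs(S - hooke))
```

Calling `hooke_mismatch(1e-3)` and `hooke_mismatch(1e-4)` should show the error dropping by roughly two orders of magnitude, which is the expected second-order behavior.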


Another commonly used energy density is

(16b) $W_0 = [C_1 (I_1 - 3) + C_2 (I_2 - 3) + b_3 (1/I_3^2 - 1) + b_4 (I_3 - 1)^2]/2$.

For infinitesimal deformation, the energy density becomes

$W_0 = (C_1 + 2C_2 - 2b_3) E_{KK} + (2b_3 - C_2) E_{IJ} E_{IJ} + (4b_3 + 2b_4 + C_2)(E_{KK})^2$,

with

$2C_1 + 4C_2 - 4b_3 = 0$, $2b_3 - C_2 = G$, $8b_3 + 4b_4 + 2C_2 = \lambda = \frac{2G\nu}{1 - 2\nu}$,

i.e.,

$C_1 + C_2 = G$, $b_3 = \frac{1}{2} C_1 + C_2$, $b_4 = \frac{(5\nu - 2) C_1 + (11\nu - 5) C_2}{2(1 - 2\nu)}$.

For incompressible materials, J = 1, and Eqs. (7) and (8) no longer hold;
neither do Eqs. (9) and (12). Since the rate of volume change is

$\frac{DJ}{Dt} = \frac{\partial J}{\partial \mathbf{E}} : \dot{\mathbf{E}} = J \mathbf{C}^{-1} : \dot{\mathbf{E}}$,

the incompressibility condition requires

(17) ▲ $\mathbf{C}^{-1} : \dot{\mathbf{E}} = 0$,

which represents an internal constraint on $\dot{\mathbf{E}}$. We can modify Eq. (7) to
reflect the constraint of Eq. (17) and obtain the constitutive equation

(18) ▲ $\mathbf{S} = -p' \mathbf{C}^{-1} + \frac{\partial W_0}{\partial \mathbf{E}}$,

where p' is a Lagrangian multiplier relating to the hydrostatic pressure. The
expression Eq. (18) is not unique for incompressible materials when J = 1
is taken into account. For example, if $W_0$ is replaced by $W_0 + f(\mathbf{C})(J - 1)$,
the value of $W_0$ remains the same, but the derivative of $W_0$ with respect to
E changes. The constitutive equation becomes

$\mathbf{S} = -p' \mathbf{C}^{-1} + \frac{\partial W_0}{\partial \mathbf{E}} + f(\mathbf{C}) J \mathbf{C}^{-1}$.

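However, no ambiguity arises in the determination of the deviatoric stress
as defined in Eq. (16.7:12)

(19) $\mathbf{S}'' = \mathbf{S} - \frac{1}{3} (\mathbf{S} : \mathbf{C}) \mathbf{C}^{-1} = \frac{\partial W_0}{\partial \mathbf{E}} - \frac{1}{3} \left( \frac{\partial W_0}{\partial \mathbf{E}} : \mathbf{C} \right) \mathbf{C}^{-1}$.

The additional term in S gives no contribution to the deviatoric stress. We
can then rewrite the constitutive equation in the form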


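(20) $\mathbf{S} = -p \mathbf{C}^{-1} + \frac{\partial W_0}{\partial \mathbf{E}} - \frac{1}{3} \left( \frac{\partial W_0}{\partial \mathbf{E}} : \mathbf{C} \right) \mathbf{C}^{-1}$,

where

(21) $p = p' - \frac{1}{3} \left( \frac{\partial W_0}{\partial \mathbf{E}} : \mathbf{C} \right)$

is the hydrostatic pressure as defined in Eq. (16.7:11) and has to be
determined directly from the field solution. Accordingly the constitutive
equation for the Cauchy stress becomes

(22) $\boldsymbol{\tau} = -p \mathbf{I} + \mathbf{F} \cdot \left[ \frac{\partial W_0}{\partial \mathbf{E}} - \frac{1}{3} \left( \frac{\partial W_0}{\partial \mathbf{E}} : \mathbf{C} \right) \mathbf{C}^{-1} \right] \cdot \mathbf{F}^T$.
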
If the material is isotropic and incompressible, one can write Eqs. (20) 
and (22) in the form of Eqs. (14) and (15). In other words, the constitutive 
laws Eqs. (14) and (15) are valid for incompressible materials, except that 
the hydrostatic pressure must be determined directly from the field equa¬ 
tions instead of Eq. (13). For infinitesimal deformation, Eqs. (14) and (15) 
reduce to 

$\boldsymbol{\tau} = \mathbf{S} = -p \mathbf{I} + 2G (\mathbf{E} - E_{KK} \mathbf{I}/3)$.

We can establish a formulation valid for compressible as well as incom¬ 
pressible materials. Following the division of the stresses into the deviatoric 
stress and hydrostatic pressure, we split the deformation into a dilatational 
part, representing the volume change of a differential element of the body, 
and a distortion of its shape at constant volume. The procedure is known 
as kinematic split, which involves multiplicative decomposition in the case 
of finite strain. This formulation is particularly useful for treating nearly 
incompressible materials. Let us define 

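(23) $\mathbf{U}' = \mathbf{U} J^{-1/3}$,

then

$\mathbf{U} = J^{1/3} \mathbf{U}' = \mathbf{U}' \cdot (\mathbf{I} J^{1/3})$, $\det(\mathbf{U}') = \det(\mathbf{U}) J^{-1} = 1$.

We thus obtain the multiplicative decomposition with U' representing the
distortion of the right stretch tensor U, whereas $J^{1/3} \mathbf{I}$ is the dilatation.
As we shall see later, the decomposition of the deformation rate becomes
additive and is in the same form as that in the infinitesimal strain case.

Accordingly, one can define

$\mathbf{C}' = \mathbf{C} J^{-2/3}$,

and write the strain energy density in the form

(24) $W_0^*(\mathbf{C}', J) = W_0(\mathbf{C})$

such that

(25) ▲ $\mathbf{S}' = \frac{\partial W_0^*(\mathbf{C}', J)}{\partial \mathbf{C}'}$, $p = -\frac{\partial W_0^*(\mathbf{C}', J)}{\partial J}$.

The relationship between the second Piola-Kirchhoff stress S and S', and
p can be derived as follows:

(26) ▲ $\mathbf{S} : \delta\mathbf{E} = \delta W_0 = \frac{\partial W_0^*}{\partial \mathbf{C}'} : \delta\mathbf{C}' + \frac{\partial W_0^*}{\partial J} \delta J = \mathbf{S}' : \delta\mathbf{C}' - p\, \delta J$.

From the property of the determinant [$J = \sqrt{\det(\mathbf{C})}$], it can be shown that

$\delta J = \frac{\partial J}{\partial \mathbf{C}} : \delta\mathbf{C} = \frac{J}{2} \mathbf{C}^{-1} : \delta\mathbf{C}$.

Using the result above, one derives

$\delta\mathbf{C}' = J^{-2/3} \delta\mathbf{C} - \frac{2}{3} J^{-5/3} \mathbf{C}\, \delta J = J^{-2/3} \delta\mathbf{C} - \frac{1}{3} J^{-2/3} \mathbf{C} (\mathbf{C}^{-1} : \delta\mathbf{C})$.

A substitution of the last two equations into Eq. (26) leads to

(27) ▲ $\mathbf{S} = 2 J^{-2/3} \mathbf{S}' - 2 J^{-2/3} (\mathbf{S}' : \mathbf{C}) \mathbf{C}^{-1}/3 - pJ \mathbf{C}^{-1}$.

Taking the inner product of the equation above with C, one can show that

(28) $p = -\frac{1}{3J} (\mathbf{S} : \mathbf{C})$,

in which the result $\mathbf{C}^{-1} : \mathbf{C} = 3$ has been used. Equation (28) establishes
that p as defined in Eq. (25) is indeed the hydrostatic pressure, the mean
of the Cauchy stress. Note that S' is not deviatoric, since $\mathbf{S}' : \mathbf{C} \neq 0$.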
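
The algebra leading from Eq. (27) to Eq. (28) can be spot-checked numerically. In the sketch below, S' and p are arbitrary illustrative inputs (in the text they come from a strain energy via Eq. (25)); S is assembled from Eq. (27) and the pressure is recovered from Eq. (28). The function name and all numerical values are assumptions made for illustration only.

```python
import numpy as np

def second_pk_from_split(S_prime, p, C):
    """Eq. (27): S = 2 J^{-2/3} [S' - (S':C) C^{-1}/3] - p J C^{-1}."""
    J = np.sqrt(np.linalg.det(C))
    C_inv = np.linalg.inv(C)
    bracket = S_prime - np.sum(S_prime * C) / 3.0 * C_inv
    return 2.0 * J ** (-2.0 / 3.0) * bracket - p * J * C_inv

# Arbitrary illustrative deformation and stress-like quantities.
F = np.array([[1.2, 0.3, 0.0], [-0.1, 0.9, 0.2], [0.05, 0.0, 1.1]])
C = F.T @ F
J = np.sqrt(np.linalg.det(C))
C_dist = C * J ** (-2.0 / 3.0)          # distortional part C' = C J^{-2/3}

S_prime = np.array([[0.4, 0.1, 0.0], [0.1, -0.2, 0.3], [0.0, 0.3, 0.5]])
p = 0.25

S = second_pk_from_split(S_prime, p, C)
p_recovered = -np.sum(S * C) / (3.0 * J)   # Eq. (28)
```

The value `p_recovered` should equal the input `p` whatever symmetric S' is supplied, and `C_dist` has unit determinant.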


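Problem 16.7. Let $I_1$, $I_2$ and $I_3$ be the principal invariants of C. Show that

$I_2 = (I_1^2 - \mathbf{C} : \mathbf{C})/2 = I_3\, \mathrm{tr}(\mathbf{C}^{-1})$.

For incompressible materials $I_3 = 1$, then $I_2 = \mathrm{tr}(\mathbf{C}^{-1})$. (Hint: Express the
invariants in terms of the eigenvalues of C.)


Problem 16.8. Show that

$\mathbf{C} = I_1 \mathbf{I} - I_2 \mathbf{C}^{-1} + I_3 \mathbf{C}^{-2}$,

without using the Cayley-Hamilton theorem. (Hint: Use the same approach as
that for Problem 16.7 or differentiate the result of Problem 16.7 with respect
to C.)

Problem 16.9. Using $\mathbf{C} = I_1 \mathbf{I} - I_2 \mathbf{C}^{-1} + I_3 \mathbf{C}^{-2}$ and $I_2 = I_3\, \mathrm{tr}(\mathbf{C}^{-1})$,
show that $\mathbf{B}^2 = I_1 \mathbf{B} - I_2 \mathbf{I} + I_3 \mathbf{B}^{-1}$ and that

$\mathbf{S} = -pJ \mathbf{C}^{-1} + C_1 \left( \mathbf{I} - \frac{1}{3} I_1 \mathbf{C}^{-1} \right) + C_2 \left( \frac{1}{3} I_2 \mathbf{C}^{-1} - I_3 \mathbf{C}^{-2} \right)$,

$\boldsymbol{\tau} = -p \mathbf{I} + C_1 \left( \mathbf{B} - \frac{1}{3} I_1 \mathbf{I} \right) \frac{1}{J} + C_2 \left( \frac{1}{3} I_2 \mathbf{I} - I_3 \mathbf{B}^{-1} \right) \frac{1}{J}$.


16.12. MORE EXAMPLES

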
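Uniform Tension. We consider a unit cube with sides parallel to the
Cartesian axes (X, Y, Z) that is deformed into a cuboid of dimensions
$\lambda_1$, $\lambda_2$, $\lambda_3$ parallel to the X, Y, Z axes, respectively. The material is
compressible and the energy density is in the form of Eq. (16.11:16a). The
deformed coordinates are referred to the Cartesian axes x, y, z, such that

(1) $x = \lambda_1 X$, $y = \lambda_2 Y$, $z = \lambda_3 Z$.

The components of the deformation gradient F, the metric tensors C,
B and the Green strain tensor E are

$\mathbf{F} = \begin{bmatrix} \lambda_1 & 0 & 0 \\ 0 & \lambda_2 & 0 \\ 0 & 0 & \lambda_3 \end{bmatrix}$, $\mathbf{C} = \mathbf{B} = \begin{bmatrix} \lambda_1^2 & 0 & 0 \\ 0 & \lambda_2^2 & 0 \\ 0 & 0 & \lambda_3^2 \end{bmatrix}$, $\mathbf{E} = \frac{1}{2} \begin{bmatrix} \lambda_1^2 - 1 & 0 & 0 \\ 0 & \lambda_2^2 - 1 & 0 \\ 0 & 0 & \lambda_3^2 - 1 \end{bmatrix}$,

in which B is objective and F, C and E are normally not. The invariants
are

$I_1 = \mathrm{tr}(\mathbf{C}) = \lambda_1^2 + \lambda_2^2 + \lambda_3^2$,
$I_2 = (I_1^2 - \mathbf{C} : \mathbf{C})/2 = \lambda_1^2 \lambda_2^2 + \lambda_2^2 \lambda_3^2 + \lambda_3^2 \lambda_1^2$,
$I_3 = \det(\mathbf{C}) = J^2 = \lambda_1^2 \lambda_2^2 \lambda_3^2$.

The inverse of C is simply

$\mathbf{C}^{-1} = \begin{bmatrix} 1/\lambda_1^2 & 0 & 0 \\ 0 & 1/\lambda_2^2 & 0 \\ 0 & 0 & 1/\lambda_3^2 \end{bmatrix}$.
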
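From Eqs. (16.11:9) and (16.11:12), it follows

$\mathbf{S} = C_1 \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{bmatrix} + C_2 \begin{bmatrix} \lambda_2^2 + \lambda_3^2 & 0 & 0 \\ 0 & \lambda_3^2 + \lambda_1^2 & 0 \\ 0 & 0 & \lambda_1^2 + \lambda_2^2 \end{bmatrix} + C_3 \lambda_1^2 \lambda_2^2 \lambda_3^2 \begin{bmatrix} 1/\lambda_1^2 & 0 & 0 \\ 0 & 1/\lambda_2^2 & 0 \\ 0 & 0 & 1/\lambda_3^2 \end{bmatrix}$,

$\boldsymbol{\tau} = \frac{C_1}{J} \begin{bmatrix} \lambda_1^2 & 0 & 0 \\ 0 & \lambda_2^2 & 0 \\ 0 & 0 & \lambda_3^2 \end{bmatrix} + \frac{C_2}{J} \begin{bmatrix} (\lambda_2^2 + \lambda_3^2) \lambda_1^2 & 0 & 0 \\ 0 & (\lambda_3^2 + \lambda_1^2) \lambda_2^2 & 0 \\ 0 & 0 & (\lambda_1^2 + \lambda_2^2) \lambda_3^2 \end{bmatrix} + \frac{C_3}{J} \lambda_1^2 \lambda_2^2 \lambda_3^2 \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{bmatrix}$,

where, from Eq. (16.11:16a),

$C_3 = b_3 + 2 b_4 (\lambda_1^2 \lambda_2^2 \lambda_3^2 - 1)$.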


Note that $\boldsymbol{\tau}$ is objective and S is not. All stress components are constant
and therefore the equilibrium equations are satisfied. Since the deformation
involves uniform stretching only, all shear stresses are zero and the
components of S are proportional to those of $\boldsymbol{\tau}$, i.e., $\tau_{ii} = S_{ii} \lambda_i^2 / J$, i = 1, 2, 3
(not summed). These results are also true locally for nonuniform
deformation if the referenced coordinates coincide with the principal stress
directions, where $\lambda_i$ is the stretch in the ith principal direction.

For the particular case of simple extension under a force parallel to the
x-axis, $\lambda_2 = \lambda_3$, $\tau_{22} = \tau_{33} = 0$, hence

$\lambda_1 \tau_{22} = C_1 + C_2 (\lambda_1^2 + \lambda_2^2) + [b_3 + 2 b_4 (\lambda_1^2 \lambda_2^4 - 1)] \lambda_1^2 \lambda_2^2 = 0$.

One can solve for $\lambda_2^2$ in terms of $\lambda_1^2$. The physically relevant solution is
the real root that reduces to $\lambda_2^2 = 1$ when $\lambda_1^2 = 1$. Replacing $\lambda_K^2$ with
$1 + 2E_{KK}$ (K = 1, 2, not summed) and solving for $E_{22}$, one finds

$\frac{E_{22}}{E_{11}} = -\nu(E_{11})$,

in which $\nu(E_{11})$ is a complicated function of $E_{11}$. For infinitesimal
deformation, it reduces to the well-known Poisson’s ratio

$\nu(0) = \frac{C_2 + b_3 + 2 b_4}{C_2 + b_3 + 4 b_4} = \frac{\lambda}{2(\lambda + G)}$.

For incompressible materials, $I_3 = 1$, the Cauchy stress for the Mooney-Rivlin
material becomes

$\boldsymbol{\tau} = -\left( p + \frac{C_1 I_1}{3} + \frac{2 C_2 I_2}{3} \right) \mathbf{I} + C_1 \begin{bmatrix} \lambda_1^2 & 0 & 0 \\ 0 & \lambda_2^2 & 0 \\ 0 & 0 & \lambda_3^2 \end{bmatrix} + C_2 \begin{bmatrix} (\lambda_2^2 + \lambda_3^2) \lambda_1^2 & 0 & 0 \\ 0 & (\lambda_3^2 + \lambda_1^2) \lambda_2^2 & 0 \\ 0 & 0 & (\lambda_1^2 + \lambda_2^2) \lambda_3^2 \end{bmatrix}$.
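
Returning to the compressible simple-extension condition $\lambda_1 \tau_{22} = 0$ of this section, the following minimal sketch solves it numerically. With $x = \lambda_2^2$ and $L = \lambda_1^2$ the condition is the cubic $2 b_4 L^2 x^3 + [C_2 + (b_3 - 2 b_4) L] x + (C_1 + C_2 L) = 0$; the sketch picks the real root closest to 1, which is only a reasonable selection rule for small to moderate strains and is an assumption of this example. The constants are illustrative values chosen to satisfy the Hooke's-law relations of Sec. 16.11 for G = 1 and ν = 0.3, and the function name is hypothetical.

```python
import numpy as np

def lateral_stretch_sq(lam1_sq, C1, C2, b3, b4):
    """Solve lambda_1 * tau_22 = 0 for x = lambda_2^2 (energy of Eq. 16.11:16a)."""
    L = lam1_sq
    coeffs = [2.0 * b4 * L**2, 0.0, C2 + (b3 - 2.0 * b4) * L, C1 + C2 * L]
    roots = np.roots(coeffs)
    real_roots = roots[np.abs(roots.imag) < 1e-9].real
    # Keep the branch that continues from the undeformed state x = 1.
    return real_roots[np.argmin(np.abs(real_roots - 1.0))]

# Illustrative constants consistent with Hooke's law (G = 1, nu = 0.3).
G, nu, C1 = 1.0, 0.3, 0.7
C2, b3 = G - C1, C1 - 2.0 * G
b4 = (2.0 * G * nu / (1.0 - 2.0 * nu) + 2.0 * G) / 4.0

E11 = 1.0e-5                                   # small axial Green strain
x = lateral_stretch_sq(1.0 + 2.0 * E11, C1, C2, b3, b4)
E22 = 0.5 * (x - 1.0)                          # lateral Green strain
nu_effective = -E22 / E11
nu_small_strain = (C2 + b3 + 2.0 * b4) / (C2 + b3 + 4.0 * b4)
```

For small `E11` the ratio `nu_effective` should approach `nu_small_strain`, which for these constants equals the Poisson's ratio used to build them.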


Problem 16.10. Find the Cauchy stress for the uniform extension problem
with the strain energy density

$W_0 = [b_1 (I_1 - 3) + b_2 (I_2 - 3) + b_3 (J - 1) + b_4 (J - 1)^2]/2$,

where b's are constants. Show that (a) we must have

$2 b_1 + 4 b_2 + b_3 = 0$,

to assure that the stresses are zero for zero deformation; (b) for simple extension
in the X-direction

$\frac{E_{22}}{E_{11}} = -\nu(E_{11}) = -\frac{2 (b_2 + b_4) E_{11} + (b_1 + 2 b_2 + b_4)(1 - \sqrt{1 + 2 E_{11}})}{2 [b_2 + b_4 (1 + 2 E_{11})] E_{11}}$.


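Simple Shear. Consider the simple shear deformation

(2) $x = X + \gamma Y$, $y = Y$, $z = Z$,

where (x, y, z) and (X, Y, Z) are, respectively, the deformed and undeformed
Cartesian coordinates. The matrices for deformation and deformation
rates are

$\mathbf{F} = \begin{bmatrix} 1 & \gamma & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{bmatrix}$, $\mathbf{R} = \begin{bmatrix} \cos\varphi & \sin\varphi & 0 \\ -\sin\varphi & \cos\varphi & 0 \\ 0 & 0 & 1 \end{bmatrix}$, $\mathbf{U} = \begin{bmatrix} \cos\varphi & \sin\varphi & 0 \\ \sin\varphi & (1 + \sin^2\varphi)/\cos\varphi & 0 \\ 0 & 0 & 1 \end{bmatrix}$,

$\mathbf{C} = \begin{bmatrix} 1 & \gamma & 0 \\ \gamma & 1 + \gamma^2 & 0 \\ 0 & 0 & 1 \end{bmatrix}$, $\mathbf{C}^{-1} = \begin{bmatrix} 1 + \gamma^2 & -\gamma & 0 \\ -\gamma & 1 & 0 \\ 0 & 0 & 1 \end{bmatrix}$,

$\mathbf{B} = \begin{bmatrix} 1 + \gamma^2 & \gamma & 0 \\ \gamma & 1 & 0 \\ 0 & 0 & 1 \end{bmatrix}$, $\mathbf{E} = \frac{1}{2} \begin{bmatrix} 0 & \gamma & 0 \\ \gamma & \gamma^2 & 0 \\ 0 & 0 & 0 \end{bmatrix}$,

$\mathbf{D} = \frac{1}{2} \begin{bmatrix} 0 & \dot{\gamma} & 0 \\ \dot{\gamma} & 0 & 0 \\ 0 & 0 & 0 \end{bmatrix}$, $\boldsymbol{\Omega} = \frac{1}{2} \begin{bmatrix} 0 & \dot{\gamma} & 0 \\ -\dot{\gamma} & 0 & 0 \\ 0 & 0 & 0 \end{bmatrix}$,

where $\varphi = \tan^{-1}(\gamma/2)$. The strain invariants are

$I_1 = I_2 = 3 + \gamma^2$, $I_3 = J^2 = 1$.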
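
The simple shear kinematics can be verified with a short numerical sketch. It builds F for one illustrative value of the shear γ (an arbitrary assumption), obtains the polar decomposition F = R·U from a singular value decomposition, and compares R and U with the closed forms in terms of φ = arctan(γ/2), together with the invariants.

```python
import numpy as np

gamma = 0.8                                    # illustrative amount of shear
F = np.array([[1.0, gamma, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]])
C = F.T @ F                                    # right Cauchy-Green tensor
B = F @ F.T                                    # left Cauchy-Green tensor

# Polar decomposition F = R U via SVD: F = W diag(s) Vt, R = W Vt, U = V diag(s) Vt.
W, s, Vt = np.linalg.svd(F)
R = W @ Vt
U = Vt.T @ np.diag(s) @ Vt

# Closed forms with phi = arctan(gamma / 2).
phi = np.arctan(gamma / 2.0)
c, sn = np.cos(phi), np.sin(phi)
R_closed = np.array([[c, sn, 0.0], [-sn, c, 0.0], [0.0, 0.0, 1.0]])
U_closed = np.array([[c, sn, 0.0], [sn, (1.0 + sn**2) / c, 0.0], [0.0, 0.0, 1.0]])

# Principal invariants of C.
I1 = np.trace(C)
I2 = 0.5 * (I1**2 - np.sum(C * C))
I3 = np.linalg.det(C)
```

Because the polar decomposition of an invertible F is unique, `R` and `U` from the SVD should coincide with the closed-form matrices, and `I1` and `I2` should both equal 3 + γ².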

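Note that the simple shear deformation does not involve any dilatation. It
follows from Eqs. (16.11:9) and (16.11:12) that

$\mathbf{S} = C_1 \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{bmatrix} + C_2 \begin{bmatrix} 2 + \gamma^2 & -\gamma & 0 \\ -\gamma & 2 & 0 \\ 0 & 0 & 2 + \gamma^2 \end{bmatrix} + C_3 \begin{bmatrix} 1 + \gamma^2 & -\gamma & 0 \\ -\gamma & 1 & 0 \\ 0 & 0 & 1 \end{bmatrix}$,

$\boldsymbol{\tau} = C_1 \begin{bmatrix} 1 + \gamma^2 & \gamma & 0 \\ \gamma & 1 & 0 \\ 0 & 0 & 1 \end{bmatrix} + C_2 \begin{bmatrix} 2 + \gamma^2 & \gamma & 0 \\ \gamma & 2 & 0 \\ 0 & 0 & 2 + \gamma^2 \end{bmatrix} + C_3 \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{bmatrix}$.

The stresses are uniform and therefore satisfy the equilibrium equations
automatically. It is interesting to note that $\tau_{33} = S_{33} \neq 0$ unless $\gamma = 0$.
In other words, if the body is compressible, we cannot have a traction free
plane for simple shear deformation.

This would not be true if the body is incompressible. In this case, the
Cauchy stresses are in the form

$\boldsymbol{\tau} = -p \mathbf{I} + C_1 \begin{bmatrix} 2\gamma^2/3 & \gamma & 0 \\ \gamma & -\gamma^2/3 & 0 \\ 0 & 0 & -\gamma^2/3 \end{bmatrix} + C_2 \begin{bmatrix} \gamma^2/3 & \gamma & 0 \\ \gamma & -2\gamma^2/3 & 0 \\ 0 & 0 & \gamma^2/3 \end{bmatrix}$,

where p is to be evaluated from the field equations. If there is no stress
($\tau_{i3} = S_{I3} = 0$, i, I = 1, 2, 3) across the planes where z = constant, it
requires that $p = (C_2 - C_1) \gamma^2 / 3$. Then the stresses become

$\boldsymbol{\tau} = \begin{bmatrix} C_1 \gamma^2 & (C_1 + C_2) \gamma & 0 \\ (C_1 + C_2) \gamma & -C_2 \gamma^2 & 0 \\ 0 & 0 & 0 \end{bmatrix}$.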


We can easily compute the surface forces on any surface. For example, let
us consider a surface originally at X = constant. The unit vector normal
to the surface at the deformed state [Eq. (16.2:12)] is

$\mathbf{n} = \frac{1}{\sqrt{1 + \gamma^2}} \begin{bmatrix} 1 \\ -\gamma \\ 0 \end{bmatrix}$.

The surface traction acting over this surface is given by the vector

$\overset{\nu}{\mathbf{T}} = \boldsymbol{\tau} \cdot \mathbf{n} = \frac{1}{\sqrt{1 + \gamma^2}} \begin{bmatrix} -C_2 \gamma^2 \\ (C_1 + C_2) \gamma + C_2 \gamma^3 \\ 0 \end{bmatrix}$.

The normal and tangential components of $\overset{\nu}{\mathbf{T}}$ are, respectively,

$[-C_2 \gamma^4 - (C_1 + 2 C_2) \gamma^2]/(1 + \gamma^2)$, $(C_1 + C_2) \gamma / (1 + \gamma^2)$.

For the surface originally at Y = constant, the unit normal to its deformed
surface is still in the Y-direction. The surface traction is

$\overset{\nu}{\mathbf{T}} = \begin{bmatrix} (C_1 + C_2) \gamma \\ -C_2 \gamma^2 \\ 0 \end{bmatrix}$.

The normal and tangential components of the surface traction are,
respectively, $[-C_2 \gamma^2, -(C_1 + C_2) \gamma]$. The normal component is proportional to $\gamma^2$
as compared to $\gamma^4$ for the former case.
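
The incompressible simple shear results can be reproduced numerically. The sketch below assembles the Cauchy stress from Eq. (16.11:15) with J = 1, fixes p from the condition of zero stress across the planes z = constant, and evaluates the tractions on the surfaces originally at X = constant and Y = constant. The values of γ, C1 and C2 are arbitrary illustrative numbers, the function name is hypothetical, and the tangent direction (γ, 1, 0) used for the tangential component on the X-surface is a choice made for this example.

```python
import numpy as np

def mooney_rivlin_simple_shear(gamma, C1, C2):
    """Cauchy stress in simple shear with tau_33 = 0 (incompressible, J = 1)."""
    I = np.eye(3)
    F = np.array([[1.0, gamma, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]])
    B = F @ F.T
    I1 = np.trace(B)
    I2 = 0.5 * (I1**2 - np.sum(B * B))
    # Deviatoric part of Eq. (16.11:15) with J = 1.
    dev = C1 * (B - I1 / 3.0 * I) + C2 * (I1 * B - B @ B - 2.0 * I2 / 3.0 * I)
    p = dev[2, 2]                 # from tau_33 = -p + dev_33 = 0
    return -p * I + dev, p

gamma, C1, C2 = 0.5, 0.6, 0.2     # illustrative values
tau, p = mooney_rivlin_simple_shear(gamma, C1, C2)

# Surface originally at X = constant: deformed unit normal and a unit tangent.
n = np.array([1.0, -gamma, 0.0]) / np.sqrt(1.0 + gamma**2)
t = np.array([gamma, 1.0, 0.0]) / np.sqrt(1.0 + gamma**2)
traction_X = tau @ n
normal_X = n @ traction_X
tangential_X = t @ traction_X

# Surface originally at Y = constant: the normal stays in the Y-direction.
traction_Y = tau @ np.array([0.0, 1.0, 0.0])
```

The normal component on the X-surface contains a γ⁴ term, whereas on the Y-surface the normal component `traction_Y[1]` is purely quadratic in γ.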

More examples, including simultaneous extension, inflation and torsion
of a cylindrical tube, flexure of a circular cylinder, and the zero-stress state
of arteries, can be found in Green and Zerna (1968), Green and Adkins (1960), and
Fung (1980).

Anisotropic Materials. Isotropic materials are a special group in nature.
Almost all biological tissues are anisotropic. Most crystalline materials
are also anisotropic. An extensive and complete discussion of the various
kinds of symmetry that can be expected in crystals and their reflection on
constitutive equations is given by Green and Adkins (1960). An account
of the constitutive equations of biological tissues is given in Fung (1993).
Current literature can be found in Applied Mechanics Reviews.



16.13. VARIATIONAL PRINCIPLES FOR FINITE
ELASTICITY: COMPRESSIBLE MATERIALS

In this section, we shall discuss variational principles for finite strain
analysis of hyperelastic materials. An early work is by Koiter (1973) on the
complementary energy principle. A general variational principle can
involve displacements, and the different strain and stress measures defined in
Secs. 16.2-7 as independent fields. The following are frequently used
relations from Sec. 16.11. Denoting the strain energy per unit undeformed
volume as $W_0(\mathbf{C})$, one obtains the following constitutive equation

(1) $\frac{\partial W_0}{\partial \mathbf{C}} = \frac{\partial W_0}{\partial (\mathbf{F}^T \cdot \mathbf{F})} = \frac{1}{2} \mathbf{S} = \frac{1}{2} S_{IJ}\, \boldsymbol{\varepsilon}_I \boldsymbol{\varepsilon}_J$,

where F is the deformation gradient tensor, C (= $\mathbf{F}^T \cdot \mathbf{F}$) is the metric
tensor of the deformed body referred to the undeformed configuration [see
Eq. (16.3:1)], and S is the second Piola-Kirchhoff stress tensor. These
quantities are defined in Eqs. (16.2:3), (16.2:7) and (16.7:6). Since

(2) $\frac{\partial (\mathbf{F}^T \cdot \mathbf{F})}{\partial \mathbf{F}} = \frac{\partial (F_{jI} F_{jJ}\, \boldsymbol{\varepsilon}_I \boldsymbol{\varepsilon}_J)}{\partial F_{iK}} = (F_{iI} \delta_{JK} + F_{iJ} \delta_{IK})\, \boldsymbol{\varepsilon}_I \boldsymbol{\varepsilon}_J \mathbf{e}_i \boldsymbol{\varepsilon}_K$,

one can show that

(3) $\frac{\partial W_0}{\partial \mathbf{F}} = \frac{\partial W_0}{\partial (\mathbf{F}^T \cdot \mathbf{F})} : \frac{\partial (\mathbf{F}^T \cdot \mathbf{F})}{\partial \mathbf{F}} = \frac{1}{2} S_{IJ} (F_{iI} \delta_{JK} + F_{iJ} \delta_{IK})\, \mathbf{e}_i \boldsymbol{\varepsilon}_K = F_{iJ} S_{JK}\, \mathbf{e}_i \boldsymbol{\varepsilon}_K = \mathbf{F} \cdot \mathbf{S} = \mathbf{T}^T$,

which is the constitutive equation for T, the Lagrangian or first Piola-
Kirchhoff stress tensor. In Eqs. (1) and (2), the base vectors must be in
the order consistent with that of the subscripts of F’s as shown.
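
The relation of Eq. (3) between the derivative of $W_0$ with respect to F and the product F·S can be checked by finite differences. The sketch below assumes, purely for illustration, the energy density of Eq. (16.11:16a) with arbitrary constants, computes S from Eq. (16.11:9), and compares F·S with a central-difference gradient of the energy. The function names and numerical values are hypothetical.

```python
import numpy as np

# Illustrative constants for the energy density of Eq. (16.11:16a).
C1, C2, b3, b4 = 0.7, 0.3, -1.3, 0.875

def energy(F):
    """W0 = [C1 (I1-3) + C2 (I2-3) + b3 (I3-1) + b4 (I3-1)^2] / 2."""
    C = F.T @ F
    I1 = np.trace(C)
    I2 = 0.5 * (I1**2 - np.sum(C * C))
    I3 = np.linalg.det(C)
    return 0.5 * (C1 * (I1 - 3.0) + C2 * (I2 - 3.0)
                  + b3 * (I3 - 1.0) + b4 * (I3 - 1.0) ** 2)

def second_pk(F):
    """S = 2 dW0/dC from Eq. (16.11:9) with C3 = b3 + 2 b4 (I3 - 1)."""
    I = np.eye(3)
    C = F.T @ F
    I1, I3 = np.trace(C), np.linalg.det(C)
    C3 = b3 + 2.0 * b4 * (I3 - 1.0)
    return C1 * I + C2 * (I1 * I - C) + C3 * I3 * np.linalg.inv(C)

def numerical_dW_dF(F, h=1.0e-6):
    """Central-difference approximation of dW0/dF_iK."""
    grad = np.zeros((3, 3))
    for i in range(3):
        for K in range(3):
            dF = np.zeros((3, 3))
            dF[i, K] = h
            grad[i, K] = (energy(F + dF) - energy(F - dF)) / (2.0 * h)
    return grad

F = np.array([[1.2, 0.3, 0.0], [-0.1, 0.9, 0.2], [0.05, 0.0, 1.1]])
analytic = F @ second_pk(F)        # F . S, i.e. the transpose of T in Eq. (3)
numeric = numerical_dW_dF(F)
```

The arrays `analytic` and `numeric` should agree to within the finite-difference error.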

We shall use $Q_0$ to denote the strain energy per unit undeformed volume
when it is a function of U, the right stretch tensor defined in Eq. (16.4:4).
This is to distinguish $Q_0$ from $W_0$, the strain energy density in terms of F
or C. We define

$Q_0(\mathbf{U}) = W_0(\mathbf{C})$

for the same deformation. Then, it can be shown that

(4) $\frac{\partial Q_0}{\partial \mathbf{U}} = (\mathbf{T} \cdot \mathbf{R})_s = \mathbf{r}$,

where the subscript $(\cdot)_s$ denotes the symmetric part of the relevant quantity
and R is the rotation tensor. Equation (4) is an alternative expression of
the constitutive equation for T.

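Following Atluri and Cazzani (1995), we can derive a mixed four-field
variational principle, which uses the displacement vector u (= x − X), the
rotation tensor R, the right stretch tensor U, and the Lagrangian stress
tensor T as independent fields. The functional is in the form

(5) $\Pi_1(\mathbf{u}, \mathbf{R}, \mathbf{U}, \mathbf{T}) = \int_{V_0} [Q_0(\mathbf{U}) + \mathbf{T}^T : (\mathbf{I} + \nabla_0 \mathbf{u} - \mathbf{R} \cdot \mathbf{U}) - \rho_0 \mathbf{b} \cdot \mathbf{u}]\, dV - \int_{S_{u0}} \mathbf{N} \cdot \mathbf{T} \cdot (\mathbf{u} - \bar{\mathbf{u}})\, dA - \int_{S_{\sigma 0}} \bar{\mathbf{T}} \cdot \mathbf{u}\, dA$,

where b is the body force per unit mass, $V_0$ is the undeformed volume, N
is a unit outward normal to the boundary surface $\partial V_0$ (= $S_0$), and
$S_{\sigma 0} + S_{u0} = \partial V_0 = S_0$ with $S_{\sigma 0}$ and $S_{u0}$ being the parts of the surface $\partial V_0$
where the traction $\bar{\mathbf{T}}$ and the displacement $\bar{\mathbf{u}}$ are prescribed, respectively.
The admissible requirements for the four field variables are: u must be
continuous, R orthogonal, U symmetric and T a general tensor of rank 2.
In Eq. (5), the gradient operator refers to the undeformed coordinates, i.e.,

$\nabla_0 = \boldsymbol{\varepsilon}_I \frac{\partial}{\partial X_I}$.

A four-field mixed principle having u, R, U and T as independent fields
is stated as the stationarity conditions of the functional $\Pi_1$. To derive a
solution of the problem based on the stationarity conditions of the functional
is called an integral formulation. Following the derivation in Secs. 10.1-3,
the first variation of $\Pi_1$ with respect to the independent field variables is

(6) $\delta\Pi_1 = \int_{V_0} \left[ \frac{\partial Q_0}{\partial \mathbf{U}} : \delta\mathbf{U} + \delta\mathbf{T}^T : [\mathbf{I} + \nabla_0 \mathbf{u} - \mathbf{R} \cdot \mathbf{U}] - \mathbf{T}^T : (\delta\mathbf{R} \cdot \mathbf{U}) - \mathbf{T}^T : (\mathbf{R} \cdot \delta\mathbf{U}) + \mathbf{T}^T : \nabla_0 \delta\mathbf{u} - \rho_0 \mathbf{b} \cdot \delta\mathbf{u} \right] dV - \int_{S_{u0}} [\mathbf{N} \cdot \delta\mathbf{T} \cdot (\mathbf{u} - \bar{\mathbf{u}}) + \mathbf{N} \cdot \mathbf{T} \cdot \delta\mathbf{u}]\, dA - \int_{S_{\sigma 0}} \bar{\mathbf{T}} \cdot \delta\mathbf{u}\, dA$,

where $\delta(\cdot)$ denotes the variation of the corresponding field variable, e.g., $\delta\mathbf{u}$
is the variation of u. Taking into account the properties of (:) as given in
Sec. 16.1, one can show that

(7) $\mathbf{T}^T : \nabla_0 \delta\mathbf{u} = \nabla_0 \cdot (\mathbf{T} \cdot \delta\mathbf{u}) - (\nabla_0 \cdot \mathbf{T}) \cdot \delta\mathbf{u}$, or $T_{Ji} \frac{\partial \delta u_i}{\partial X_J} = \frac{\partial (T_{Ji} \delta u_i)}{\partial X_J} - \frac{\partial T_{Ji}}{\partial X_J} \delta u_i$,

(8) $\mathbf{T}^T : (\delta\mathbf{R} \cdot \mathbf{U}) = (\mathbf{T}^T \cdot \mathbf{U} \cdot \mathbf{R}^T) : (\delta\mathbf{R} \cdot \mathbf{R}^T) = (\mathbf{T}^T \cdot \mathbf{U} \cdot \mathbf{R}^T)_a : (\delta\mathbf{R} \cdot \mathbf{R}^T)$,

(9) $\mathbf{T}^T : (\mathbf{R} \cdot \delta\mathbf{U}) = (\mathbf{R}^T \cdot \mathbf{T}^T) : \delta\mathbf{U} = \frac{1}{2} (\mathbf{R}^T \cdot \mathbf{T}^T + \mathbf{T} \cdot \mathbf{R}) : \delta\mathbf{U}$.

The proof is left to the reader. A substitution of Eqs. (7)-(9) into Eq. (6)
and the use of Gauss’ theorem yield

$\delta\Pi_1 = \int_{V_0} \left\{ \left[ \frac{\partial Q_0}{\partial \mathbf{U}} - (\mathbf{T} \cdot \mathbf{R})_s \right] : \delta\mathbf{U} + \delta\mathbf{T}^T : (\mathbf{I} + \nabla_0 \mathbf{u} - \mathbf{R} \cdot \mathbf{U}) - (\mathbf{T}^T \cdot \mathbf{U} \cdot \mathbf{R}^T)_a : (\delta\mathbf{R} \cdot \mathbf{R}^T) - (\nabla_0 \cdot \mathbf{T} + \rho_0 \mathbf{b}) \cdot \delta\mathbf{u} \right\} dV - \int_{S_{u0}} \mathbf{N} \cdot \delta\mathbf{T} \cdot (\mathbf{u} - \bar{\mathbf{u}})\, dA - \int_{S_{\sigma 0}} (\bar{\mathbf{T}} - \mathbf{N} \cdot \mathbf{T}) \cdot \delta\mathbf{u}\, dA$.

The vanishing of $\delta\Pi_1$ for arbitrary continuous $\delta\mathbf{u}$, antisymmetric $\delta\mathbf{R} \cdot \mathbf{R}^T$,
symmetric $\delta\mathbf{U}$ and general tensor $\delta\mathbf{T}$ gives

(10) $\frac{\partial Q_0}{\partial \mathbf{U}} = (\mathbf{T} \cdot \mathbf{R})_s$ (Constitutive law in $V_0$),

(11) $\mathbf{I} + \nabla_0 \mathbf{u} = \mathbf{R} \cdot \mathbf{U}$ (Compatibility condition in $V_0$),

(12) $(\mathbf{T}^T \cdot \mathbf{U} \cdot \mathbf{R}^T)_a = (\mathbf{R} \cdot \mathbf{U} \cdot \mathbf{T})_a = 0$ (Angular momentum balance in $V_0$),

(13) $\nabla_0 \cdot \mathbf{T} + \rho_0 \mathbf{b} = 0$ (Equilibrium equation in $V_0$),

(14) $\mathbf{N} \cdot \mathbf{T} = \bar{\mathbf{T}}$ (Traction boundary condition on $S_{\sigma 0}$),

(15) $\mathbf{u} = \bar{\mathbf{u}}$ (Displacement boundary condition on $S_{u0}$).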


This variational principle is valid for general elastic materials. In this
formulation both the displacement and traction conditions are natural
boundary conditions (see Sec. 10.6). Note that the angular momentum
equation requires $\mathbf{R} \cdot \mathbf{U} \cdot \mathbf{T}$ to be symmetric. As a matter of fact,
$\mathbf{R} \cdot \mathbf{U} \cdot \mathbf{T}$ (= $J \boldsymbol{\tau}$) is the Kirchhoff stress tensor, which of course should be symmetric.

One can obtain other variational principles by selectively enforcing
Eqs. (10)-(15) a priori. For example, setting

$\mathbf{U} = [\mathbf{R}^T \cdot (\mathbf{I} + \nabla_0 \mathbf{u})]_s$

in Eq. (5) will eliminate U from the functional and result in a variational
principle with only three independent field variables u, R, and T.

One can also establish a variational principle involving only the
displacements as the field variable by requiring a priori

$\mathbf{I} + \nabla_0 \mathbf{u} = \mathbf{R} \cdot \mathbf{U}$ in $V_0$

and

$\mathbf{u} = \bar{\mathbf{u}}$ on $S_{u0}$.

Equation (5) then becomes

(16) $\Pi_3(\mathbf{u}) = \int_{V_0} \{ W_0[\mathbf{F}^T \cdot \mathbf{F}] - \rho_0 \mathbf{b} \cdot \mathbf{u} \}\, dV - \int_{S_{\sigma 0}} \bar{\mathbf{T}} \cdot \mathbf{u}\, dA$,

wherein

$\mathbf{F} = \mathbf{I} + \nabla_0 \mathbf{u}$.



566 


LARGE DEFORMATION 


Chap. 16 


Equation (16) is the functional for the commonly known principle of
minimum potential energy, which involves only one field variable u. The
associated field equations are

$\nabla_0 \cdot \mathbf{T} + \rho_0 \mathbf{b} = \nabla_0 \cdot (\mathbf{S} \cdot \mathbf{F}^T) + \rho_0 \mathbf{b} = 0$

in $V_0$, and the traction condition Eq. (14) on $S_{\sigma 0}$ as a natural boundary
condition. The displacement boundary condition Eq. (15) is now a rigid
boundary condition (see Sec. 10.6). One enforces the rigid condition in the
integral formulation and uses $\delta\mathbf{u} = 0$ on $S_{u0}$ in deriving the stationarity
condition for $\Pi_3$.

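Atluri included R as an additional field variable by modifying $\Pi_3$ in the
form

(17) $\Pi_3^*(\mathbf{u}, \mathbf{R}) = \int_{V_0} \left\{ W_0(\mathbf{F}^T \cdot \mathbf{F}) + \frac{\gamma}{4} (\mathbf{R}^T \cdot \mathbf{F})_a^2 - \rho_0 \mathbf{b} \cdot \mathbf{u} \right\} dV - \int_{S_{\sigma 0}} \bar{\mathbf{T}} \cdot \mathbf{u}\, dA$,

wherein

$(\mathbf{R}^T \cdot \mathbf{F})_a^2 = (\mathbf{R}^T \cdot \mathbf{F} - \mathbf{F}^T \cdot \mathbf{R}) : (\mathbf{R}^T \cdot \mathbf{F} - \mathbf{F}^T \cdot \mathbf{R})$,

$\gamma$ is a constant, and $(\cdot)_a$ denotes the skew symmetric part of the
corresponding tensor of rank 2. The constant is of the magnitude of the elastic
modulus of the material. One can show that the variation of $(\mathbf{R}^T \cdot \mathbf{F})_a^2$ with
respect to R and u gives

$\delta (\mathbf{R}^T \cdot \mathbf{F})_a^2 = 4 (\mathbf{F} - \mathbf{R} \cdot \mathbf{F}^T \cdot \mathbf{R}) : \nabla_0 \delta\mathbf{u} + 4 [(\mathbf{R}^T \cdot \mathbf{F} - \mathbf{F}^T \cdot \mathbf{R}) \cdot \mathbf{F}^T \cdot \mathbf{R}] : (\delta\mathbf{R}^T \cdot \mathbf{R})$

by using the properties of the (:) operation described in Sec. 16.1. Applying
Eq. (3), one can derive the field equations

(18) $\nabla_0 \cdot [\mathbf{S} \cdot \mathbf{F}^T + \gamma (\mathbf{F} - \mathbf{R} \cdot \mathbf{F}^T \cdot \mathbf{R})] + \rho_0 \mathbf{b} = 0$,

(19) $(\mathbf{F}^T \cdot \mathbf{R})_a = (\mathbf{R}^T \cdot \mathbf{F})_a$,

in $V_0$ and the natural boundary condition Eq. (14) on $S_{\sigma 0}$. The boundary
condition Eq. (15) is a rigid condition. Equation (19) implies that $\mathbf{R}^T \cdot \mathbf{F}$
is symmetric, i.e.,

(20) $\mathbf{R}^T \cdot \mathbf{F} = \mathbf{F}^T \cdot \mathbf{R}$ or $\mathbf{F} = \mathbf{R} \cdot \mathbf{F}^T \cdot \mathbf{R}$.

One sees that $\gamma$ is the constant factor in the penalty function approach.
Atluri and Cazzani (1995) showed that the finite element solution based on
$\Pi_3^*$ provided better accuracy than that based on $\Pi_3$.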
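
The role of the penalty term can be illustrated numerically. In the sketch below, the quantity $(\mathbf{R}^T \cdot \mathbf{F} - \mathbf{F}^T \cdot \mathbf{R}) : (\mathbf{R}^T \cdot \mathbf{F} - \mathbf{F}^T \cdot \mathbf{R})$ is evaluated for the rotation of the polar decomposition of F, for which it vanishes, and for a deliberately perturbed rotation, for which it is positive. The deformation gradient and the perturbation angle are arbitrary illustrative values, and the function names are hypothetical.

```python
import numpy as np

def polar_rotation(F):
    """Rotation R of the polar decomposition F = R U, computed via SVD."""
    W, _, Vt = np.linalg.svd(F)
    return W @ Vt

def skew_penalty(R, F):
    """(R^T F - F^T R) : (R^T F - F^T R), zero when R^T F is symmetric."""
    A = R.T @ F - F.T @ R
    return np.sum(A * A)

# Arbitrary illustrative deformation gradient (det F > 0).
F = np.array([[1.2, 0.3, 0.0], [-0.1, 0.9, 0.2], [0.05, 0.0, 1.1]])
R = polar_rotation(F)

# A rotation that differs from the polar rotation by 0.1 rad about the z-axis.
angle = 0.1
Rz = np.array([[np.cos(angle), -np.sin(angle), 0.0],
               [np.sin(angle), np.cos(angle), 0.0],
               [0.0, 0.0, 1.0]])
R_off = R @ Rz

penalty_exact = skew_penalty(R, F)      # polar rotation
penalty_off = skew_penalty(R_off, F)    # perturbed rotation
```

For the polar rotation the condition of Eq. (20), $\mathbf{F} = \mathbf{R} \cdot \mathbf{F}^T \cdot \mathbf{R}$, also holds, which can be confirmed with `np.allclose(F, R @ F.T @ R)`.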

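One can also establish different three-field variational principles through
contact transformation. For example, consider the following contact
transformation

(21) $-Q_c(\mathbf{r}) = Q_0(\mathbf{U}) - (\mathbf{T} \cdot \mathbf{R})_s : \mathbf{U}$,

where $\mathbf{r} = (\mathbf{T} \cdot \mathbf{R})_s$ is the symmetrized Biot-Lur’e stress tensor and $Q_c$ is
called the complementary energy density in terms of r (see Sec. 10.9). One
can show that $Q_c$ does not explicitly depend on U, i.e.,

$-\frac{\partial Q_c}{\partial \mathbf{U}} = \frac{\partial Q_0}{\partial \mathbf{U}} - \mathbf{r} = 0$.

A substitution of Eq. (21) into Eq. (5) yields

(22) $\Pi_4(\mathbf{u}, \mathbf{R}, \mathbf{T}) = \int_{V_0} \{ -Q_c[(\mathbf{T} \cdot \mathbf{R})_s] + \mathbf{T}^T : (\mathbf{I} + \nabla_0 \mathbf{u}) - \rho_0 \mathbf{b} \cdot \mathbf{u} \}\, dV - \int_{S_{u0}} \mathbf{N} \cdot \mathbf{T} \cdot (\mathbf{u} - \bar{\mathbf{u}})\, dA - \int_{S_{\sigma 0}} \bar{\mathbf{T}} \cdot \mathbf{u}\, dA$.

The admissible requirements for u, R, and T are the same as those of
Eq. (5). The corresponding field equations of the functional are

(23) $\frac{\partial Q_c}{\partial \mathbf{r}} = \mathbf{R}^T \cdot (\mathbf{I} + \nabla_0 \mathbf{u}) = \mathbf{U}$ in $V_0$,

and Eqs. (12)-(15). Apparently, Eq. (23) is a compatibility condition for
the new functional replacing Eqs. (10) and (11).

One can establish variational principles using u, R, U, r* as
independent fields, where r* is the Biot-Lur’e stress tensor defined in Eq. (16.7:8).
For example, by replacing T with $\mathbf{r}^* \cdot \mathbf{R}^T$ in Eq. (5), one obtains

(24) $\Pi_5(\mathbf{u}, \mathbf{R}, \mathbf{U}, \mathbf{r}^*) = \int_{V_0} \{ Q_0(\mathbf{U}) + \mathbf{r}^{*T} : [\mathbf{R}^T \cdot (\mathbf{I} + \nabla_0 \mathbf{u}) - \mathbf{U}] - \rho_0 \mathbf{b} \cdot \mathbf{u} \}\, dV - \int_{S_{u0}} \mathbf{N} \cdot \mathbf{r}^* \cdot \mathbf{R}^T \cdot (\mathbf{u} - \bar{\mathbf{u}})\, dA - \int_{S_{\sigma 0}} \bar{\mathbf{T}} \cdot \mathbf{u}\, dA$.

One can derive other variational principles by imposing different conditions
a priori as before.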





PROBLEMS


16.11. Show Eq. (16.13:7) using the indicial notation.

$\mathbf{T}^T : \nabla_0 \delta\mathbf{u} = T_{Ji} \frac{\partial \delta u_i}{\partial X_J} = \frac{\partial (T_{Ji} \delta u_i)}{\partial X_J} - \frac{\partial T_{Ji}}{\partial X_J} \delta u_i = \nabla_0 \cdot (\mathbf{T} \cdot \delta\mathbf{u}) - (\nabla_0 \cdot \mathbf{T}) \cdot \delta\mathbf{u}$.

16.12. Show Eq. (16.13:8) using the properties of the (:) operation.

[Hint: $\mathbf{R} \cdot \mathbf{R}^T = \mathbf{I}$ and $\delta\mathbf{R} \cdot \mathbf{R}^T$ is antisymmetric.]

16.13. Show Eq. (16.13:9) using the properties of the (:) operation.

16.14. Derive the field equations including the natural boundary conditions
for the functional given in Eq. (16.13:5) with $\mathbf{U} = [\mathbf{R}^T \cdot (\mathbf{I} + \nabla_0 \mathbf{u})]_s$ satisfied a
priori. Identify the admissible conditions for the independent field variables.

16.15. Show that Eq. (16.13:23) is a field equation for the functional
Eq. (16.13:22).

16.14. VARIATIONAL PRINCIPLES FOR FINITE
ELASTICITY: NEARLY INCOMPRESSIBLE OR
INCOMPRESSIBLE MATERIALS

For materials that are nearly incompressible, the energy density function
$W_0$ or $Q_0$ given in the previous section becomes ill defined and is not suitable
for evaluating the mean stress. When the material is incompressible, one
must use the constitutive law as defined in Eq. (16.11:18) or Eqs. (16.11:20)
and (16.11:22). In this section, we shall discuss the variational formulation
that is valid for compressible as well as incompressible materials by using
the kinematic split formulation discussed at the end of Sec. 16.11. This
is to multiplicatively decompose the finite strain into a dilatational part
and a distortional one and to divide the stress into the deviatoric stress and
the hydrostatic pressure. As an illustration we shall derive the variational
principles involving the Biot-Lur’e stress tensor r*.

Following Eq. (16.11:23), we split the right stretch tensor as $\mathbf{U} = \mathbf{U}' J^{1/3}$
to give $\det(\mathbf{U}') = 1$, where J is the Jacobian, U' represents the distortion
of the right stretch tensor U and $J^{1/3}$ the dilatation. If the strain is small,
this decomposition becomes additive as that for the infinitesimal strain
case. Similar to the derivation in Sec. 16.11, we define

$(\mathbf{r}'^*)_s = \frac{\partial Q_0^*(\mathbf{U}', J)}{\partial \mathbf{U}'}$, $p = -\frac{\partial Q_0^*(\mathbf{U}', J)}{\partial J}$,

where $Q_0^*(\mathbf{U}', J) = Q_0(\mathbf{U})$ is the strain energy density function in terms of
U. The relations among r*, r'*, p are

(1) $(\mathbf{r}^*)_s = J^{-1/3} (\mathbf{r}'^*)_s - \frac{1}{3} J^{-1/3} [(\mathbf{r}'^*)_s : \mathbf{U}] \mathbf{U}^{-1} - pJ \mathbf{U}^{-1}$,

(2) $-p = \frac{1}{3J} [(\mathbf{r}^*)_s : \mathbf{U}] = \frac{1}{3} (\boldsymbol{\tau} : \mathbf{I})$.

Note that p is the hydrostatic pressure and r'* is not deviatoric since
$\mathbf{r}'^* : \mathbf{I} \neq 0$.

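We can establish variational principles having u, R, U', J, r'*, p, $\overset{\nu}{\mathbf{T}}$
as independent field variables, where $\overset{\nu}{\mathbf{T}}$ is a traction vector defined only on
$S_{u0}$ (Atluri and Cazzani 1995). For example,

(3) $\Pi(\mathbf{u}, \mathbf{R}, \mathbf{U}', J, \mathbf{r}'^*, p, \overset{\nu}{\mathbf{T}}) = \int_{V_0} \{ Q_0^*(\mathbf{U}', J) + \mathbf{r}'^{*T} : [\mathbf{R}^T \cdot (\mathbf{I} + \nabla_0 \mathbf{u}) J_u^{-1/3} - \mathbf{U}'] - (J_u - J) p - \rho_0 \mathbf{b} \cdot \mathbf{u} \}\, dV - \int_{S_{\sigma 0}} \bar{\mathbf{T}} \cdot \mathbf{u}\, dA - \int_{S_{u0}} \overset{\nu}{\mathbf{T}} \cdot (\mathbf{u} - \bar{\mathbf{u}})\, dA$

is the functional for a variational principle of the seven field variables,
in which

(4) $J_u = \det(\mathbf{U})$, $\mathbf{U} = [\mathbf{R}^T \cdot (\mathbf{I} + \nabla_0 \mathbf{u})]_s$.

The admissible requirements are: u is continuous, R orthogonal, U'
symmetric, J positive, r'* a second rank tensor, p piecewise continuous and $\overset{\nu}{\mathbf{T}}$ a
piecewise continuous vector.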

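To derive the field equations associated with the functional given in
Eq. (3), we need to take into account that

$\mathbf{r}'^{*T} : [\delta\mathbf{R}^T \cdot (\mathbf{I} + \nabla_0 \mathbf{u})] = [(\mathbf{I} + \nabla_0 \mathbf{u}) \cdot \mathbf{r}'^*] : \delta\mathbf{R} = [\mathbf{R}^T \cdot (\mathbf{I} + \nabla_0 \mathbf{u}) \cdot \mathbf{r}'^*] : (\mathbf{R}^T \cdot \delta\mathbf{R})$,

$\mathbf{r}'^{*T} : (\mathbf{R}^T \cdot \nabla_0 \delta\mathbf{u}) = (\mathbf{R} \cdot \mathbf{r}'^{*T}) : \nabla_0 \delta\mathbf{u}$,

$\delta J_u = \frac{\partial J_u}{\partial \mathbf{U}} : \delta\mathbf{U} = J_u \mathbf{U}^{-1} : \delta\mathbf{U} = J_u \mathbf{U}^{-1} : [\delta\mathbf{R}^T \cdot (\mathbf{I} + \nabla_0 \mathbf{u}) + \mathbf{R}^T \cdot \nabla_0 \delta\mathbf{u}]$
$= J_u [\mathbf{R}^T \cdot (\mathbf{I} + \nabla_0 \mathbf{u}) \cdot \mathbf{U}^{-1}] : (\mathbf{R}^T \cdot \delta\mathbf{R}) + J_u (\mathbf{U}^{-1} \cdot \mathbf{R}^T) : (\nabla_0 \delta\mathbf{u})^T$.

From the last equation above, one derives

$J_u^{1/3}\, \mathbf{r}'^{*T} : [\mathbf{R}^T \cdot (\mathbf{I} + \nabla_0 \mathbf{u})\, \delta J_u^{-1/3}] = -[\mathbf{r}'^{*T} : \mathbf{R}^T \cdot (\mathbf{I} + \nabla_0 \mathbf{u})] (\mathbf{U}^{-1} \cdot \mathbf{R}^T) : (\nabla_0 \delta\mathbf{u})^T / 3$
$- [\mathbf{r}'^{*T} : \mathbf{R}^T \cdot (\mathbf{I} + \nabla_0 \mathbf{u})] [\mathbf{R}^T \cdot (\mathbf{I} + \nabla_0 \mathbf{u}) \cdot \mathbf{U}^{-1}] : (\mathbf{R}^T \cdot \delta\mathbf{R}) / 3$.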

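The stationary condition of $\Pi$ for arbitrary variation of the admissible field
variables gives the following field equations: the constitutive laws

(5) $(\mathbf{r}'^*)_s = \frac{\partial Q_0^*(\mathbf{U}', J)}{\partial \mathbf{U}'}$,

(6) $p = -\frac{\partial Q_0^*(\mathbf{U}', J)}{\partial J}$;

the compatibility conditions

(7) $\mathbf{U}' = \mathbf{R}^T \cdot (\mathbf{I} + \nabla_0 \mathbf{u}) J_u^{-1/3}$,

(8) $J = J_u$;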

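the angular momentum balance condition

(9) $\{ \mathbf{R}^T \cdot (\mathbf{I} + \nabla_0 \mathbf{u}) \cdot [3 \mathbf{r}'^* - (\mathbf{r}'^* : (\mathbf{I} + \nabla_0 \mathbf{u})^T \cdot \mathbf{R}) \mathbf{U}^{-1} - 3 p J_u^{4/3} \mathbf{U}^{-1}] \}_a = 0$,

[note that $\mathbf{r}'^* : (\mathbf{I} + \nabla_0 \mathbf{u})^T \cdot \mathbf{R} = \mathbf{r}'^{*T} : \mathbf{R}^T \cdot (\mathbf{I} + \nabla_0 \mathbf{u})$]; the linear momentum
balance condition

(10) $\nabla_0 \cdot \{ [J_u^{-1/3} \mathbf{r}'^* - (J_u^{-1/3}\, \mathbf{r}'^* : (\mathbf{I} + \nabla_0 \mathbf{u})^T \cdot \mathbf{R}/3 + p J_u) \mathbf{U}^{-1}] \cdot \mathbf{R}^T \} + \rho_0 \mathbf{b} = 0$,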


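in $V_0$; the traction boundary condition

(11) $\mathbf{N} \cdot \left\{ \left[ \left( \mathbf{r}'^* - \frac{\mathbf{r}'^* : (\mathbf{I} + \nabla_0 \mathbf{u})^T \cdot \mathbf{R}}{3} \mathbf{U}^{-1} \right) \frac{1}{J_u^{1/3}} - p J_u \mathbf{U}^{-1} \right] \cdot \mathbf{R}^T \right\} = \bar{\mathbf{T}}$

on $S_{\sigma 0}$, and the displacement boundary condition and the traction
compatibility condition

(12) $\mathbf{u} = \bar{\mathbf{u}}$,

(13) $\mathbf{N} \cdot \left\{ \left[ \left( \mathbf{r}'^* - \frac{\mathbf{r}'^* : (\mathbf{I} + \nabla_0 \mathbf{u})^T \cdot \mathbf{R}}{3} \mathbf{U}^{-1} \right) \frac{1}{J_u^{1/3}} - p J_u \mathbf{U}^{-1} \right] \cdot \mathbf{R}^T \right\} = \overset{\nu}{\mathbf{T}}$

on $S_{u0}$. Note that since U' is symmetric, the compatibility condition assures
that $\mathbf{R}^T \cdot (\mathbf{I} + \nabla_0 \mathbf{u})$ or $(\mathbf{I} + \nabla_0 \mathbf{u})^T \cdot \mathbf{R}$ is symmetric and, therefore,
equals U (= $[\mathbf{R}^T \cdot (\mathbf{I} + \nabla_0 \mathbf{u})]_s$). Consequently

$\frac{\mathbf{r}'^* : (\mathbf{I} + \nabla_0 \mathbf{u})^T \cdot \mathbf{R}}{3 J_u^{1/3}} \mathbf{U}^{-1} + p J_u \mathbf{U}^{-1} = \frac{\mathbf{r}'^* : \mathbf{U}}{3 J^{1/3}} \mathbf{U}^{-1} + p J \mathbf{U}^{-1}$,

and Eqs. (11) and (13) become, respectively,

$\mathbf{N} \cdot \left\{ \left[ \mathbf{r}'^* - \frac{1}{3} (\mathbf{r}'^* : \mathbf{U}) \mathbf{U}^{-1} \right] J^{-1/3} - p J \mathbf{U}^{-1} \right\} \cdot \mathbf{R}^T = \bar{\mathbf{T}}$ on $S_{\sigma 0}$,

$\mathbf{N} \cdot \left\{ \left[ \mathbf{r}'^* - \frac{1}{3} (\mathbf{r}'^* : \mathbf{U}) \mathbf{U}^{-1} \right] J^{-1/3} - p J \mathbf{U}^{-1} \right\} \cdot \mathbf{R}^T = \overset{\nu}{\mathbf{T}}$ on $S_{u0}$.

One can obtain a five-field variational principle through the contact
transformation

(14) $-Q_c[(\mathbf{r}'^*)_s, p] = Q_0^*(\mathbf{U}', J) - \mathbf{U}' : (\mathbf{r}'^*)_s + pJ$.

A substitution of Eq. (14) into Eq. (3) yields

(15) $\Pi_8(\mathbf{u}, \mathbf{R}, \mathbf{r}'^*, p, \overset{\nu}{\mathbf{T}}) = \int_{V_0} \left\{ -Q_c[(\mathbf{r}'^*)_s, p] + \frac{\mathbf{r}'^{*T} : \mathbf{R}^T \cdot (\mathbf{I} + \nabla_0 \mathbf{u})}{J_u^{1/3}} - J_u p - \rho_0 \mathbf{b} \cdot \mathbf{u} \right\} dV - \int_{S_{\sigma 0}} \bar{\mathbf{T}} \cdot \mathbf{u}\, dA - \int_{S_{u0}} \overset{\nu}{\mathbf{T}} \cdot (\mathbf{u} - \bar{\mathbf{u}})\, dA$,

which is the functional for a five-field principle. The admissible
requirements for the independent fields u, R, r'*, p, $\overset{\nu}{\mathbf{T}}$ are the same as those in $\Pi$.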

We can similarly establish the variational principles involving the first 
Piola-Kirchhoff stress T. Let 


(16) $\mathbf{F}' = \mathbf{F} J^{-1/3},$

(17) $W_0^*(\mathbf{F}', J) = W_0(\mathbf{F}^T \cdot \mathbf{F}).$

From the last two equations, we can show that


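(18) $(\mathbf{T}')^T = \dfrac{\partial W_0^*(\mathbf{F}', J)}{\partial \mathbf{F}'},$

(19) $p = -\dfrac{\partial W_0^*(\mathbf{F}', J)}{\partial J},$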


T T = J- 1/3 T /T _ 1 J-l/3j T ^ . Fjp-1 _ pJF~ X . 


One can then establish the variational principle using u, F', J, T', p T 
as independent field variables. 

One can also derive a two-field principle through the contact transformation

(20) $-W_m(\mathbf{F}', p) = W_0^*(\mathbf{F}', J) + pJ.$

The enforcement of compatibility conditions Eqs. (7) and (8) a priori results
in

$\mathbf{F}' = (\mathbf{I} + \nabla_0 \mathbf{u})\, J^{-1/3},$

$J = \det(\mathbf{I} + \nabla_0 \mathbf{u}).$

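Then in Eq. (3), replacing $Q_0^*(\mathbf{U}', J)$ with $W_0^*(\mathbf{F}', J)$, enforcing the displacement
boundary condition Eq. (12) on $S_{u0}$, and introducing the penalty
function $\gamma[\mathbf{R}^T \cdot (\mathbf{I} + \nabla_0 \mathbf{u})]_a^2$, one obtains

(21) $\Pi_9(\mathbf{u}, \mathbf{R}, p) = \displaystyle\int_{V_0} \left\{ -W_m(\mathbf{F}', p) - p \det(\mathbf{I} + \nabla_0 \mathbf{u}) + \frac{\gamma}{2}[\mathbf{R}^T \cdot (\mathbf{I} + \nabla_0 \mathbf{u})]_a^2 - \rho_0 \mathbf{b} \cdot \mathbf{u} \right\} dV - \int_{S_{\sigma 0}} \bar{\mathbf{T}} \cdot \mathbf{u}\, dA.$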

The penalty function is introduced to improve the accuracy of the weak
solution (Atluri and Cazzani 1995). The functional $\Pi_9$ for infinitesimal
strain with $\gamma = 0$ was derived by Tong (1969).

Note that the functional $\Pi$ of Eq. (3) cannot be used for incompressible
materials because $Q_0^*(\mathbf{U}', J)$ [$= Q_0(\mathbf{U})$] is not defined when $J = 1$. However,
both $\Pi_8$ and $\Pi_9$ of Eqs. (15) and (21) are suitable for incompressible and
nearly incompressible materials.

Before leaving this section, we summarize below the definitions of the 
various energy density functions and the associated constitutive laws de¬ 
veloped in the last two sections: 
Energy density functions

constitutive laws 

Qo(U) = Wo(C) = Wo(F t • F) 

dW 0 1 dW 0 _ tT 
dc 2 ’ dF 

w = < TR) - 

-Q c (r) = Q 0 (U)-(T-R).:U 

P 

II 

g$(U', j) = Qo(U', J) 

ags(U',J) 

( r )s QX5 , 

dQ*o(U\J) 
p - dJ 

= Q*o( U', J)-W: (r'*)j + P J 

U' = , 

d(r'*) a ’ 

9Q C 
p ~ dJ 

W 0 *(F', J) = W 0 (F' T • F'J 2 / 3 ) 

dWS(F',J) 

-' s QY' ’ 

aw?(F',j) 

p dJ 

-W m (F',p) = WZ(F',J)+pJ 

dW m (F',p) 

js Q F , 

T dW m (F\p) 

dp 


16.15. SMALL DEFLECTION OF THIN PLATES 

In the following two sections we shall consider the basic equations in the 
theory of isotropic elastic plates. Section 16.15 serves as an introduction by 
considering small deflection only. Section 16.16 treats the main subject of 
large deflection of plates. From the standpoint of engineering applications,
the theory of plates is one of the most important and most interesting topics 
in the theory of elasticity. 

A plate is a body bounded by two surfaces of small curvature, the distance
between these surfaces, called the thickness, being small in comparison
with the dimensions of the surface. The thickness of the plate is denoted
by h. The surface equidistant to the bounding surfaces is called the middle
surface or midplane. When h is constant, the plate is said to be of uniform
thickness; when the middle surface is a plane in the undeformed configuration,
the plate is said to be flat. We shall consider only flat plates of
uniform thickness and of homogeneous linear elastic material. Our purpose
is to illustrate the application of the concepts developed in Secs. 16.1 to
16.11 to the derivation of basic equations for plates.

A principal feature in straining a plate or a shell is the relative smallness 
of the traction acting on surfaces parallel to the middle surface as compared 
with the maximum bending or stretching stresses in the body. For example, 
the aerodynamic pressure acting on the wings of an airplane in flight is of the
order of 1 to 10 pounds per square inch (7 to 70 kPa), whereas the bending
stress in the wing skin is likely to range from 10,000 to 200,000 pounds per
square inch ($7 \times 10^4$ to $1.4 \times 10^6$ kPa). In many other applications, plates
and shells are used to transmit forces and moments acting on their edges,
and no load acts on the faces at all. This practical situation is important
in simplifying the theory. When a plate is very thin, small tractions on the
external faces imply that the traction on any surface parallel to the middle
surface is also small.



Fig. 16.15:1. A thin plate and the coordinate system.

A flat thin plate is depicted in Fig. 16.15:1. The x, y-plane coincides
with the middle surface of the plate in its initial, unloaded state, and the
z-axis is normal to it. The thickness of the plate is h, which is much
smaller than the dimensions of the plate in the plane of x, y. Under load,
the plate deforms, causing elastic displacements $u_i$ ($u_x$, $u_y$, $u_z$).
In particular, $|u_i| \ll h$ is considered. The strains are small. The constitutive
equation is Hooke's law.

A general feature of the theory of plates and shells is that it is concerned
with applications in which the stress components $\sigma_{zx}$, $\sigma_{zy}$, $\sigma_{zz}$ in
this coordinate system are very small throughout the plate. Hence, in the
determination of the deformation of the plate, it is a good approximation
to assume that

(1) $\sigma_{zx} = 0, \quad \sigma_{zy} = 0, \quad \sigma_{zz} = 0, \qquad -h/2 \le z \le h/2.$

The plate deformation is mainly stretching and bending, hence one may
assume that $\sigma_{xx}$ and $\sigma_{yy}$ vary linearly throughout the thickness, i.e.,

(2) $\sigma_{xx} = a_1(x, y) + b_1(x, y)\, z, \quad \sigma_{yy} = a_2(x, y) + b_2(x, y)\, z.$


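These assumptions are to be verified a posteriori for each boundary-value
problem. With the assumptions listed in Eqs. (1) and (2), we have, by
Hooke's law,

(3) $e_{zx} = \dfrac{1}{2}\left(\dfrac{\partial u_x}{\partial z} + \dfrac{\partial u_z}{\partial x}\right) = 0, \quad e_{zy} = \dfrac{1}{2}\left(\dfrac{\partial u_y}{\partial z} + \dfrac{\partial u_z}{\partial y}\right) = 0,$

(4) $e_{zz} = \dfrac{\partial u_z}{\partial z} = -\dfrac{\nu}{E}(\sigma_{xx} + \sigma_{yy}).$

Equations (2) and (4) together yield

(5) $u_z = w(x, y) - \dfrac{\nu}{E}[a_1(x, y) + a_2(x, y)]\, z - \dfrac{\nu}{E}[b_1(x, y) + b_2(x, y)]\, \dfrac{z^2}{2}.$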

The function $w(x, y)$ represents the vertical deflection of the middle surface
of the plate. In view of the thinness of the plate, the last two terms in Eq. (5)
are in general small and will be neglected in comparison with $w(x, y)$.¹
Substituting $w(x, y)$ for $u_z$ in Eq. (3) and integrating it, we obtain

(6) $u_x = u(x, y) - z\,\dfrac{\partial w(x, y)}{\partial x}, \quad u_y = v(x, y) - z\,\dfrac{\partial w(x, y)}{\partial y}, \quad u_z = w(x, y),$

where $u(x, y)$ and $v(x, y)$ are the in-plane displacements of the midsurface
(Fig. 16.15:2).

¹ The consistency of this statement has to be verified a posteriori. Note, however, that
for a successful design using plates as part of a safe and stable structure, the stresses $\sigma_{xx}$,
$\sigma_{yy}$ must be bounded by the yielding stress, $\sigma_{Y.P.}$, of the material. Hence, according to
Eq. (16.15:4), $e_{zz}$ is bounded by $2\nu\sigma_{Y.P.}/E$. Thus, the last two terms in Eq. (16.15:5)
must be smaller than $2\nu(\sigma_{Y.P.}/E)h$. For most structural materials, $(\sigma_{Y.P.}/E)$ is of order
$10^{-3}$. Hence the last two terms in Eq. (16.15:5) are negligible if $w/h \gg 10^{-3}$.

To simplify the discussion, we shall employ matrix notations and operations
in the remaining part of this chapter. From Eq. (6) we deduce the
engineering strain matrix:

(7) $\mathbf{e} = \boldsymbol{\varepsilon} - z\boldsymbol{\kappa},$

where $\boldsymbol{\varepsilon}$ is the in-plane strain column matrix at the mid-surface and $\boldsymbol{\kappa}$ is
the curvature column matrix whose components are defined below:

(8) $\boldsymbol{\varepsilon} = \left[\dfrac{\partial u}{\partial x} \quad \dfrac{\partial v}{\partial y} \quad \dfrac{\partial u}{\partial y} + \dfrac{\partial v}{\partial x}\right]^T, \quad \boldsymbol{\kappa} = \left[\dfrac{\partial^2 w}{\partial x^2} \quad \dfrac{\partial^2 w}{\partial y^2} \quad 2\dfrac{\partial^2 w}{\partial x \partial y}\right]^T,$

where the superscript $(\cdot)^T$ denotes the transpose of the relevant quantity.
The transpose of a column matrix is a row matrix. In the present definition,
$\mathbf{e}$, $\boldsymbol{\varepsilon}$, and $\boldsymbol{\kappa}$ are no longer tensors. By means of Hooke's law, we find the
stress matrix

(9) $\boldsymbol{\sigma} = [\sigma_{xx} \quad \sigma_{yy} \quad \sigma_{xy}]^T = \mathbf{D}_e(\boldsymbol{\varepsilon} - z\boldsymbol{\kappa}),$

where

(10) $\mathbf{D}_e = \dfrac{E}{1 - \nu^2}\begin{bmatrix} 1 & \nu & 0 \\ \nu & 1 & 0 \\ 0 & 0 & (1 - \nu)/2 \end{bmatrix}$

is the elastic coefficient matrix for a homogeneous isotropic material. In
this way, the stresses and deformations in the plate are determined on the
basis of the assumptions Eqs. (1) and (2).
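
As a small numerical illustration of Eqs. (9) and (10), the sketch below evaluates the in-plane stresses at a height z from a given mid-surface strain and curvature. The function names and the material and geometry values are assumptions chosen for illustration only; the third curvature entry is taken as the engineering twist $2\,\partial^2 w/\partial x \partial y$.

```python
def elastic_matrix(E, nu):
    """Plane-stress elastic coefficient matrix D_e of Eq. (10) as a nested list."""
    c = E / (1.0 - nu ** 2)
    return [
        [c, c * nu, 0.0],
        [c * nu, c, 0.0],
        [0.0, 0.0, c * (1.0 - nu) / 2.0],
    ]


def plate_stress(E, nu, eps, kappa, z):
    """Stress column [s_xx, s_yy, s_xy] at height z, i.e. D_e (eps - z * kappa)."""
    D_e = elastic_matrix(E, nu)
    strain = [e - z * k for e, k in zip(eps, kappa)]
    return [sum(D_e[i][j] * strain[j] for j in range(3)) for i in range(3)]


# Assumed illustrative data: aluminium-like plate bent about the y-axis only.
E, nu, h = 70.0e9, 0.3, 0.002      # Pa, dimensionless, m
eps = [0.0, 0.0, 0.0]              # no mid-surface stretching
kappa = [0.5, 0.0, 0.0]            # d2w/dx2 = 0.5 1/m, no twist
top = plate_stress(E, nu, eps, kappa, +h / 2)
bottom = plate_stress(E, nu, eps, kappa, -h / 2)
print("top face:   ", top)
print("bottom face:", bottom)
```

In pure bending the stresses on the two faces are equal and opposite, and the Poisson effect produces a transverse stress $\sigma_{yy} = \nu\sigma_{xx}$ even though the curvature in y is zero.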

The approximate results embodied in Eq. (6) are called Kirchhoff's hypothesis,
which states that every straight line in the plate that was originally
perpendicular to the plate middle surface remains straight and perpendicular
to the deflected middle surface after the strain. The theory of plates
based on this hypothesis is called the Kirchhoff theory.

In most cases the ad hoc assumptions Eqs. (1) and (2) are approximations.
In general, they are not even consistent. Kirchhoff alluded to the
theory of simple beams for justification of his hypothesis; but we know from
Saint-Venant's beam theory (see Sec. 7.7) that the assumption "plane cross
sections remain plane" is incorrect if the resultant shear does not vanish.
For these reasons many authors dislike these ad hoc assumptions, and have
tried other formulations. A systematic scheme of successive approximations
has been developed, of which the "first-order" approximation coincides with
that of the Kirchhoff theory. The criticisms provide impetus for further
developments of plate theory. However, in retrospect, the truth of the matter
is that the recognition of Kirchhoff's hypothesis Eq. (6) was the most
important discovery in the theory of plates.

The Kirchhoff hypothesis reduces the equations of three-dimensional 
elastic continuum to those of a two-dimensional non-Euclidean continuum 
of the middle surface, describing the behavior of the plate in terms of the 
displacements (and their derivatives) of points on the middle surface. 

Let us now consider the equations of equilibrium. Since the approximate
stress distribution throughout the thickness is already known [see Eq. (9)],
we shall derive the approximate equations of equilibrium by integrating
Eqs. (7.1:5) over the thickness. For small strain, the Cauchy and Kirchhoff
stress tensors [see Eq. (16.7:3)] are approximately the same. We shall use
$\sigma$'s to denote stress in the deformed configuration. We assume that over
the faces of the plate ($z = \pm h/2$) there act the normal stress $\sigma_{zz}(\pm h/2)$
and the shear stresses $\sigma_{zx}(\pm h/2)$, $\sigma_{zy}(\pm h/2)$. Figure 16.15:3 shows the
directions of stress vectors, if the external loads $\sigma_{zx}(\pm h/2)$, $\sigma_{zy}(\pm h/2)$,
$\sigma_{zz}(\pm h/2)$ are positive. By integrating the stresses $\sigma_{xx}$, $\sigma_{xy}$, $\sigma_{yy}$ through
the thickness, we obtain the stress resultants $N_x$, $N_{xy}$, $N_y$ defined by

(11) $N_x = \displaystyle\int_{-h/2}^{h/2} \sigma_{xx}\, dz, \quad N_{xy} = \int_{-h/2}^{h/2} \sigma_{xy}\, dz, \quad N_y = \int_{-h/2}^{h/2} \sigma_{yy}\, dz.$

Now integrating the first two equations of Eq. (7.1:5) with respect to z from
$-h/2$ to $h/2$, we obtain

(12) $\dfrac{\partial N_x}{\partial x} + \dfrac{\partial N_{xy}}{\partial y} + f_x = 0, \quad \dfrac{\partial N_{xy}}{\partial x} + \dfrac{\partial N_y}{\partial y} + f_y = 0,$

where $f$'s are the external loads tangent to the plate (forces per unit area)

$f_x = \sigma_{zx}(h/2) - \sigma_{zx}(-h/2) + \displaystyle\int_{-h/2}^{h/2} b_{0x}\, dz,$

$f_y = \sigma_{zy}(h/2) - \sigma_{zy}(-h/2) + \displaystyle\int_{-h/2}^{h/2} b_{0y}\, dz,$

where $b$'s are the components of body force per unit volume.
Now if we multiply the first two equations of Eq. (7.1:5) by $z\,dz$ and
integrate from $-h/2$ to $h/2$, we will obtain the results

(13) $\dfrac{\partial M_x}{\partial x} + \dfrac{\partial M_{xy}}{\partial y} - Q_x + m_x = 0, \quad \dfrac{\partial M_{xy}}{\partial x} + \dfrac{\partial M_y}{\partial y} - Q_y + m_y = 0,$

where

(14) $M_x = \displaystyle\int_{-h/2}^{h/2} \sigma_{xx}\, z\, dz, \quad M_{xy} = \int_{-h/2}^{h/2} \sigma_{xy}\, z\, dz, \quad M_y = \int_{-h/2}^{h/2} \sigma_{yy}\, z\, dz,$

(15) $Q_x = \displaystyle\int_{-h/2}^{h/2} \sigma_{zx}\, dz, \quad Q_y = \int_{-h/2}^{h/2} \sigma_{zy}\, dz,$

$m_x = \dfrac{h}{2}[\sigma_{zx}(h/2) + \sigma_{zx}(-h/2)] + \displaystyle\int_{-h/2}^{h/2} b_{0x}\, z\, dz,$

$m_y = \dfrac{h}{2}[\sigma_{zy}(h/2) + \sigma_{zy}(-h/2)] + \displaystyle\int_{-h/2}^{h/2} b_{0y}\, z\, dz.$

The quantities $M_x$, $M_y$ are called the bending moments and $M_{xy}$ the twisting
moment; $m_x$, $m_y$ are the resultant external moments per unit area about the
middle surface; $Q_x$, $Q_y$, of the dimensions force per unit length, are called
the transverse shears. Clearly, the moment arm is $h/2$ for shear on the faces
and $z$ in the plate. These stress resultants and moments are illustrated in
Figs. 16.15:4 and 16.15:5.
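(19) $\mathbf{M} = [M_x \quad M_y \quad M_{xy}]^T = -\mathbf{D}_b \boldsymbol{\kappa},$

where

(20) $\mathbf{D}_n = \displaystyle\int_{-h/2}^{h/2} \mathbf{D}_e\, dz, \quad \mathbf{D}_b = \int_{-h/2}^{h/2} z^2\, \mathbf{D}_e\, dz,$

with $\mathbf{D}_e$ given in Eq. (10). For homogeneous isotropic materials,

$\mathbf{D}_n = \dfrac{Eh}{1 - \nu^2}\begin{bmatrix} 1 & \nu & 0 \\ \nu & 1 & 0 \\ 0 & 0 & (1 - \nu)/2 \end{bmatrix}, \quad \mathbf{D}_b = D\begin{bmatrix} 1 & \nu & 0 \\ \nu & 1 & 0 \\ 0 & 0 & (1 - \nu)/2 \end{bmatrix}, \quad D = \dfrac{Eh^3}{12(1 - \nu^2)},$

in which $D$ is called the bending rigidity of the plate.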
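
The thickness integrals that define the membrane and bending stiffnesses can be checked numerically. The sketch below, with assumed steel-like values and hypothetical function names, integrates the leading entry of the elastic coefficient matrix through the thickness with weights 1, z, and z². The zeroth moment gives $Eh/(1-\nu^2)$, the second moment gives the bending rigidity $D$, and the first moment vanishes for a homogeneous plate, which is the reason stretching and bending do not couple through the constitutive law.

```python
def bending_rigidity(E, nu, h):
    """Closed-form bending rigidity D = E h^3 / [12 (1 - nu^2)]."""
    return E * h ** 3 / (12.0 * (1.0 - nu ** 2))


def thickness_integral(power, h, n=2000):
    """Midpoint-rule estimate of the integral of z**power over [-h/2, h/2]."""
    dz = h / n
    return sum((-h / 2 + (i + 0.5) * dz) ** power for i in range(n)) * dz


# Assumed illustrative data (steel-like plate, 10 mm thick).
E, nu, h = 200.0e9, 0.3, 0.01
c = E / (1.0 - nu ** 2)                    # leading entry of D_e

D_n_11 = c * thickness_integral(0, h)      # membrane stiffness entry
coupling = c * thickness_integral(1, h)    # stretching-bending coupling entry
D_b_11 = c * thickness_integral(2, h)      # bending stiffness entry
D = bending_rigidity(E, nu, h)

print("D_n[0][0] =", D_n_11, " closed form:", E * h / (1.0 - nu ** 2))
print("coupling  =", coupling)
print("D_b[0][0] =", D_b_11, " closed form D:", D)
```

Because D grows with the cube of the thickness while the membrane stiffness grows only linearly, thin plates are far more compliant in bending than in stretching.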

Equations (17) and (19), involving the deflection $w(x, y)$, define the
so-called bending problem. Equations (12) and (18), involving the displacements
u, v in the midplane of the plate, define the so-called stretching
problem. An examination of these equations shows that in the linear theory
of plates, under the assumption of infinitesimal displacements, the bending
and the stretching of the plate are independent of each other, i.e., there is no
coupling between these two responses. It will be seen later that for large
deflection, the bending and stretching become coupled and introduce
nonlinearity. A substitution of Eq. (19) into Eq. (17) yields the fundamental
equation of the linear theory of bending of a plate, which is a fourth order
equation in terms of w.

For plates of uniform thickness, the equation becomes

(21) $\dfrac{\partial^4 w}{\partial x^4} + 2\dfrac{\partial^4 w}{\partial x^2 \partial y^2} + \dfrac{\partial^4 w}{\partial y^4} = \dfrac{1}{D}\left(p + \dfrac{\partial m_x}{\partial x} + \dfrac{\partial m_y}{\partial y}\right).$

Thus the problem of bending is reduced to the solution of a biharmonic
equation with appropriate boundary conditions. The mathematical problem
is, therefore, the same as that of the Airy stress functions in plane
elasticity. The mathematical problem of plate stretching is identical with
the plane stress problem, and Airy stress functions can be used.
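
A quick way to see the biharmonic plate equation at work is the classical single-term Navier solution, which is not derived in this section and is quoted here as a standard result: for a simply supported rectangular plate of sides a and b under the lateral load $p = p_0 \sin(\pi x/a)\sin(\pi y/b)$, with $m_x = m_y = 0$, the deflection is $w = p_0 \sin(\pi x/a)\sin(\pi y/b) / \{D\pi^4(1/a^2 + 1/b^2)^2\}$. The sketch below checks $D\nabla^4 w = p$ by central differences. All names and numerical values are assumptions for illustration.

```python
import math


def navier_deflection(p0, D, a, b):
    """Return w(x, y) for a simply supported plate under a one-term sine load."""
    alpha, beta = math.pi / a, math.pi / b
    amplitude = p0 / (D * (alpha ** 2 + beta ** 2) ** 2)
    return lambda x, y: amplitude * math.sin(alpha * x) * math.sin(beta * y)


def biharmonic(w, x, y, d=1e-2):
    """Central-difference estimate of w_xxxx + 2 w_xxyy + w_yyyy at (x, y)."""
    def fourth(f):
        return (f(-2 * d) - 4 * f(-d) + 6 * f(0.0) - 4 * f(d) + f(2 * d)) / d ** 4

    def second_x(yy):
        return (w(x - d, yy) - 2 * w(x, yy) + w(x + d, yy)) / d ** 2

    w_xxxx = fourth(lambda s: w(x + s, y))
    w_yyyy = fourth(lambda s: w(x, y + s))
    w_xxyy = (second_x(y - d) - 2 * second_x(y) + second_x(y + d)) / d ** 2
    return w_xxxx + 2 * w_xxyy + w_yyyy


# Assumed illustrative data.
E, nu, h = 200.0e9, 0.3, 0.01
D = E * h ** 3 / (12.0 * (1.0 - nu ** 2))
p0, a, b = 1000.0, 1.0, 1.5               # Pa, m, m

w = navier_deflection(p0, D, a, b)
x, y = 0.3, 0.4
lhs = D * biharmonic(w, x, y)
rhs = p0 * math.sin(math.pi * x / a) * math.sin(math.pi * y / b)
print("D * biharmonic(w) =", lhs)
print("applied load p    =", rhs)
print("centre deflection =", w(a / 2, b / 2))
```

The two printed load values agree up to the truncation error of the finite-difference stencil, and the deflection vanishes along the edges x = 0 and y = 0 as required for simple support.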
16.16. LARGE DEFLECTION OF PLATES

The theory of small deflections of plates was derived in the previous 
section under the assumption of infinitesimal displacements. Unfortunately, 
the results are valid literally only for very small displacements. When the 
deflections are as large as the thickness of the plate, the results become 
quite inaccurate. This is in sharp contrast with the theory of beams, for 
which the linear equation is valid as long as the slope of the deflection curve 
is small in comparison to unity. 

A well-known theory of large deflections of plates is due to von Karman.
In this theory the following assumptions are made.

(H1) The plate is thin. The thickness h is much smaller than the characteristic
dimension L of the plate in its plane, i.e., $h \ll L$.

(H2) The magnitude of the deflection w is of the same order of magnitude
as the thickness of the plate, h, but small compared with the typical
plate dimension L, i.e.,

$|w| = O(h), \quad |w| \ll L.$

(H3) The slope is everywhere small, $|\partial w/\partial x| \ll 1$, $|\partial w/\partial y| \ll 1$.

(H4) The tangential displacements u, v are infinitesimal. In the
strain-displacement relations, only those nonlinear terms that depend on $\partial w/\partial x$
and $\partial w/\partial y$ are retained. All other nonlinear terms are neglected.

(H5) All strain components are small. Hooke's law holds.

(H6) Kirchhoff's hypotheses hold; i.e., the tractions on surfaces parallel
to the middle surface are small, strains vary linearly with z, the distance
from the midplane, within the plate, and normals remain normal.

Thus, von Karman's theory of plates differs from the linear theory of plates
only in retaining certain powers of the derivatives $\partial w/\partial x$ and $\partial w/\partial y$ in the
strain-displacement relationship.

We now have a basic small parameter h. The elastic displacement w is
assumed to be of the same order of magnitude as h. In other words, the
term large deflection refers to the fact that w is no longer small compared
with h. Thus, as shown in Fig. 16.16:1, the deformed configuration differs
considerably from the original one. We cannot imitate the procedure of
Sec. 16.15 without further care. For example, if we retain the rectangular
coordinates, with the x, y-plane fixed as the original middle plane of the plate
in the undeformed position, then the limits of integration across the plate
thickness for a loaded plate can no longer be from $-h/2$ to $h/2$. If we wish
to retain the convenience of fixed coordinates for the surfaces, we have
two alternative courses. The first is to introduce "convective" coordinates,
which become curvilinear in the deformed state, i.e., a set of curvilinear
coordinates imbedded in the plate so that the middle surface is always the
surface $z = 0$, and the faces of the plate $z = \pm h/2$. The other is to adopt
the Lagrangian description for the plate. In Lagrangian coordinates the
plate surfaces are always designated as $z = \pm h/2$. The second alternative
is simpler, and will be pursued below.

Fig. 16.16:1. Corresponding stresses in the initial and deformed configuration of a plate.

In the following we use the Lagrangian description and Green's strain
tensor, referred to the initial configuration. The strain components
[identifying $X_1$, $X_2$, $X_3$ with x, y, z] are given in Eq. (16.3:5). According to
the ad hoc assumptions (H6), we have the displacement components as given
in Eq. (16.15:6). According to (H4) the higher powers of the derivatives
of u and v are negligible in comparison with their first powers; and since
$|z| < h \ll L$, it can be argued that the only nonlinear terms to be retained
in the strain-displacement relations Eqs. (16.15:7) and (16.15:8) are the
squares and products of $\partial w/\partial x$, $\partial w/\partial y$. Hence,

(1) $\mathbf{E} = \boldsymbol{\varepsilon} + \dfrac{\boldsymbol{\varepsilon}_w}{2} - z\boldsymbol{\kappa},$

where $\boldsymbol{\varepsilon}_w/2$ is the nonlinear part of the strain at the mid-plane, and

$\boldsymbol{\varepsilon}_w = \left[\left(\dfrac{\partial w}{\partial x}\right)^2 \quad \left(\dfrac{\partial w}{\partial y}\right)^2 \quad 2\dfrac{\partial w}{\partial x}\dfrac{\partial w}{\partial y}\right]^T.$

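Conjugate to Green's strain tensor is the second Piola-Kirchhoff stress
tensor $\mathbf{S}$ [see Eq. (16.7:6)], which can be used in the Lagrangian description.
We assume, according to (H5), that $\mathbf{S}$ is related to $\mathbf{E}$ through Hooke's law,

$\mathbf{S} = [S_{xx} \quad S_{yy} \quad S_{xy}]^T = \mathbf{D}_e\left(\boldsymbol{\varepsilon} + \dfrac{\boldsymbol{\varepsilon}_w}{2} - z\boldsymbol{\kappa}\right),$

or in terms of the stress resultants and moments

(3) $\mathbf{N}' = [N'_x \quad N'_y \quad N'_{xy}]^T = \mathbf{D}_n(\boldsymbol{\varepsilon} + \boldsymbol{\varepsilon}_w/2), \quad \mathbf{M}' = [M'_x \quad M'_y \quad M'_{xy}]^T = -\mathbf{D}_b \boldsymbol{\kappa},$

where $\mathbf{D}_n$ and $\mathbf{D}_b$ are given in Eq. (16.15:20) and the stress resultants $N$'s
and the moments $M$'s are defined as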


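(4) $N'_x = \displaystyle\int_{-h/2}^{h/2} S_{xx}\, dz, \quad N'_{xy} = \int_{-h/2}^{h/2} S_{xy}\, dz, \quad N'_y = \int_{-h/2}^{h/2} S_{yy}\, dz,$

(5) $M'_x = \displaystyle\int_{-h/2}^{h/2} S_{xx}\, z\, dz, \quad M'_{xy} = \int_{-h/2}^{h/2} S_{xy}\, z\, dz, \quad M'_y = \int_{-h/2}^{h/2} S_{yy}\, z\, dz.$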


The prime indicates that $N$'s and $M$'s are based on the second
Piola-Kirchhoff stresses and are referred to the initial, unloaded configuration of
the plate.

The stress tensor $\mathbf{S}$, the resultants $N'_x$, $N'_y$, $N'_{xy}$, and the moments $M'_x$,
$M'_y$, $M'_{xy}$ are defined with respect to the original, unloaded configuration
of the plate. Since all partial derivatives of u, v, w are small in comparison
with unity, the values of $\mathbf{S}$, $N'_x$, $N'_y$, $N'_{xy}$, $M'_x$, $M'_y$, $M'_{xy}$ are in the first
order approximation equal, respectively, to the Cauchy or Kirchhoff stress
tensor $\sigma_{ij}$, the stress resultants $N_x$, $N_y$, $N_{xy}$, and the moments $M_x$, $M_y$,
$M_{xy}$ defined in the deformed configuration of the plate. See Eq. (16.7:10),
which relates the Kirchhoff stress tensor $\sigma_{ij}$ to the second Piola-Kirchhoff
stress tensor $S_{ij}$. We will drop the prime in the subsequent discussion.

In the Lagrangian description, the condition of equilibrium is given by 
Eq. (16.10:7) 



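in indicial notation, where $b$'s are the body force per unit volume. Identifying
$X_1, X_2, X_3$ with $x, y, z$ and identifying $x_1, x_2, x_3$ with $x + u_x$, $y + u_y$,
$z + u_z$, and retaining only the first-order terms (i.e., neglecting
any product terms that include u or v), we obtain the equations

(6) $\dfrac{\partial S_{xx}}{\partial x} + \dfrac{\partial S_{xy}}{\partial y} + \dfrac{\partial S_{zx}}{\partial z} + b_{0x} = 0, \quad \dfrac{\partial S_{xy}}{\partial x} + \dfrac{\partial S_{yy}}{\partial y} + \dfrac{\partial S_{zy}}{\partial z} + b_{0y} = 0,$

(7) $\dfrac{\partial}{\partial x}\left(S_{xx}\dfrac{\partial w}{\partial x} + S_{xy}\dfrac{\partial w}{\partial y} + S_{zx}\right) + \dfrac{\partial}{\partial y}\left(S_{xy}\dfrac{\partial w}{\partial x} + S_{yy}\dfrac{\partial w}{\partial y} + S_{zy}\right) + \dfrac{\partial}{\partial z}\left(S_{zx}\dfrac{\partial w}{\partial x} + S_{zy}\dfrac{\partial w}{\partial y} + S_{zz}\right) + b_{0z} = 0.$

Note that $u_x$, $u_y$, $u_z$ are given in Eq. (16.15:6).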
The rest of the development of the large deflection theory parallels
closely that of the classical linear theory. Hence, our presentation will
be brief, and we refer to the preceding section for many details. If Eq. (6)
is multiplied successively by $dz$ and $z\,dz$ and integrated from $-h/2$ to $h/2$
and if only the first order terms are retained, the governing equations for
the membrane stress resultants $N$'s and the moments $M$'s are the same as
Eqs. (16.15:12) and (16.15:13) with $\sigma_{ij}$ replaced by $S_{ij}$ in the formulas for
the $f$'s, $Q$'s, $m$'s, and $M$'s.

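Finally, an integration of Eq. (7) with respect to z from $-h/2$ to
$h/2$ gives

(8) $\dfrac{\partial}{\partial x}\left(Q_x + N_x\dfrac{\partial w}{\partial x} + N_{xy}\dfrac{\partial w}{\partial y}\right) + \dfrac{\partial}{\partial y}\left(Q_y + N_{xy}\dfrac{\partial w}{\partial x} + N_y\dfrac{\partial w}{\partial y}\right) + p' = 0,$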


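where $p'$ is the lateral load per unit area (of the undeformed middle plane)
acting on the plate,

(9) $p' = \left[S_{zx}\dfrac{\partial w}{\partial x} + S_{zy}\dfrac{\partial w}{\partial y} + S_{zz}\right]_{-h/2}^{h/2} + \displaystyle\int_{-h/2}^{h/2} b_{0z}\, dz.$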



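The last two terms on the right-hand side of Eq. (9) are the familiar lateral
load. The first and second terms represent contributions to the lateral
load due to shear acting on the surfaces that are rotated in the deformed
position. Equation (8) is substantially different from the corresponding
Eq. (16.15:16) of the small-deflection theory. We can eliminate $Q_x$, $Q_y$
between Eqs. (8) and (16.15:13) to obtain

$\dfrac{\partial^2 M_x}{\partial x^2} + 2\dfrac{\partial^2 M_{xy}}{\partial x \partial y} + \dfrac{\partial^2 M_y}{\partial y^2} = -\dfrac{\partial m_x}{\partial x} - \dfrac{\partial m_y}{\partial y} - p' - \dfrac{\partial}{\partial x}\left(N_x\dfrac{\partial w}{\partial x} + N_{xy}\dfrac{\partial w}{\partial y}\right) - \dfrac{\partial}{\partial y}\left(N_{xy}\dfrac{\partial w}{\partial x} + N_y\dfrac{\partial w}{\partial y}\right).$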

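Finally, a substitution of Eq. (3) into the above gives, for a plate of
uniform thickness,

(10) ▲ $\dfrac{\partial^4 w}{\partial x^4} + 2\dfrac{\partial^4 w}{\partial x^2 \partial y^2} + \dfrac{\partial^4 w}{\partial y^4} = \dfrac{1}{D}\left[p' + \dfrac{\partial m_x}{\partial x} + \dfrac{\partial m_y}{\partial y} + \dfrac{\partial}{\partial x}\left(N_x\dfrac{\partial w}{\partial x} + N_{xy}\dfrac{\partial w}{\partial y}\right) + \dfrac{\partial}{\partial y}\left(N_{xy}\dfrac{\partial w}{\partial x} + N_y\dfrac{\partial w}{\partial y}\right)\right].$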
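A substitution of Eq. (3) into Eq. (16.15:12) yields immediately the basic
equations for u and v. Alternatively, a stress function $F(x, y)$ can be
introduced such that

$N_x = \dfrac{\partial^2 F}{\partial y^2} - P_x, \quad N_y = \dfrac{\partial^2 F}{\partial x^2} - P_y, \quad N_{xy} = -\dfrac{\partial^2 F}{\partial x \partial y}$

satisfy Eq. (16.15:12) identically, where the $P$'s are the potentials of the
tangential forces given by the indefinite integrals

(11) $P_x = \displaystyle\int f_x\, dx, \quad P_y = \int f_y\, dy.$

The compatibility condition can be derived by eliminating u and v from
Eq. (3). Using Eq. (11), we obtain, for a plate of uniform thickness,

(12) ▲ $\dfrac{\partial^4 F}{\partial x^4} + 2\dfrac{\partial^4 F}{\partial x^2 \partial y^2} + \dfrac{\partial^4 F}{\partial y^4} = Eh\left[\left(\dfrac{\partial^2 w}{\partial x \partial y}\right)^2 - \dfrac{\partial^2 w}{\partial x^2}\dfrac{\partial^2 w}{\partial y^2}\right] + \dfrac{\partial^2 (P_y - \nu P_x)}{\partial x^2} + \dfrac{\partial^2 (P_x - \nu P_y)}{\partial y^2}.$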

Equations (10) and (12) are the famous von Karman equations for large
deflection of plates.

In most problems the external loads $m$'s and $P$'s are zero. We then
have the well-known pair of equations²

(13) ▲ $\nabla^4 w = \dfrac{1}{D}\left(p' + \dfrac{\partial^2 F}{\partial y^2}\dfrac{\partial^2 w}{\partial x^2} - 2\dfrac{\partial^2 F}{\partial x \partial y}\dfrac{\partial^2 w}{\partial x \partial y} + \dfrac{\partial^2 F}{\partial x^2}\dfrac{\partial^2 w}{\partial y^2}\right),$

(14) ▲ $\nabla^4 F = Eh\left[\left(\dfrac{\partial^2 w}{\partial x \partial y}\right)^2 - \dfrac{\partial^2 w}{\partial x^2}\dfrac{\partial^2 w}{\partial y^2}\right],$

where $\nabla^4$ is the two-dimensional biharmonic operator.

² These equations were given without proof by von Karman in 1910. Most books give
derivations of these equations without a clear indication as to whether Lagrangian or
Eulerian descriptions were used. The explicit use of Lagrangian description introduces
a degree of clarity not hitherto achieved.

The most important features of Eqs. (13) and (14) are that they are
coupled and nonlinear. The nonlinear terms are fairly easy to interpret.
Equation (8) shows that the membrane stresses $N_x$, $N_y$, $N_{xy}$ multiplied by
the corresponding curvatures and twist, $\partial^2 w/\partial x^2$, $\partial^2 w/\partial y^2$, $\partial^2 w/\partial x \partial y$
respectively, are equivalent to a lateral force. The nonlinear terms in Eq. (10)
or (13) represent such feedback of induced membrane stress on bending. On
the other hand, the reader may recall from differential geometry that if a
surface is represented by the equation $z = w(x, y)$, the total or Gaussian
curvature of the surface is given, for small slopes, by the quantity
$\dfrac{\partial^2 w}{\partial x^2}\dfrac{\partial^2 w}{\partial y^2} - \left(\dfrac{\partial^2 w}{\partial x \partial y}\right)^2$. The
Gaussian curvature of a surface is equal to the product of the principal
curvatures at a point. The Gaussian curvature vanishes if the surface is
developable. Hence, the right-hand side of Eq. (14) vanishes if the deflection
surface is developable. If a flat plate is deformed into a nondevelopable
surface, its middle surface will be stretched in some way and the right-hand
side of Eq. (14) does not vanish. Thus, the nonlinear term on the right-hand
side of Eq. (14) arises from the stretching of the middle surface of a flat
plate due to bending into a nondevelopable surface. When the nonlinear
terms are neglected, these equations reduce to the corresponding equations
of the small-deflection theory.
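
The geometric interpretation of the right-hand side of Eq. (14) can be tried out numerically. The sketch below is an illustration with assumed values and hypothetical function names: it evaluates $Eh[(\partial^2 w/\partial x \partial y)^2 - (\partial^2 w/\partial x^2)(\partial^2 w/\partial y^2)]$ by central differences for three deflection shapes. A cylindrical surface is developable and gives zero, whereas a saddle and a bowl are not developable and give a nonzero source of membrane stretching.

```python
def second_derivatives(w, x, y, d=1e-2):
    """Central-difference estimates of (w_xx, w_yy, w_xy) at the point (x, y)."""
    w_xx = (w(x - d, y) - 2 * w(x, y) + w(x + d, y)) / d ** 2
    w_yy = (w(x, y - d) - 2 * w(x, y) + w(x, y + d)) / d ** 2
    w_xy = (w(x + d, y + d) - w(x + d, y - d)
            - w(x - d, y + d) + w(x - d, y - d)) / (4 * d ** 2)
    return w_xx, w_yy, w_xy


def stretching_source(w, x, y, E, h):
    """E h [ (w_xy)^2 - w_xx w_yy ], the right-hand side of Eq. (14)."""
    w_xx, w_yy, w_xy = second_derivatives(w, x, y)
    return E * h * (w_xy ** 2 - w_xx * w_yy)


# Assumed illustrative data.
E, h, c = 70.0e9, 0.002, 0.01          # Pa, m, 1/m


def cylinder(x, y):
    return c * x ** 2                  # developable: curved in x only


def saddle(x, y):
    return c * x * y                   # pure twist, negative Gaussian curvature


def bowl(x, y):
    return c * (x ** 2 + y ** 2)       # positive Gaussian curvature


point = (0.2, 0.3)
rhs_cylinder = stretching_source(cylinder, *point, E, h)
rhs_saddle = stretching_source(saddle, *point, E, h)
rhs_bowl = stretching_source(bowl, *point, E, h)
print("cylinder:", rhs_cylinder)
print("saddle:  ", rhs_saddle)
print("bowl:    ", rhs_bowl)
```

For these quadratic shapes the finite differences are exact up to rounding, so the saddle gives $Ehc^2$ and the bowl gives $-4Ehc^2$, of opposite sign because their Gaussian curvatures have opposite signs.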

There are a large number of papers on the theory of plates. Some are
devoted to justifying the ad hoc assumptions (H1)-(H6), or to their removal
or generalization; others are concerned with the solution of boundary-value
problems. See Bibliography 16.14. We shall conclude our discussion here,
merely pointing out that the mathematical structure of the in-plane
deformation of plates in small deflection is the same as that of the plane
problems of linear elasticity, whereas the large-deflection equations are
essentially nonlinear. The theory of large deflection of plates includes many
interesting problems on the stability, responses, vibrations, subharmonic
resonances, edge layers, etc. The explicit introduction of both Eulerian
and Lagrangian descriptions of stress and strain clarifies the foundation of
the large-deflection theory of plates. Readers are referred to the literature for
the theory of shells, e.g., Atluri (1984).

Problem 16.16. Section 16.9 establishes that the displacement vector u,
the Almansi strain tensor e, the metric tensor relating the deformed body to
the undeformed body $\mathbf{B}^{-1}$, the left stretch tensor V, the symmetric part of the
deformation rate tensor D, and the Cauchy and Kirchhoff stress tensors $\boldsymbol{\tau}$ and $\boldsymbol{\sigma}$
are objective and that the velocity vector v, the Green strain tensor E, the metric
tensor relating the undeformed body to the deformed body C, the deformation
gradient F, the rotational tensor R, the right stretch tensor U, the Lagrange stress
tensor T, the second Piola-Kirchhoff stress tensor S, the Biot stress tensors r*,
r, the corotational stress tensor ov, the velocity gradient L, the spin rate $\boldsymbol{\Omega}$, and
all stress rates are not objective. Work out the details to prove the results.
INCREMENTAL APPROACH TO
SOLVING SOME NONLINEAR PROBLEMS


The field equations of the mechanics of solids subjected to large deformation
and finite strain are nonlinear with respect to displacements and
velocities. The constitutive equation is linear or nonlinear depending on the
material. The boundary conditions could be nonlinear if the external forces
and the displacements of particles on the boundary are coupled in a certain
nonlinear way. The external loading may depend on the elastic deformation
as in aeroelasticity or in some special structural problems. These features
may result in a nonlinear boundary-value or initial-value problem, which is
difficult to solve. One observes, however, that large deformations or motions
are arrived at through many infinitesimal steps, and for each infinitesimal
step it is possible that the linearized equations prevail in some sense. If
we can solve the linearized equations and keep track of all the infinitesimal
steps, then we can solve the nonlinear problem. This is the basic idea of the
incremental approach. The mathematics of the incremental approach will
be described in this chapter. The incremental approach is used not only
for problems of elasticity. For some problems of plasticity, viscoelasticity,
and biomechanics, it is also a natural approach because sometimes the
constitutive laws of the materials involved in these fields can be described
only incrementally.

In this chapter, we shall derive the linearized equations for the increments
of deformations and stresses, the linearized equations of motion in
terms of these increments, and the corresponding variational principles for
integral formulation. Computational methods for solving these linearized
incremental equations will be discussed in the subsequent chapters. Finite
element modelling of nonlinear elasticity, viscoelasticity, plasticity,
viscoplasticity, and creep is discussed in Chapter 21.

17.1. UPDATED LAGRANGIAN DESCRIPTION 

The following derivation follows mostly that by Atluri and Cazzani
(1995). We describe the deformation as a multi-increment process whose
evolution is described by means of a parameter t, which can actually be time
as in dynamic or creep problems but often it is just the loading magnitude
of a proportional load. Implicit in the assumption is that the solution is
independent of the rate of change of the parameter. Let $D_0$ be the reference
configuration with region $V_0$ occupied by the body at $t = t_0$ and $D_1, D_2, \ldots$
be the sequence of configurations occupied by the body at $t = t_1, t_2, \ldots$,
respectively. Thus, the $D_i$ represent the configurations at different states of
deformation. We shall use $D_N$ to denote the current configuration, $D_{N+1}$
the configuration at the subsequent increment, and $D_0$ the undeformed
state. The incremental approach seeks the incremental solution at $t = t_{N+1}$
with the solution at $t_N$ known. We shall use bold characters such as $\mathbf{A}_I^J(\mathbf{X})$
for a tensor or vector, and non-bold italic characters such as $A_I^J(\mathbf{X})$ for a scalar
at the material point $\mathbf{X}$. The superscript J and the subscript I denote that
the concerned quantity is in the $D_J$ configuration (i.e., at $t = t_J$) but
referred to the configuration $D_I$ (i.e., the configuration at $t = t_I$). For
example, we use $\mathbf{u}_I^J(\mathbf{X})$ to describe the relative displacement vector at the
material point $\mathbf{X}$ between the deformation states $t_I$ and $t_J$. As another
example, the first Piola-Kirchhoff stress tensor $\mathbf{T}$ as defined in Eq. (16.7:4)
is a stress measure in the current configuration (i.e., at $t = t_N$) referred to
the undeformed coordinates. Hence, we have $\mathbf{T} = \mathbf{T}_0^N(\mathbf{X})$, as the subscript
0 refers to the undeformed configuration and the superscript N denotes the
current configuration. On the other hand, $\mathbf{T}_I^N$ is the first Piola-Kirchhoff
stress tensor defined according to the Lagrangian rule Eq. (16.7:15) with
the reference surface $P_0 Q_0 R_0 S_0$ shown in Fig. 16.7:1 being the surface
at the deformation state $t = t_I$, i.e., the $D_I$ configuration. In subsequent
discussion, for simplicity, we shall drop the explicit reference to the material
point $\mathbf{X}$ by simply writing $\mathbf{A}_I^J$, $A_I^J$ for $\mathbf{A}_I^J(\mathbf{X})$, $A_I^J(\mathbf{X})$, respectively, except
that $\mathbf{X}$ is occasionally retained for clarity.

In Sec. 5.2, the material and spatial descriptions of changing configurations
of matter are discussed, and the associated Lagrangian and Eulerian
variables are considered. In solid mechanics, the material or Lagrangian
description is used more often. In the incremental approach, one further
distinguishes the total Lagrangian description and the updated Lagrangian
description. The former refers all quantities of the latest increment to the
reference configuration $D_0$, denoted as $\mathbf{A}_0^{N+1}$ or $A_0^{N+1}$, while the latter
refers quantities to the current configuration, denoted by $\mathbf{A}_N^{N+1}$ or $A_N^{N+1}$.
From a computational point of view, the use of the updated Lagrangian
description requires the updated configuration at each increment, whereas the
total Lagrangian description needs no configuration update but leads to a
much more cumbersome expression. The updated Lagrangian description
is often preferred in practical applications and is used in the subsequent
discussion to establish the linearized equations.
Consider a quantity $\mathbf{A}$, which is associated with a parameter $t$. In the
updated Lagrangian description, the material rate of the quantity corresponding
to the parameter at $t = t_N$ is defined as

(1) ▲   $\mathring{\mathbf{A}} = \dfrac{\mathbf{A}_N^{N+1}(\mathbf{X}) - \mathbf{A}_N^N(\mathbf{X})}{\Delta t_N}$,

where $\mathbf{A}$ can be a tensor, a vector or a scalar (if it is a non-boldfaced italic
character) and $\Delta t_N\,(= t_{N+1} - t_N)$ is assumed to be small. In subsequent
discussion, for simplicity, we shall drop the explicit reference to the material
point $\mathbf{X}$ and the deformation state $N$ by simply using $\mathring{\mathbf{A}}$ and $\Delta t$, except
that $\mathbf{X}$ and $N$ are occasionally included for clarity. Then

(2)   $\mathbf{A}_N^{N+1} = \mathbf{A}_N^N + \mathring{\mathbf{A}}\,\Delta t$.


The incremental approach is to determine $\mathring{\mathbf{A}}$ approximately by linearizing
the governing equations for all material rates at $D_N$ and to evaluate the
incremental solution according to Eq. (2).
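The marching scheme implied by Eq. (2) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `incremental_solve` and the scalar test problem (rate equal to minus the current value) are hypothetical and do not come from the text; in an actual analysis the rate would come from the linearized governing equations at $D_N$.

```python
import math


def incremental_solve(a0, rate, dt, n_steps):
    """March a quantity through n_steps increments using Eq. (2).

    a0      -- value in the initial configuration
    rate    -- callable returning the material rate at the current state
               (hypothetical stand-in for the linearized field equations)
    dt      -- increment of the parameter t (assumed small)
    """
    a = a0
    history = [a]
    for _ in range(n_steps):
        a = a + rate(a) * dt  # A(N+1) = A(N) + rate * dt
        history.append(a)
    return history


# Hypothetical test problem: rate = -A with A(0) = 1; the exact value at t = 1 is exp(-1).
history = incremental_solve(1.0, lambda a: -a, dt=1.0e-3, n_steps=1000)
error = abs(history[-1] - math.exp(-1.0))
```

Because each increment is linearized, the accumulated error shrinks roughly in proportion to the increment size, which is why the increment is assumed small.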

On the other hand, the material rate in the total Lagrangian description 
is defined as 

(3)   $\dfrac{D\mathbf{A}_0^N(\mathbf{X})}{Dt} = \dfrac{\mathbf{A}_0^{N+1}(\mathbf{X}) - \mathbf{A}_0^N(\mathbf{X})}{\Delta t_N}$.

The material rates in the total and updated Lagrangian descriptions are 
related. The relationship depends on the definition of A. We shall examine 
the relationship for specific cases in subsequent sections. 

If $\mathbf{A}$ represents a strain measure and $t$ is actually the time, then in the
limit of $t_{N+1} \to t_N$, the material rate described in Eq. (3) is the strain rate
$\dot{\mathbf{A}}$ defined in Sec. 16.5. Thus, we may approximate the material rate

$\dfrac{D\mathbf{A}_0^N(\mathbf{X})}{Dt} \cong \dot{\mathbf{A}}$

when $\Delta t$ is small but finite, and write

(4)   $\mathbf{A}_0^{N+1} = \mathbf{A}_0^N + \dot{\mathbf{A}}\,\Delta t$.

In general, the material rate $\mathring{\mathbf{A}}$ in the updated Lagrangian description
differs from the material rate $\dot{\mathbf{A}}$. In particular, the definition of stress rate in
the updated Lagrangian description needs much more care. Since stress is
defined by a force vector and a surface, one must consider both the rate of
change of the force vector and the rate of change of the associated surface



590 


INCREMENTAL APPROACH TO SOLVING SOME . . . 


Chap. 17 


with respect to the parameter when one attempts to define a stress rate. 
In fact, stress rate can be defined in a number of ways. By introducing 
the parameter t in the updated and total Lagrangian descriptions, one 
liberalizes but not necessarily simplifies or settles the matter. The simplified 
linearized case will be discussed in Sec. 17.3. 


17.2. LINEARIZED RATES OF DEFORMATION 

We shall use the notations of Sec. 16.2. Let $\mathbf{u}$ denote a relative displacement
vector. Then, $\mathbf{u}_N^N$ is the relative displacement vector in the
same deformation state $N$ and is therefore zero. Hence

(1)   $\mathbf{u}_N^N = \mathbf{0}$.

$\mathbf{F}_N^N$ is the deformation gradient tensor associated with the displacement
field $\mathbf{u}_N^N$ and is then an identity,

(2)   $\mathbf{F}_N^N = \mathbf{I}$.

From the definition in Eqs. (16.2:5) and (16.4:4), we find

(3)   $J_N^N = \det(\mathbf{F}_N^N) = \det(\mathbf{I}) = 1$,

(4)   $\mathbf{U}_N^N = \mathbf{R}_N^N = \mathbf{I}$.

In deriving the approximate material rates in the updated Lagrangian description,
we assume that $\Delta t$ is small. The rate of change of displacement
$\mathbf{u}$ is the velocity-like vector denoted by

(5)   $\mathbf{v} \equiv \mathring{\mathbf{u}} = \dfrac{\mathbf{u}_N^{N+1} - \mathbf{u}_N^N}{\Delta t} = \dfrac{\mathbf{u}_N^{N+1}}{\Delta t}$.

There is no definition for an acceleration-like vector in the updated Lagrangian
description. In subsequent discussion, we shall use “$\equiv$” and “$\cong$”
to denote definition and approximation, respectively.

In general, since the spatial coordinates of quantities are referred to the 
current configuration in the updated Lagrangian description, the material 
rate of a strain measure is equivalent to the corresponding rate in the Eulerian
description. Thus, one can determine the rates of deformation in
the updated Lagrangian description from the rates given in Secs. 16.5 and
16.6 by replacing the undeformed state with the current one as the reference
configuration. From Eqs. (2) and (16.5:3), (16.5:5) and (16.5:6), the
updated Lagrangian rate of the deformation gradient tensor is

(6) ▲   $\mathring{\mathbf{F}} = \dot{\mathbf{F}}_N^N = (\mathbf{L}\cdot\mathbf{F})_N^N = \mathbf{L}_N^N\cdot\mathbf{F}_N^N = \mathbf{L} = \mathbf{D} + \boldsymbol{\Omega}$,

where $\mathbf{L}$ is the velocity-like gradient and $\mathbf{D}$ and $\boldsymbol{\Omega}$ are the symmetric and
antisymmetric parts of $\mathbf{L}$. Recall that $\boldsymbol{\tau}$, $\mathbf{L}$, $\mathbf{D}$, $\boldsymbol{\Omega}$ and all rates $(\mathring{\ })$ are
referred to the current configuration $D_N$ and that $\mathbf{F}$, $\mathbf{R}$ and $J$ are the current
deformation gradient tensor, rotation tensor and Jacobian associated
with zero displacement. Equation (6) can be derived also directly from
Eq. (17.1:1),


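$\mathring{\mathbf{F}} = \dfrac{\mathbf{F}_N^{N+1} - \mathbf{F}_N^N}{\Delta t} = \dfrac{1}{\Delta t}\left[\dfrac{\partial(\mathbf{X}^N + \mathbf{u}_N^{N+1})}{\partial\mathbf{X}^N} - \mathbf{I}\right] = \dfrac{\partial\mathbf{u}_N^{N+1}}{\partial\mathbf{X}^N\,\Delta t} = \dfrac{\partial\mathbf{v}}{\partial\mathbf{X}^N} = \mathbf{L}$.

Since $\mathbf{F} = \mathbf{R}\cdot\mathbf{U}$, one has

(7)   $\mathring{\mathbf{F}} = (\dot{\mathbf{R}}\cdot\mathbf{U} + \mathbf{R}\cdot\dot{\mathbf{U}})_N^N = \mathring{\mathbf{R}}\cdot\mathbf{U}_N^N + \mathbf{R}_N^N\cdot\mathring{\mathbf{U}} = \mathring{\mathbf{U}} + \mathring{\mathbf{R}}$.

Notice that the decomposition of the deformation gradient rate is additive
as opposed to the multiplicative polar decomposition for finite deformation
given in Eq. (16.5:3). Since $\mathbf{R}\cdot\mathbf{R}^T = \mathbf{I}$, its material rate is zero,

$\mathring{\mathbf{R}}\cdot(\mathbf{R}_N^N)^T + \mathbf{R}_N^N\cdot\mathring{\mathbf{R}}^T = \mathring{\mathbf{R}} + \mathring{\mathbf{R}}^T = \mathbf{0}$,

which implies that $\mathring{\mathbf{R}}$ is antisymmetric. Since $\mathbf{U}$ is symmetric, then

$\mathring{\mathbf{U}} = \dfrac{\mathbf{U}_N^{N+1} - \mathbf{U}_N^N}{\Delta t}$

is symmetric. We call $\mathring{\mathbf{U}}$ the updated Lagrangian strain rate tensor and $\mathring{\mathbf{R}}$
the updated Lagrangian spin tensor. A comparison of Eqs. (6) and (7), and
the use of the symmetric and antisymmetric properties of $\mathring{\mathbf{U}}$ and $\mathring{\mathbf{R}}$ give

(8) ▲   $\mathring{\mathbf{U}} \cong \mathbf{D} = (\mathbf{L})_s$,

(9) ▲   $\mathring{\mathbf{R}} \cong \boldsymbol{\Omega} = (\mathbf{L})_a$,

where $(\cdot)_s$ and $(\cdot)_a$ denote the symmetric and antisymmetric parts of the
relevant quantity. Thus, the updated Lagrangian strain rate and the updated
Lagrangian spin rate tensors are the symmetric and antisymmetric parts of
the velocity-like gradient tensor. Similarly, from Eq. (16.6:6), one finds

(10) ▲   $\mathring{J} = \mathrm{tr}(\mathbf{L})\,J_N^N = \mathrm{tr}(\mathbf{L}) = \mathrm{tr}(\mathbf{D})$.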
One can also show that the Cauchy and Almansi tensor rates are 
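The use of the definitions above gives the updated solution of various
quantities

(11)   $\mathbf{F}_N^{N+1} = \mathbf{I} + \mathring{\mathbf{F}}\,\Delta t = \mathbf{I} + \mathbf{L}\,\Delta t$,

(12)   $\mathbf{U}_N^{N+1} = \mathbf{I} + \mathring{\mathbf{U}}\,\Delta t = \mathbf{I} + \mathbf{D}\,\Delta t$,

(13)   $\mathbf{R}_N^{N+1} = \mathbf{I} + \mathring{\mathbf{R}}\,\Delta t = \mathbf{I} + \boldsymbol{\Omega}\,\Delta t$,

(14)   $\mathbf{u}_N^{N+1} = \mathring{\mathbf{u}}\,\Delta t = \mathbf{v}\,\Delta t$,

(15)   $J_N^{N+1} = \det(\mathbf{F}_N^{N+1}) = 1 + \mathring{J}\,\Delta t \cong 1 + \mathrm{tr}(\mathbf{D})\,\Delta t$

in terms of their updated Lagrangian material rates. Since $\mathbf{F} = \mathbf{I} + \nabla\mathbf{u}$, one
has

(16)   $\mathbf{F}_N^{N+1} = \mathbf{I} + \nabla_N\mathbf{u}_N^{N+1} = \mathbf{I} + \nabla_N\mathbf{v}\,\Delta t$,

where $\nabla_N$ is the gradient operator defined in the current configuration.
Comparing Eq. (16) with Eq. (11), one finds

(17)   $\mathbf{L} = \nabla_N\mathbf{v}$.

Note that $\mathbf{L}$, $\mathbf{D}$, $\boldsymbol{\Omega}$ and all variables $(\mathring{\ })$ are referred to the current
configuration $D_N$ and that $\mathbf{F}(= \mathbf{I})$, $\mathbf{R}(= \mathbf{I})$ and $J(= 1)$ are the current deformation
gradient tensor, rotation tensor and Jacobian in the updated Lagrangian
description.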
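The linearized kinematic updates of Eqs. (8)-(15) can be checked numerically. The following is a minimal sketch in plain Python, assuming a Cartesian frame and 3x3 tensors stored as nested lists; the helper names and the sample velocity-like gradient `L_EXAMPLE` are hypothetical and chosen only for illustration.

```python
I3 = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]


def transpose(a):
    """Transpose of a 3x3 matrix stored as nested lists."""
    return [[a[j][i] for j in range(3)] for i in range(3)]


def comb(*terms):
    """Linear combination sum(c * M) over (c, M) pairs of 3x3 matrices."""
    return [[sum(c * m[i][j] for c, m in terms) for j in range(3)]
            for i in range(3)]


def trace(a):
    return a[0][0] + a[1][1] + a[2][2]


def det3(a):
    """Determinant of a 3x3 matrix by cofactor expansion."""
    return (a[0][0] * (a[1][1] * a[2][2] - a[1][2] * a[2][1])
            - a[0][1] * (a[1][0] * a[2][2] - a[1][2] * a[2][0])
            + a[0][2] * (a[1][0] * a[2][1] - a[1][1] * a[2][0]))


def split_velocity_gradient(L):
    """Return (D, Omega), the symmetric and antisymmetric parts of L."""
    Lt = transpose(L)
    return comb((0.5, L), (0.5, Lt)), comb((0.5, L), (-0.5, Lt))


def updated_kinematics(L, dt):
    """Linearized F, U, R and J at state N+1 referred to D_N."""
    D, Om = split_velocity_gradient(L)
    F = comb((1.0, I3), (dt, L))
    U = comb((1.0, I3), (dt, D))
    R = comb((1.0, I3), (dt, Om))
    J_lin = 1.0 + trace(D) * dt
    return F, U, R, J_lin


# Hypothetical velocity-like gradient used only for illustration.
L_EXAMPLE = [[0.10, 0.30, 0.00],
             [-0.20, 0.05, 0.10],
             [0.00, 0.40, -0.02]]
```

Comparing `det3(F)` with `J_lin` for a small `dt` shows that the two differ only by terms of second and higher order in the increment, which is the sense of the approximation sign in Eq. (15).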

With X N denoting the material point in Dm, its corresponding point 
in Dm+i is 

(18) X N+1 = X N + u$ +1 , 

which is the updated coordinates for the next increment. 

The gradient operation $\nabla_N$ is calculated as follows. If the current
configuration is characterized by generalized coordinates $(\xi^i)^N$ with
corresponding contravariant base vectors $(\mathbf{g}^i)^N$, the gradient operator is in
the form

$\nabla_N = \dfrac{\partial(\ )}{\partial(\xi^i)^N}\,(\mathbf{g}^i)^N$   ($N$ not summed).

Then

(19)   $\nabla_N\mathbf{v} = \dfrac{\partial\mathbf{v}}{\partial(\xi^i)^N}\,(\mathbf{g}^i)^N$   ($N$ not summed),

$\phantom{\nabla_N\mathbf{v}} = \mathbf{e}_j\mathbf{e}_i\,\dfrac{\partial v_j}{\partial(X_i)^N}$   (in Cartesian coordinates),

where $\mathbf{e}_i$ are the base vectors of the Cartesian coordinates. Note that in
general $\mathbf{e}_j\mathbf{e}_i \neq \mathbf{e}_i\mathbf{e}_j$ for $i \neq j$.

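One can easily establish the following relations between the updated Lagrangian
quantities and those referenced to the undeformed state:

(20)   $\mathbf{u}_0^{N+1} = \mathbf{u}_0^N + \mathbf{v}\,\Delta t$,

(21)   $\mathbf{F}_0^{N+1} = \dfrac{\partial\mathbf{X}^N}{\partial\mathbf{X}^0} + \dfrac{\partial\mathbf{u}_N^{N+1}}{\partial\mathbf{X}^N}\cdot\dfrac{\partial\mathbf{X}^N}{\partial\mathbf{X}^0} = (\mathbf{I} + \mathbf{L}\,\Delta t)\cdot\mathbf{F}_0^N$,

(22)   $\mathbf{C}_0^{N+1} = (\mathbf{F}_0^{N+1})^T\cdot\mathbf{F}_0^{N+1} = (\mathbf{F}_0^N)^T\cdot[\mathbf{I} + 2\mathbf{D}\,\Delta t + \mathbf{L}^T\cdot\mathbf{L}\,(\Delta t)^2]\cdot\mathbf{F}_0^N$,

(23)   $\mathbf{R}_0^{N+1} = \mathbf{R}_0^N + (\mathbf{R}_0^{N+1} - \mathbf{R}_0^N) = (\mathbf{I} + \boldsymbol{\Omega}\,\Delta t)\cdot\mathbf{R}_0^N$,

(24)   $\mathbf{U}_0^{N+1} = (\mathbf{R}_0^{N+1})^T\cdot\mathbf{F}_0^{N+1} = [\mathbf{I} + (\mathbf{R}_0^N)^T\cdot\mathbf{D}\cdot\mathbf{R}_0^N\,\Delta t]\cdot\mathbf{U}_0^N$.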

In Eq. (22), the second order term is retained to derive the constitutive 
laws in Sec. 17.5. Equation (20) is by definition and Eqs. (21) and (22) 
follow. Equations (23) and (24) are approximate. 


17.3. LINEARIZED RATES OF STRESS MEASURES 

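The Cauchy stress tensor in the current configuration is $\boldsymbol{\tau}_N^N$. For
simplicity, we shall drop the super- and subscript $N$ and denote it as $\boldsymbol{\tau}$.
The Kirchhoff stress tensor $\boldsymbol{\sigma}$, the first and second Piola-Kirchhoff stress
tensors $\mathbf{T}$ and $\mathbf{S}$, and the Biot-Lur’e stress tensor $\mathbf{r}^*$ referring to $D_N$ are

(1)   $\boldsymbol{\sigma}_N^N = J_N^N\,\boldsymbol{\tau}_N^N = \boldsymbol{\tau}$,

(2)   $\mathbf{T}_N^N = J_N^N\,(\mathbf{F}_N^N)^{-1}\cdot\boldsymbol{\tau}_N^N = \boldsymbol{\tau}$,

(3)   $\mathbf{S}_N^N = \mathbf{T}_N^N\cdot[(\mathbf{F}_N^N)^{-1}]^T = \boldsymbol{\tau}$,

(4)   $\mathbf{r}_N^{*N} = J_N^N\,(\mathbf{F}_N^N)^{-1}\cdot\boldsymbol{\tau}_N^N\cdot\mathbf{R}_N^N = \boldsymbol{\tau}$.

In other words, all updated Lagrangian stress tensors in the current configuration
are equal to the Cauchy stress tensor. We shall determine the
material rates of these quantities in the updated Lagrangian description.
First, consider the first Piola-Kirchhoff stress tensor $\mathbf{T}$ whose material rate
in the updated Lagrangian description is

$\mathring{\mathbf{T}} = \dfrac{\mathbf{T}_N^{N+1} - \mathbf{T}_N^N}{\Delta t}$,

which gives

(5)   $\mathbf{T}_N^{N+1} = \boldsymbol{\tau} + \mathring{\mathbf{T}}\,\Delta t$.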




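From $\mathbf{T}_N^{N+1}$, one can evaluate the updated Cauchy stress tensor $\boldsymbol{\tau}_{N+1}^{N+1}$ in
the $(N+1)$th configuration. Since the Cauchy stress tensor is independent
of the reference configuration, we can simply denote $\boldsymbol{\tau}_{N+1}^{N+1}$ as $\boldsymbol{\tau}^{N+1}$. Using
Eq. (16.7:21), we have

$\boldsymbol{\tau}^{N+1} = \mathbf{F}_N^{N+1}\cdot\mathbf{T}_N^{N+1}/J_N^{N+1}$.

In the linearized form, the Cauchy stress in the incremented configuration
becomes

(6a)   $\boldsymbol{\tau}^{N+1} \cong (\mathbf{I} + \mathbf{L}\,\Delta t)\cdot(\boldsymbol{\tau} + \mathring{\mathbf{T}}\,\Delta t)/[1 + \mathrm{tr}(\mathbf{L})\,\Delta t]$
       $\cong \boldsymbol{\tau} + [\mathring{\mathbf{T}} + \mathbf{L}\cdot\boldsymbol{\tau} - \mathrm{tr}(\mathbf{L})\,\boldsymbol{\tau}]\,\Delta t$,

which will be used to determine the solution for the next increment. Since
$\mathring{\boldsymbol{\tau}}$ is the limit of the solution of the following equation as $\Delta t \to 0$,

(6b)   $\boldsymbol{\tau}^{N+1} = \boldsymbol{\tau} + \mathring{\boldsymbol{\tau}}\,\Delta t$,

we can equate Eqs. (6a) and (6b) to determine the Cauchy stress rate $\mathring{\boldsymbol{\tau}}$ in
the updated Lagrangian description. We obtain:

(7)   $\mathring{\boldsymbol{\tau}} \cong \mathring{\mathbf{T}} + \mathbf{L}\cdot\boldsymbol{\tau} - \mathrm{tr}(\mathbf{L})\,\boldsymbol{\tau}$,   or   $\mathring{\mathbf{T}} = \mathring{\boldsymbol{\tau}} - \mathbf{L}\cdot\boldsymbol{\tau} + \mathrm{tr}(\mathbf{L})\,\boldsymbol{\tau}$.

By this choice, we adopted a definition of the stress rate $\mathring{\boldsymbol{\tau}}$ from among many
possible definitions mentioned at the end of Sec. 17.1. Equation (7) can be
obtained also directly by expressing the material derivative of the Cauchy
stress tensor in the current configuration. Since $\boldsymbol{\tau} = \mathbf{F}\cdot\mathbf{T}/J$ [Eq. (16.7:22)],
one derives

$\mathring{\boldsymbol{\tau}} = \mathring{\mathbf{F}}\cdot\mathbf{T}_N^N/J_N^N + \mathbf{F}_N^N\cdot\mathring{\mathbf{T}}/J_N^N - \mathbf{F}_N^N\cdot\mathbf{T}_N^N\,\mathring{J}/(J_N^N)^2 = \mathring{\mathbf{T}} + \mathbf{L}\cdot\boldsymbol{\tau} - \mathring{J}\,\boldsymbol{\tau}$.

Because $\boldsymbol{\tau}^{N+1}$, $\boldsymbol{\tau}$ and $\mathrm{tr}(\mathbf{L})\,\boldsymbol{\tau}$ are symmetric, so is $\mathring{\mathbf{T}} + \mathbf{L}\cdot\boldsymbol{\tau}$, hence the
antisymmetric part of $\mathring{\mathbf{T}} + \mathbf{L}\cdot\boldsymbol{\tau}$ vanishes:

(8)   $(\mathring{\mathbf{T}} + \mathbf{L}\cdot\boldsymbol{\tau})_a = \mathbf{0}$.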
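The step from Eq. (6a) to Eq. (7) drops terms of second order in the increment. The sketch below, in plain Python with 3x3 nested lists, compares the unexpanded product with its linearized form. All function names and the numerical values of the stress, stress rate and velocity-like gradient are hypothetical and serve only as an illustration.

```python
I3 = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]


def matmul(a, b):
    """Product a . b of two 3x3 matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def comb(*terms):
    """Linear combination sum(c * M) over (c, M) pairs of 3x3 matrices."""
    return [[sum(c * m[i][j] for c, m in terms) for j in range(3)]
            for i in range(3)]


def trace(a):
    return a[0][0] + a[1][1] + a[2][2]


def det3(a):
    return (a[0][0] * (a[1][1] * a[2][2] - a[1][2] * a[2][1])
            - a[0][1] * (a[1][0] * a[2][2] - a[1][2] * a[2][0])
            + a[0][2] * (a[1][0] * a[2][1] - a[1][1] * a[2][0]))


def max_abs_diff(a, b):
    return max(abs(a[i][j] - b[i][j]) for i in range(3) for j in range(3))


def first_pk_rate(tau, tau_rate, L):
    """Rate of the first Piola-Kirchhoff stress from the Cauchy rate, Eq. (7)."""
    return comb((1.0, tau_rate), (-1.0, matmul(L, tau)), (trace(L), tau))


def cauchy_update_full(tau, T_rate, L, dt):
    """Unexpanded product F . T / J with F = I + L dt and T = tau + T_rate dt."""
    F = comb((1.0, I3), (dt, L))
    T_new = comb((1.0, tau), (dt, T_rate))
    return comb((1.0 / det3(F), matmul(F, T_new)))


def cauchy_update_linearized(tau, T_rate, L, dt):
    """First-order form: tau + [T_rate + L . tau - tr(L) tau] dt."""
    rate = comb((1.0, T_rate), (1.0, matmul(L, tau)), (-trace(L), tau))
    return comb((1.0, tau), (dt, rate))


# Hypothetical data used only for illustration.
TAU = [[100.0, 20.0, 0.0], [20.0, 50.0, -10.0], [0.0, -10.0, 30.0]]
TAU_RATE = [[5.0, 1.0, 2.0], [1.0, -3.0, 0.0], [2.0, 0.0, 4.0]]
L_EXAMPLE = [[0.10, 0.30, 0.00], [-0.20, 0.05, 0.10], [0.00, 0.40, -0.02]]
```

With the rate of the first Piola-Kirchhoff stress built from a symmetric Cauchy rate, the linearized update reproduces the Cauchy stress plus its rate times the increment, and the unexpanded product differs from it only at second order.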


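With $\mathring{\boldsymbol{\tau}}$ as defined in Eq. (7), we can determine the updated Lagrangian
rates of the Kirchhoff stress tensor $\boldsymbol{\sigma}(= J\boldsymbol{\tau})$, the Biot-Lur’e stress tensor
$\mathbf{r}^*(= \mathbf{F}^{-1}\cdot\boldsymbol{\tau}\cdot\mathbf{R}\,J)$, the second Piola-Kirchhoff stress tensor $\mathbf{S}[= \mathbf{T}\cdot(\mathbf{F}^{-1})^T]$
and the Green-Naghdi stress tensor $\boldsymbol{\sigma}_r(= J\,\mathbf{R}^T\cdot\boldsymbol{\tau}\cdot\mathbf{R})$. On examining
the equations

$\boldsymbol{\sigma}_N^{N+1} = \boldsymbol{\tau} + \mathring{\boldsymbol{\sigma}}\,\Delta t = J_N^{N+1}\,\boldsymbol{\tau}^{N+1}$
   $\cong (1 + \mathring{J}\,\Delta t)(\boldsymbol{\tau} + \mathring{\boldsymbol{\tau}}\,\Delta t) \cong \boldsymbol{\tau} + (\mathring{J}\,\boldsymbol{\tau} + \mathring{\boldsymbol{\tau}})\,\Delta t$,

$\mathbf{r}_N^{*N+1} = \boldsymbol{\tau} + \mathring{\mathbf{r}}^*\,\Delta t = (\mathbf{F}^{-1})_N^{N+1}\cdot\boldsymbol{\tau}^{N+1}\cdot\mathbf{R}_N^{N+1}\,J_N^{N+1}$
   $\cong (\mathbf{I} - \mathbf{L}\,\Delta t)\cdot(\boldsymbol{\tau} + \mathring{\boldsymbol{\tau}}\,\Delta t)\cdot(\mathbf{I} + \mathring{\mathbf{R}}\,\Delta t)[1 + \mathrm{tr}(\mathbf{D})\,\Delta t]$
   $\cong \boldsymbol{\tau} + [\mathring{\boldsymbol{\tau}} + \boldsymbol{\tau}\,\mathrm{tr}(\mathbf{D}) - \mathbf{L}\cdot\boldsymbol{\tau} + \boldsymbol{\tau}\cdot\boldsymbol{\Omega}]\,\Delta t$,

$\mathbf{S}_N^{N+1} = \boldsymbol{\tau} + \mathring{\mathbf{S}}\,\Delta t = \mathbf{T}_N^{N+1}\cdot[(\mathbf{F}^{-1})_N^{N+1}]^T$
   $\cong (\boldsymbol{\tau} + \mathring{\mathbf{T}}\,\Delta t)\cdot(\mathbf{I} - \mathbf{L}^T\,\Delta t) \cong \boldsymbol{\tau} + (\mathring{\mathbf{T}} - \boldsymbol{\tau}\cdot\mathbf{L}^T)\,\Delta t$,

$(\boldsymbol{\sigma}_r)_N^{N+1} = \boldsymbol{\tau} + \mathring{\boldsymbol{\sigma}}_r\,\Delta t$
   $\cong (\mathbf{I} + \boldsymbol{\Omega}^T\,\Delta t)\cdot(\boldsymbol{\tau} + \mathring{\boldsymbol{\tau}}\,\Delta t)\cdot(\mathbf{I} + \boldsymbol{\Omega}\,\Delta t)[1 + \mathrm{tr}(\mathbf{D})\,\Delta t]$
   $\cong \boldsymbol{\tau} + [\mathring{\boldsymbol{\tau}} + \boldsymbol{\Omega}^T\cdot\boldsymbol{\tau} + \boldsymbol{\tau}\cdot\boldsymbol{\Omega} + \mathrm{tr}(\mathbf{D})\,\boldsymbol{\tau}]\,\Delta t$,

one can derive the following rates of stress measures:

(9) ▲   $\mathring{\boldsymbol{\sigma}} = \mathring{\boldsymbol{\tau}} + \mathring{J}\,\boldsymbol{\tau} = \mathring{\boldsymbol{\tau}} + \mathrm{tr}(\mathbf{D})\,\boldsymbol{\tau}$,

(10) ▲   $\mathring{\mathbf{r}}^* \cong \mathring{\boldsymbol{\tau}} - \mathbf{L}\cdot\boldsymbol{\tau} + \mathrm{tr}(\mathbf{D})\,\boldsymbol{\tau} + \boldsymbol{\tau}\cdot\boldsymbol{\Omega}$,

(11) ▲   $\mathring{\mathbf{S}} = \mathring{\mathbf{T}} - \boldsymbol{\tau}\cdot\mathbf{L}^T = \mathring{\boldsymbol{\tau}} - \mathbf{L}\cdot\boldsymbol{\tau} - \boldsymbol{\tau}\cdot\mathbf{L}^T + \mathrm{tr}(\mathbf{D})\,\boldsymbol{\tau}$,

(12) ▲   $\mathring{\boldsymbol{\sigma}}_r \cong \mathring{\boldsymbol{\tau}} - \boldsymbol{\Omega}\cdot\boldsymbol{\tau} + \boldsymbol{\tau}\cdot\boldsymbol{\Omega} + \mathrm{tr}(\mathbf{D})\,\boldsymbol{\tau}$.

Note that $\mathrm{tr}(\mathbf{D}) = \mathrm{tr}(\mathbf{L})$. From Eqs. (7) and (10), one establishes the relation
between the rates of $\mathbf{T}$ and $\mathbf{r}^*$ in the updated Lagrangian description as

$\mathring{\mathbf{r}}^* = \mathring{\mathbf{T}} + \boldsymbol{\tau}\cdot\boldsymbol{\Omega}$.

Also from Eq. (10), since $\mathring{\boldsymbol{\tau}}$ and $\boldsymbol{\tau}$ are symmetric,

$(\mathring{\mathbf{r}}^* + \mathbf{L}\cdot\boldsymbol{\tau} - \boldsymbol{\tau}\cdot\boldsymbol{\Omega})_a = (\mathring{\mathbf{r}}^* + \mathbf{D}\cdot\boldsymbol{\tau})_a = \mathbf{0}$.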
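The conversions among the updated Lagrangian stress rates can be collected in one routine. The sketch below is a plain-Python illustration in a Cartesian frame; the function `stress_rates`, its dictionary keys and the sample data are hypothetical names introduced here, while the formulas inside follow Eqs. (7) and (9)-(12).

```python
def matmul(a, b):
    """Product a . b of two 3x3 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def transpose(a):
    return [[a[j][i] for j in range(3)] for i in range(3)]


def comb(*terms):
    """Linear combination sum(c * M) over (c, M) pairs of 3x3 matrices."""
    return [[sum(c * m[i][j] for c, m in terms) for j in range(3)]
            for i in range(3)]


def trace(a):
    return a[0][0] + a[1][1] + a[2][2]


def stress_rates(tau, tau_rate, L):
    """Updated Lagrangian rates of several stress measures.

    tau, tau_rate -- Cauchy stress and its updated Lagrangian rate (symmetric)
    L             -- velocity-like gradient in the current configuration
    """
    Lt = transpose(L)
    D = comb((0.5, L), (0.5, Lt))
    Om = comb((0.5, L), (-0.5, Lt))
    trD = trace(D)
    L_tau, tau_Lt = matmul(L, tau), matmul(tau, Lt)
    Om_tau, tau_Om = matmul(Om, tau), matmul(tau, Om)
    return {
        "first_pk": comb((1.0, tau_rate), (-1.0, L_tau), (trD, tau)),
        "kirchhoff": comb((1.0, tau_rate), (trD, tau)),
        "biot_lure": comb((1.0, tau_rate), (-1.0, L_tau), (trD, tau), (1.0, tau_Om)),
        "second_pk": comb((1.0, tau_rate), (-1.0, L_tau), (-1.0, tau_Lt), (trD, tau)),
        "green_naghdi": comb((1.0, tau_rate), (-1.0, Om_tau), (1.0, tau_Om), (trD, tau)),
    }


# Hypothetical data used only for illustration.
TAU = [[100.0, 20.0, 0.0], [20.0, 50.0, -10.0], [0.0, -10.0, 30.0]]
TAU_RATE = [[5.0, 1.0, 2.0], [1.0, -3.0, 0.0], [2.0, 0.0, 4.0]]
L_EXAMPLE = [[0.10, 0.30, 0.00], [-0.20, 0.05, 0.10], [0.00, 0.40, -0.02]]
RATES = stress_rates(TAU, TAU_RATE, L_EXAMPLE)
```

Only the second Piola-Kirchhoff, Kirchhoff and Green-Naghdi rates come out symmetric; the first Piola-Kirchhoff and Biot-Lur’e rates are not, which is why the antisymmetric conditions above are needed.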

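The updated Cauchy stress $\boldsymbol{\tau}^{N+1}\,(= \boldsymbol{\tau}^N + \mathring{\boldsymbol{\tau}}\,\Delta t)$ is evaluated from
Eq. (6b) and the stress rate $\mathring{\boldsymbol{\tau}}$ in the updated Lagrangian description is
evaluated from Eq. (7), or Eqs. (9)-(12). The choice of the specific expression
for $\mathring{\boldsymbol{\tau}}$ will depend on which of the stress rates $\mathring{\mathbf{T}}$, $\mathring{\mathbf{r}}^*$ and $\mathring{\mathbf{S}}$ is directly
available. Depending on the solution approach, one solves for $\mathring{\mathbf{T}}$, $\mathring{\mathbf{r}}^*$ or $\mathring{\mathbf{S}}$
directly and then determines $\mathring{\boldsymbol{\tau}}$. If one uses $\mathring{\mathbf{T}}$ or $\mathring{\mathbf{r}}^*$, one needs the value
of the spin rate $\boldsymbol{\Omega}$, i.e., the antisymmetric part of $\mathbf{L}$. The accuracy of $\boldsymbol{\Omega}$
strongly affects the accuracy of the overall solution (Iura and Atluri 1992).
Details will be discussed later in the numerical implementation sections.

Note that $\boldsymbol{\sigma}_N^N$, $\mathbf{T}_N^N$, $\mathbf{S}_N^N$, $\mathbf{r}_N^{*N}$ all equal $\boldsymbol{\tau}_N^N\,(= \boldsymbol{\tau}^N = \boldsymbol{\tau})$, and therefore
differ from $\boldsymbol{\sigma}_0^N$, $\mathbf{T}_0^N$, $\mathbf{S}_0^N$, $\mathbf{r}_0^{*N}$, respectively, with the latter referring to the
undeformed coordinates. However, we have $\boldsymbol{\tau}_N^N = \boldsymbol{\tau}_0^N$, because the Cauchy
stress tensor is the stress at the deformed state $D_N$ and is independent of the
original configuration. This distinction is important for the determination
of the constitutive relations. The rates of the stress measures in the total
Lagrangian description are simply those of $\mathbf{T}_0^N$, $\boldsymbol{\sigma}_0^N$, $\boldsymbol{\tau}_0^N$, $\mathbf{S}_0^N$, $\mathbf{r}_0^{*N}$. The
rates of $\boldsymbol{\tau}$ in both descriptions are the same because $\boldsymbol{\tau}_0^N = \boldsymbol{\tau}_N^N$, i.e., $\mathring{\boldsymbol{\tau}} = \dot{\boldsymbol{\tau}}$.
From Eq. (16.7:3), the rate of $\boldsymbol{\sigma}_0^N$ is

$\dfrac{D\boldsymbol{\sigma}}{Dt} = J\,\dfrac{D\boldsymbol{\tau}}{Dt} + \dfrac{DJ}{Dt}\,\boldsymbol{\tau} = J\,[\dot{\boldsymbol{\tau}} + \mathrm{tr}(\mathbf{D})\,\boldsymbol{\tau}]$,

in which the rate for $J$ in Eq. (16.6:6) has been used and the superscript “$N$”
and the subscript “0” have been dropped to make the notation consistent
with Sec. 16.6. One can also obtain the material rates of $\mathbf{T}_0^N$, $\mathbf{S}_0^N$, $\mathbf{r}_0^{*N}$ by
taking the time derivatives of Eqs. (16.7:4,6,8). The results are given in
Eqs. (16.9:17)-(16.9:20). Comparing them with Eqs. (7) and (9)-(12) with
$\mathring{\boldsymbol{\tau}} = \dot{\boldsymbol{\tau}}$, we obtain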


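(13)   $\mathring{\mathbf{T}} = \mathbf{F}\cdot\dfrac{D\mathbf{T}}{Dt}\,\dfrac{1}{J}$,

(14)   $\mathring{\boldsymbol{\sigma}} = \dfrac{D\boldsymbol{\sigma}}{Dt}\,\dfrac{1}{J}$,

(15)   $\mathring{\mathbf{r}}^* = \mathbf{F}\cdot\dfrac{D\mathbf{r}^*}{Dt}\cdot\mathbf{R}^T\,\dfrac{1}{J}$,

(16)   $\mathring{\mathbf{S}} = \mathbf{F}\cdot\dfrac{D\mathbf{S}}{Dt}\cdot\mathbf{F}^T\,\dfrac{1}{J}$,

(17)   $\mathring{\boldsymbol{\tau}} = \dfrac{D\boldsymbol{\tau}}{Dt}$,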


which give the relation between the time rates in the updated Lagrangian 
description and those in the total Lagrangian description. For materials 
which have well defined strain energy (called hyper-elastic materials), the 
energy density function is often in terms of strain measures referred to the 
undeformed state. The material rates of stresses in the total Lagrangian 
description can be expressed in terms of the derivatives of the energy density
function with respect to deformation. In turn, as will be shown later,
we can determine the material rates of stresses in the updated Lagrangian
description from the derivatives of the energy density function.

Recall that $\boldsymbol{\tau}$, $\mathbf{L}$, $\mathbf{D}$, $\boldsymbol{\Omega}$, $\mathbf{F}$, $\mathbf{R}$, $J$ and all concerned variables $(\mathring{\ })$ refer
to the current configuration $D_N$ and that $\mathbf{F}$, $\mathbf{R}$ and $J$ are the deformation
gradient tensor, the rotation tensor and the Jacobian.

Problem 17.1. Prove that $D\mathbf{F}^{-1}/Dt = -\mathbf{F}^{-1}\cdot\mathbf{L}$, where $\mathbf{F}$ is the deformation
gradient tensor and $\mathbf{L}$ is the velocity gradient tensor.

17.4. INCREMENTAL EQUATIONS OF MOTION 

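The equation of motion in terms of the Cauchy stress in $D_N$ is

(1)   $\nabla_N\cdot\boldsymbol{\tau} + \rho_N\,\mathbf{b}^N = \mathbf{0}$,

where $\nabla_N\cdot$ is the divergence operator, $\rho_N$ the density and $\mathbf{b}^N$ the sum of the
body force and the inertia force per unit mass referring to the deformation
state in $D_N$. The inertia force is mass times acceleration, and is a function
of the rigid body motion and the deformation of the body. The acceleration
must be updated in real time. Equation (1) is the equation of equilibrium,
which, when expressed in terms of $\mathbf{T}_N^{N+1}$ in $D_N$, becomes

(2)   $\nabla_N\cdot\mathbf{T}_N^{N+1} + \rho_N\,\mathbf{b}_N^{N+1} = \mathbf{0}$.

Using $\mathbf{b}_N^{N+1} = \mathbf{b}^N + \mathring{\mathbf{b}}\,\Delta t$ and $\mathbf{T}_N^N = \boldsymbol{\tau}^N$, one gets

$\nabla_N\cdot(\boldsymbol{\tau} + \mathring{\mathbf{T}}\,\Delta t) + \rho_N(\mathbf{b}^N + \mathring{\mathbf{b}}\,\Delta t) = \mathbf{0}$,

or

(3) ▲   $\nabla_N\cdot\mathring{\mathbf{T}} + \rho_N\,\mathring{\mathbf{b}} = \mathbf{0}$,

where $\mathring{\mathbf{b}}$ is the body force rate in the updated Lagrangian description.
Equation (3) is linear in $\mathring{\mathbf{T}}$ and is similar in form to the equation of motion in
terms of $\mathbf{T}_N^{N+1}$ in Eq. (2), but $\mathring{\mathbf{b}}$ can be a nonlinear function of strain, strain
rate, displacement and velocities. When the system is in equilibrium so that
the inertia force vanishes, then the effect of large deformation appears only
implicitly in the equilibrium equation through the gradient operator $\nabla_N$.
A substitution of $\mathbf{T}_N^{N+1} = \mathbf{r}_N^{*N+1}\cdot(\mathbf{R}_N^{N+1})^T$ into Eq. (2) yields

$\nabla_N\cdot[\mathbf{r}_N^{*N+1}\cdot(\mathbf{R}_N^{N+1})^T] + \rho_N\,\mathbf{b}_N^{N+1} = \mathbf{0}$.

Using $\mathbf{r}_N^{*N} = \boldsymbol{\tau}$ and $\mathbf{R}_N^{N+1}$ from Eq. (17.2:13), one finds

$\nabla_N\cdot[(\boldsymbol{\tau} + \mathring{\mathbf{r}}^*\,\Delta t)\cdot(\mathbf{I} + \boldsymbol{\Omega}\,\Delta t)^T] + \rho_N(\mathbf{b}^N + \mathring{\mathbf{b}}\,\Delta t) = \mathbf{0}$.

Expanding and linearizing the equation above and taking into account that
$\boldsymbol{\Omega}^T = -\boldsymbol{\Omega}$, one obtains

(4)   $\nabla_N\cdot(\mathring{\mathbf{r}}^* - \boldsymbol{\tau}\cdot\boldsymbol{\Omega}) + \rho_N\,\mathring{\mathbf{b}} = \mathbf{0}$,

the equilibrium equation for the Biot-Lur’e stress rate. Equation (4) shows
explicitly the effect of large deformation through the term $\boldsymbol{\tau}\cdot\boldsymbol{\Omega}$. The
large deformation also affects the equilibrium equation implicitly through
the gradient operator $\nabla_N$. Similarly, the replacement of $\mathbf{T}$ by $\mathbf{S}\cdot\mathbf{F}^T$ in
Eq. (2),

$\nabla_N\cdot[(\boldsymbol{\tau} + \mathring{\mathbf{S}}\,\Delta t)\cdot(\mathbf{I} + \mathbf{L}^T\,\Delta t)] + \rho_N(\mathbf{b}^N + \mathring{\mathbf{b}}\,\Delta t) = \mathbf{0}$,

gives

(5) ▲   $\nabla_N\cdot(\mathring{\mathbf{S}} + \boldsymbol{\tau}\cdot\mathbf{L}^T) + \rho_N\,\mathring{\mathbf{b}} = \mathbf{0}$,

the equilibrium equation for the second Piola-Kirchhoff stress rate with
$\mathring{\mathbf{b}}$ representing the rate of inertia and body forces. In this case, the large
deformation effects are through the term $\boldsymbol{\tau}\cdot\mathbf{L}^T$, the gradient operator, and
the rate of inertia force $\mathring{\mathbf{b}}$. The equilibrium equation for the corotational
stress rate in the updated Lagrangian approach is

(6) ▲   $\nabla_N\cdot(\mathring{\boldsymbol{\sigma}}_r - \mathbf{D}\cdot\boldsymbol{\tau} - \boldsymbol{\tau}\cdot\boldsymbol{\Omega}) + \rho_N\,\mathring{\mathbf{b}} = \mathbf{0}$.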

17.5. CONSTITUTIVE LAWS 

A constitutive law describes the mechanical properties of a material. 
The law should be independent of the frame of reference. Hence, ideally 
every term in a constitutive law must be objective, i.e., unaffected by trans¬ 
lation and rotation of the frame of reference. A stress rate can be brought 
about not only by the strain rate D but also by the spin rate ft of the mate¬ 
rial element. However, if the strain rate is zero or the local material motion 
is characterized by pure-spin, objectivity requires that there is no change 
in the stress noted by an observer who is spinning along with the material.
Therefore, in establishing a consistent rate constitutive law, we must define
a stress rate that does not change in the body-fixed coordinates when the
body undergoes a rigid spin (Truesdell and Noll 1965, Atluri 1980, 1984,
Reed and Atluri 1983, Rubinstein and Atluri 1983). Such a rate is called 
an objective rate. We then assume that the objective stress rate is a func¬ 
tion of the strain rate only (not a function of the spin rate) and postulate 
the relation between the strain rate and the objective stress rate in the 
spinning coordinate system. 

Objective tensors are discussed in Sec. 16.9. The updated stress rates 
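have been mentioned in Sec. 17.3. Stresses and their rates in the updated
Lagrangian description are in general not objective. However, for an objective
tensor $\mathbf{A}$ and its updated Lagrangian rate $\mathring{\mathbf{A}}$, the following rate tensors
are objective:

(1)   $\overset{J}{\mathbf{A}} = \mathring{\mathbf{A}} - \boldsymbol{\Omega}\cdot\mathbf{A} + \mathbf{A}\cdot\boldsymbol{\Omega}$   (Jaumann rate),

(2)   $\overset{\nabla}{\mathbf{A}} = \mathring{\mathbf{A}} - \mathbf{L}\cdot\mathbf{A} - \mathbf{A}\cdot\mathbf{L}^T$   (Oldroyd rate).

Here $(\mathring{\ })$, $(\overset{J}{\ })$, and $(\overset{\nabla}{\ })$ denote, respectively, the updated Lagrangian,
Jaumann, and Oldroyd rates of the relevant quantity in the parenthesis, $\boldsymbol{\Omega}$ is
the spin tensor and $\mathbf{L}$ is the velocity gradient tensor. The Jaumann and
Oldroyd rates of an objective tensor $\mathbf{A}$ differ from one another by terms
that are bilinear in $\mathbf{A}$ and the deformation rate tensor $\mathbf{L}$ or the spin rate
tensor $\boldsymbol{\Omega}$. There are many objective rates of an objective tensor $\mathbf{A}$, but the
Jaumann and the Oldroyd rates are the best known.

For the Kirchhoff stress, the Jaumann and Oldroyd rates are, respectively,

(3)   $\overset{J}{\boldsymbol{\sigma}} = \mathring{\boldsymbol{\sigma}} - \boldsymbol{\Omega}\cdot\boldsymbol{\sigma}_N^N + \boldsymbol{\sigma}_N^N\cdot\boldsymbol{\Omega}$
       $= \mathring{\boldsymbol{\tau}} + \mathrm{tr}(\mathbf{D})\,\boldsymbol{\tau} - \boldsymbol{\Omega}\cdot\boldsymbol{\tau} + \boldsymbol{\tau}\cdot\boldsymbol{\Omega} = \overset{J}{\boldsymbol{\tau}} + \mathrm{tr}(\mathbf{D})\,\boldsymbol{\tau}$   (Jaumann rate),

(4)   $\overset{\nabla}{\boldsymbol{\sigma}} = \mathring{\boldsymbol{\sigma}} - \mathbf{L}\cdot\boldsymbol{\sigma}_N^N - \boldsymbol{\sigma}_N^N\cdot\mathbf{L}^T$
       $= \mathring{\boldsymbol{\tau}} + \mathrm{tr}(\mathbf{D})\,\boldsymbol{\tau} - \mathbf{L}\cdot\boldsymbol{\tau} - \boldsymbol{\tau}\cdot\mathbf{L}^T = \overset{\nabla}{\boldsymbol{\tau}} + \mathrm{tr}(\mathbf{D})\,\boldsymbol{\tau}$   (Oldroyd rate).

Clearly the two rates are linked by the following equation

(5)   $\overset{J}{\boldsymbol{\sigma}} = \overset{\nabla}{\boldsymbol{\sigma}} + \mathbf{D}\cdot\boldsymbol{\tau} + \boldsymbol{\tau}\cdot\mathbf{D}$.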

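The relation between the rate of various stress measures and the Jaumann
rate of the Kirchhoff stress tensor in the updated Lagrangian description
can be derived from Eqs. (17.3:7) and (17.3:9)-(17.3:12). One finds

(6) ▲   $\mathring{\mathbf{T}} = \mathring{\boldsymbol{\tau}} + \mathrm{tr}(\mathbf{D})\,\boldsymbol{\tau} - \boldsymbol{\Omega}\cdot\boldsymbol{\tau} - \mathbf{D}\cdot\boldsymbol{\tau} = \overset{J}{\boldsymbol{\sigma}} - \mathbf{D}\cdot\boldsymbol{\tau} - \boldsymbol{\tau}\cdot\boldsymbol{\Omega}$,

(7) ▲   $\mathring{\mathbf{r}}^* = \mathring{\boldsymbol{\tau}} + \mathrm{tr}(\mathbf{D})\,\boldsymbol{\tau} - \boldsymbol{\Omega}\cdot\boldsymbol{\tau} + \boldsymbol{\tau}\cdot\boldsymbol{\Omega} - \mathbf{D}\cdot\boldsymbol{\tau} = \overset{J}{\boldsymbol{\sigma}} - \mathbf{D}\cdot\boldsymbol{\tau}$,

(8) ▲   $\mathring{\mathbf{S}} = \mathring{\mathbf{T}} - \boldsymbol{\tau}\cdot\mathbf{L}^T = \overset{J}{\boldsymbol{\sigma}} - \mathbf{D}\cdot\boldsymbol{\tau} - \boldsymbol{\tau}\cdot\mathbf{D} = \overset{\nabla}{\boldsymbol{\sigma}}$,

(8a) ▲   $\mathring{\boldsymbol{\sigma}}_r = \overset{J}{\boldsymbol{\sigma}}$.

Clearly $\mathring{\boldsymbol{\sigma}}_r$ is objective. From Eqs. (5) and (8), one can show that $\mathring{\mathbf{S}}$ is also
objective. Note that $\mathring{\mathbf{T}}$ and $\mathring{\mathbf{r}}^*$ are not objective.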
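The role of the objective rates can be illustrated numerically. The sketch below, in plain Python, implements the Jaumann and Oldroyd rates as defined above and applies them to two cases: a pure spin in which the stress simply rotates with the material, and a general velocity-like gradient. The function names and sample data are hypothetical. The pure-spin case assumes a stress that rotates rigidly with the material, so that its updated Lagrangian rate is the spin times the stress minus the stress times the spin.

```python
def matmul(a, b):
    """Product a . b of two 3x3 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def transpose(a):
    return [[a[j][i] for j in range(3)] for i in range(3)]


def comb(*terms):
    """Linear combination sum(c * M) over (c, M) pairs of 3x3 matrices."""
    return [[sum(c * m[i][j] for c, m in terms) for j in range(3)]
            for i in range(3)]


def max_abs(a):
    return max(abs(a[i][j]) for i in range(3) for j in range(3))


def jaumann_rate(A, A_rate, L):
    """A_rate - Omega . A + A . Omega, with Omega the antisymmetric part of L."""
    Om = comb((0.5, L), (-0.5, transpose(L)))
    return comb((1.0, A_rate), (-1.0, matmul(Om, A)), (1.0, matmul(A, Om)))


def oldroyd_rate(A, A_rate, L):
    """A_rate - L . A - A . L^T."""
    return comb((1.0, A_rate), (-1.0, matmul(L, A)),
                (-1.0, matmul(A, transpose(L))))


# Hypothetical data used only for illustration.
TAU = [[100.0, 20.0, 0.0], [20.0, 50.0, -10.0], [0.0, -10.0, 30.0]]
SPIN = [[0.0, -2.0, 0.5], [2.0, 0.0, -1.0], [-0.5, 1.0, 0.0]]
# Stress carried along by a rigid spin: rate = SPIN . TAU - TAU . SPIN.
COROTATING_RATE = comb((1.0, matmul(SPIN, TAU)), (-1.0, matmul(TAU, SPIN)))
```

For the pure spin, `jaumann_rate(TAU, COROTATING_RATE, SPIN)` vanishes although the updated Lagrangian rate itself does not, which is the behavior required of a rate that enters a constitutive law.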

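Following Hill (1967), one can establish a rate potential $\mathring{V}_J$ in terms of
the strain rate tensor $\mathbf{D}$ such that the constitutive equation for the Jaumann
rate of the Kirchhoff stress tensor in the updated Lagrangian description is

(9)   $\overset{J}{\boldsymbol{\sigma}} = \dfrac{\partial\mathring{V}_J(\mathbf{D})}{\partial\mathbf{D}}$.

Once $\mathring{V}_J$ is known, the corresponding rate potentials and constitutive equations
for stress rates such as $\mathring{\mathbf{T}}$, $\mathring{\mathbf{S}}$ and $\mathring{\mathbf{r}}^*$ can be derived. Let $\mathring{U}$, $\mathring{W}$, $\mathring{Q}$ and $\mathring{P}$
denote the rate potentials for the first Piola-Kirchhoff stress rate tensor $\mathring{\mathbf{T}}$,
the second Piola-Kirchhoff stress rate tensor $\mathring{\mathbf{S}}$, the Biot-Lur’e stress rate
tensor $\mathring{\mathbf{r}}^*$ and the corotational stress rate tensor $\mathring{\boldsymbol{\sigma}}_r$, respectively. Then

(10)   $\mathring{\mathbf{T}} = \dfrac{\partial\mathring{U}(\mathbf{L})}{\partial\mathbf{L}^T}$,   $\mathring{\mathbf{S}} = \dfrac{\partial\mathring{W}(\mathbf{D})}{\partial\mathbf{D}}$,   $\mathring{\mathbf{r}} \equiv (\mathring{\mathbf{r}}^*)_s = \dfrac{\partial\mathring{Q}(\mathbf{D})}{\partial\mathbf{D}}$,   $\mathring{\boldsymbol{\sigma}}_r = \dfrac{\partial\mathring{P}(\mathbf{D})}{\partial\mathbf{D}}$.

Then, noting that

$\mathring{\mathbf{r}} = \mathring{\mathbf{S}} + \tfrac{1}{2}(\boldsymbol{\tau}\cdot\mathbf{D} + \mathbf{D}\cdot\boldsymbol{\tau})$,   $\mathring{\boldsymbol{\sigma}}_r = \mathring{\mathbf{S}} + \mathbf{D}\cdot\boldsymbol{\tau} + \boldsymbol{\tau}\cdot\mathbf{D}$

and using Eqs. (8)-(10), we can show that the rate potential is

(11)   $\mathring{W}(\mathbf{D}) = \mathring{V}_J(\mathbf{D}) - \boldsymbol{\tau} : (\mathbf{D}\cdot\mathbf{D})$.

Equations (8a), (9), and (10) give

(11a)   $\mathring{P}(\mathbf{D}) = \mathring{V}_J(\mathbf{D}) = \mathring{W}(\mathbf{D}) + \boldsymbol{\tau} : (\mathbf{D}\cdot\mathbf{D})$.

From Eqs. (7), (9) and (10), we have

(12)   $\mathring{Q}(\mathbf{D}) = \mathring{V}_J(\mathbf{D}) - \tfrac{1}{2}\boldsymbol{\tau} : (\mathbf{D}\cdot\mathbf{D}) = \mathring{W}(\mathbf{D}) + \tfrac{1}{2}\boldsymbol{\tau} : (\mathbf{D}\cdot\mathbf{D})$.

Finally, from Eqs. (6), (9) and (10), we obtain

(13)   $\mathring{U}(\mathbf{L}) = \mathring{V}_J(\mathbf{D}) - \boldsymbol{\tau} : (\mathbf{D}\cdot\mathbf{D}) + \tfrac{1}{2}\boldsymbol{\tau} : (\mathbf{L}^T\cdot\mathbf{L})$.

The relations among the rate potentials can be easily derived:

(14)   $\mathring{E}(\mathbf{v}) = \mathring{W}(\mathbf{D}) + \tfrac{1}{2}\boldsymbol{\tau} : (\mathbf{L}^T\cdot\mathbf{L}) = \mathring{Q}(\mathbf{D}) + \tfrac{1}{2}\boldsymbol{\tau} : (\mathbf{L}^T\cdot\mathbf{L}) - \tfrac{1}{2}\boldsymbol{\tau} : (\mathbf{D}\cdot\mathbf{D})$
        $= \mathring{P}(\mathbf{D}) + \tfrac{1}{2}\boldsymbol{\tau} : (\mathbf{L}^T\cdot\mathbf{L}) - \boldsymbol{\tau} : (\mathbf{D}\cdot\mathbf{D}) = \mathring{U}(\mathbf{L})$.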


Depending on applications, one expresses L and D in terms of displacement 
increments, 17, etc. 

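The corresponding constitutive law for the Oldroyd rate of the Kirchhoff
stress tensor $\boldsymbol{\sigma}$ is:

(15)   $\overset{\nabla}{\boldsymbol{\sigma}} = \dfrac{\partial\mathring{V}_O(\mathbf{D})}{\partial\mathbf{D}}$,

where the Oldroyd rate potential $\mathring{V}_O$ is related to the Jaumann rate potential
$\mathring{V}_J$ by

(16)   $\mathring{V}_O(\mathbf{D}) = \mathring{V}_J(\mathbf{D}) - \boldsymbol{\tau} : (\mathbf{D}\cdot\mathbf{D})$,

which follows from Eq. (5) and is consistent with Eqs. (11) and (17). We can
then express the rate potentials $\mathring{U}$, $\mathring{W}$, $\mathring{Q}$ and $\mathring{P}$ in terms of $\mathring{V}_O$ by the
following equations

(17) ▲   $\mathring{W}(\mathbf{D}) = \mathring{V}_O(\mathbf{D})$,

(18) ▲   $\mathring{Q}(\mathbf{D}) = \mathring{V}_O(\mathbf{D}) + \tfrac{1}{2}\boldsymbol{\tau} : (\mathbf{D}\cdot\mathbf{D})$,

(18a) ▲   $\mathring{P}(\mathbf{D}) = \mathring{V}_O(\mathbf{D}) + \boldsymbol{\tau} : (\mathbf{D}\cdot\mathbf{D})$,

(19) ▲   $\mathring{U}(\mathbf{L}) = \mathring{V}_O(\mathbf{D}) + \tfrac{1}{2}\boldsymbol{\tau} : (\mathbf{L}^T\cdot\mathbf{L})$.

Note that the Oldroyd rate potential $\mathring{V}_O$ is the rate potential $\mathring{W}$ for the rate
of the second Piola-Kirchhoff stress in the updated Lagrangian description.
The equations above are similar to Eqs. (11)-(13). Substituting Eq. (15)
into Eqs. (6)-(8), one can express the updated Lagrangian stress rates in
terms of the Oldroyd rate of the Kirchhoff stress

(20)   $\mathring{\mathbf{T}} = \overset{\nabla}{\boldsymbol{\sigma}} + \boldsymbol{\tau}\cdot\mathbf{L}^T$,

(21)   $\mathring{\mathbf{r}}^* = \overset{\nabla}{\boldsymbol{\sigma}} + \boldsymbol{\tau}\cdot\mathbf{D}$,

(21a)   $\mathring{\boldsymbol{\sigma}}_r = \overset{\nabla}{\boldsymbol{\sigma}} + \mathbf{D}\cdot\boldsymbol{\tau} + \boldsymbol{\tau}\cdot\mathbf{D}$,

(22)   $\mathring{\mathbf{S}} = \overset{\nabla}{\boldsymbol{\sigma}}$.

Thus the Oldroyd rate of the Kirchhoff stress is actually the rate of the
second Piola-Kirchhoff stress in the updated Lagrangian description.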

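For hyperelastic materials, there exists a strain energy function $W_0(\mathbf{C})$
with its second variation defined as $\dfrac{\partial^2 W_0}{\partial C_{IJ}\,\partial C_{KL}}\,\delta C_{IJ}\,\delta C_{KL}$. One can show
that the rate potential $\mathring{W}$ for the second Piola-Kirchhoff stress rate is derivable
from the second variation of $W_0(\mathbf{C})$. From Eqs. (16.11:7) and (16.11:9),
we have

$\mathbf{S} = \dfrac{\partial W_0}{\partial\mathbf{E}} = 2\,\dfrac{\partial W_0}{\partial\mathbf{C}} = 2\,\dfrac{\partial W_0}{\partial C_{IJ}}\,\boldsymbol{\varepsilon}_I\boldsymbol{\varepsilon}_J$,

where $\mathbf{S}$ is the second Piola-Kirchhoff stress referred to the undeformed
coordinates. Then its material rate in the total Lagrangian description is

$\dfrac{D\mathbf{S}}{Dt} = 2\,\dfrac{D}{Dt}\left(\dfrac{\partial W_0}{\partial\mathbf{C}}\right) = 2\,\dfrac{\partial^2 W_0}{\partial\mathbf{C}\,\partial\mathbf{C}} : \dfrac{D\mathbf{C}}{Dt} = 4\,\dfrac{\partial^2 W_0}{\partial\mathbf{C}\,\partial\mathbf{C}} : (\mathbf{F}^T\cdot\mathbf{D}\cdot\mathbf{F})$,

or, in indicial notation,

$\dfrac{D\mathbf{S}}{Dt} = 4\,\dfrac{\partial^2 W_0}{\partial C_{IJ}\,\partial C_{KL}}\,\dfrac{\partial x_k}{\partial X_K}\,\dfrac{\partial x_l}{\partial X_L}\,D_{kl}\,\boldsymbol{\varepsilon}_I\boldsymbol{\varepsilon}_J$,

where $x_i$ and $X_I$ are the deformed and the undeformed coordinates referring
to the $D_N$ and $D_0$ configurations, respectively. From Eq. (17.3:16), one has

(23) ▲   $\mathring{\mathbf{S}} = \mathbf{F}\cdot\dfrac{D\mathbf{S}}{Dt}\cdot\mathbf{F}^T\,\dfrac{1}{J} = d_{ijkl}\,D_{kl}\,\mathbf{e}_i\mathbf{e}_j = \mathbf{d} : \mathbf{D}$,   or   $\mathring{S}_{ij} = d_{ijkl}\,D_{kl}$,

where $\mathbf{d} = d_{ijkl}\,\mathbf{e}_i\mathbf{e}_j\mathbf{e}_k\mathbf{e}_l$ is a fourth rank tensor with $\mathbf{e}_i$ being the unit base
vectors in the current configuration $D_N$ and

(24)   $d_{ijkl} = \dfrac{4}{J}\,\dfrac{\partial^2 W_0}{\partial C_{IJ}\,\partial C_{KL}}\,\dfrac{\partial x_i}{\partial X_I}\,\dfrac{\partial x_j}{\partial X_J}\,\dfrac{\partial x_k}{\partial X_K}\,\dfrac{\partial x_l}{\partial X_L}$,

which have the same symmetric properties as those of the elastic constants
for anisotropic materials discussed in Sec. 6.1. Note that $\mathring{\mathbf{S}}$ and $\mathbf{D}$ are
objective. From Eq. (23), the rate potential can be written as

(25)   $\mathring{W}(\mathbf{D}) = d_{ijkl}\,D_{ij}\,D_{kl}/2$,

which is a quadratic function of $D_{ij}$ related to the second variation of $W_0$
(Pian and Tong 1971). With $\boldsymbol{\varepsilon}_I$, $\mathbf{e}_i$ being unit base vectors, $\partial^2 W_0/\partial C_{IJ}\,\partial C_{KL}$ are
the physical components of a fourth rank tensor referred to the undeformed
coordinates whereas the $d_{ijkl}$ are the components of the corresponding tensor
referred to the deformed coordinates. If $\boldsymbol{\varepsilon}_I$, $\mathbf{e}_i$ are contravariant base
vectors, then $d_{ijkl}$ are the corresponding contravariant components of the
tensor $\dfrac{4}{J}\dfrac{\partial^2 W_0}{\partial C_{IJ}\,\partial C_{KL}}$ in the deformed coordinates. In this case, we generally
denote $\boldsymbol{\varepsilon}_I$, $\mathbf{e}_i$, $d_{ijkl}$ by $\mathbf{G}^I$, $\mathbf{g}^i$, $d^{ijkl}$, respectively.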
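If the hyperelastic material is isotropic with $W_0(\mathbf{C}) = W_0(I_1, I_2, I_3)$,
we have

$\dfrac{\partial^2 W_0}{\partial C_{IJ}\,\partial C_{KL}}\,\boldsymbol{\varepsilon}_I\boldsymbol{\varepsilon}_J\boldsymbol{\varepsilon}_K\boldsymbol{\varepsilon}_L = \dfrac{\partial W_0}{\partial I_2}\,(\mathbf{I}\mathbf{I} - \mathbf{H}) + \dfrac{\partial W_0}{\partial I_3}\,\mathbf{A} + \mathbf{G}$,

where $\mathbf{H}$, $\mathbf{A}$, $\mathbf{G}$ are fourth rank tensors defined as

$\mathbf{H} = \delta_{IK}\,\delta_{JL}\,\boldsymbol{\varepsilon}_I\boldsymbol{\varepsilon}_J\boldsymbol{\varepsilon}_K\boldsymbol{\varepsilon}_L$,

$\mathbf{A} = e_{IKN}\,e_{JLM}\,C_{MN}\,\boldsymbol{\varepsilon}_I\boldsymbol{\varepsilon}_J\boldsymbol{\varepsilon}_K\boldsymbol{\varepsilon}_L$,

$\mathbf{G} = \dfrac{\partial^2 W_0}{\partial I_1^2}\,\mathbf{I}\mathbf{I} + \dfrac{\partial^2 W_0}{\partial I_2^2}\,(I_1\mathbf{I} - \mathbf{C})(I_1\mathbf{I} - \mathbf{C}) + \dfrac{\partial^2 W_0}{\partial I_3^2}\,I_3^2\,\mathbf{C}^{-1}\mathbf{C}^{-1}$
   $+ \dfrac{\partial^2 W_0}{\partial I_1\,\partial I_2}\,(2I_1\,\mathbf{I}\mathbf{I} - \mathbf{I}\mathbf{C} - \mathbf{C}\mathbf{I}) + \dfrac{\partial^2 W_0}{\partial I_1\,\partial I_3}\,I_3\,(\mathbf{I}\mathbf{C}^{-1} + \mathbf{C}^{-1}\mathbf{I})$
   $+ \dfrac{\partial^2 W_0}{\partial I_2\,\partial I_3}\,I_3\,[\mathbf{C}^{-1}(I_1\mathbf{I} - \mathbf{C}) + (I_1\mathbf{I} - \mathbf{C})\mathbf{C}^{-1}]$,

where $e_{IKN}$ is the permutation tensor. The derivation is left to the readers.
Note that $\mathbf{I}(= \delta_{IJ}\,\boldsymbol{\varepsilon}_I\boldsymbol{\varepsilon}_J)$ is the identity tensor. Both $\mathbf{I}\mathbf{I}(= \delta_{IJ}\,\delta_{KL}\,\boldsymbol{\varepsilon}_I\boldsymbol{\varepsilon}_J\boldsymbol{\varepsilon}_K\boldsymbol{\varepsilon}_L)$
and $\mathbf{I}\mathbf{C}[= (\mathbf{C}\mathbf{I})^T]$ are tensors of rank 4.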

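A substitution of Eq. (25) into Eqs. (11), (12) and (14) yields the rate
potentials for other stress measures. The substitution of Eq. (25) into
Eqs. (16), (18) and (19) yields the rate potentials for other stress measures
related to the Oldroyd rate [Eq. (15)]. From Eqs. (6)-(8) or (20)-(22), the
constitutive laws can be written as

(26)   $\mathring{\mathbf{S}} = \mathbf{d} : \mathbf{D}$,   $\mathring{\mathbf{T}} = \mathbf{d} : \mathbf{D} + \boldsymbol{\tau}\cdot\mathbf{L}^T$,
       $\mathring{\boldsymbol{\sigma}} = \mathbf{d} : \mathbf{D} + \mathbf{L}\cdot\boldsymbol{\tau} + \boldsymbol{\tau}\cdot\mathbf{L}^T$,   $\mathring{\boldsymbol{\tau}} = \mathbf{d} : \mathbf{D} + \mathbf{L}\cdot\boldsymbol{\tau} + \boldsymbol{\tau}\cdot\mathbf{L}^T - \mathrm{tr}(\mathbf{L})\,\boldsymbol{\tau}$,
       $\mathring{\mathbf{r}}^* = \mathbf{d} : \mathbf{D} + \boldsymbol{\tau}\cdot\mathbf{D}$,   $\mathring{\boldsymbol{\sigma}}_r = \mathbf{d} : \mathbf{D} + \mathbf{D}\cdot\boldsymbol{\tau} + \boldsymbol{\tau}\cdot\mathbf{D}$.

One sees that the constitutive laws consist of two parts. The first part relates
to the energy density and is represented by the first term on the right hand side
of the equations above. The second part includes the remaining terms of
the equations, which involve the interaction between the current Cauchy stress
tensor $\boldsymbol{\tau}$ and the incremental change of geometry.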
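The two-part structure of the constitutive laws can be seen in a short numerical sketch. The plain-Python code below assumes, purely for illustration, an isotropic tangent with two hypothetical constants `lam` and `mu`, so that the energy-related part is `lam * tr(D) * I + 2 * mu * D`. This simple tangent is an assumption made here and is not the general expression of Eq. (24); the stress-dependent terms follow Eq. (26).

```python
I3 = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]


def matmul(a, b):
    """Product a . b of two 3x3 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def transpose(a):
    return [[a[j][i] for j in range(3)] for i in range(3)]


def comb(*terms):
    """Linear combination sum(c * M) over (c, M) pairs of 3x3 matrices."""
    return [[sum(c * m[i][j] for c, m in terms) for j in range(3)]
            for i in range(3)]


def trace(a):
    return a[0][0] + a[1][1] + a[2][2]


def material_part(D, lam, mu):
    """Energy-related part d : D for the assumed isotropic tangent."""
    return comb((lam * trace(D), I3), (2.0 * mu, D))


def constitutive_rates(tau, L, lam, mu):
    """Return rates of S, T and tau as material part plus geometric part."""
    Lt = transpose(L)
    D = comb((0.5, L), (0.5, Lt))
    dD = material_part(D, lam, mu)
    tau_Lt = matmul(tau, Lt)
    L_tau = matmul(L, tau)
    S_rate = dD
    T_rate = comb((1.0, dD), (1.0, tau_Lt))
    tau_rate = comb((1.0, dD), (1.0, L_tau), (1.0, tau_Lt), (-trace(L), tau))
    return S_rate, T_rate, tau_rate


# Hypothetical data used only for illustration.
TAU = [[100.0, 20.0, 0.0], [20.0, 50.0, -10.0], [0.0, -10.0, 30.0]]
L_EXAMPLE = [[0.10, 0.30, 0.00], [-0.20, 0.05, 0.10], [0.00, 0.40, -0.02]]
S_RATE, T_RATE, TAU_RATE = constitutive_rates(TAU, L_EXAMPLE, lam=1000.0, mu=500.0)
```

Setting `TAU` to zero removes every geometric term, and all three rates then reduce to the same material part.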


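Problem 17.2. Show that

$\dfrac{\partial\mathbf{C}^{-1}}{\partial\mathbf{C}} = -\mathbf{C}^{-1}\mathbf{C}^{-1} + I_3^{-1}\,e_{IKN}\,e_{JLM}\,C_{MN}\,\boldsymbol{\varepsilon}_I\boldsymbol{\varepsilon}_J\boldsymbol{\varepsilon}_K\boldsymbol{\varepsilon}_L$.

(Hint: Define $\mathbf{C}' = I_3\,\mathbf{C}^{-1}$ and show that $C'_{IJ} = e_{IKN}\,e_{JLM}\,C_{KL}\,C_{NM}/2$.)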


17.6. INCREMENTAL VARIATIONAL PRINCIPLES IN 
TERMS OF T 

We shall construct variational functionals involving stress and deformation
rates in such a way that the conditions for the vanishing of the variation
of the functional lead to the Euler equations for the rate variables, which
are exactly the same as the field equations for these rate variables in the 
continuum. Thus, the solution of the field equations of the boundary-value 
problem corresponds to the stationary condition of the functional and vice 
versa. Pian and Tong (1971) introduced the second variation of the original 
functional as an incremental functional for finite deformation analyses. One 
can often derive different variational principles from a given functional (Pian
and Tong 1969) by selectively relaxing constraints on the field variables 
of the functional (see Sec. 10.5). One can also derive different variational 
principles by enforcing selected constraints, a priori , and thus, reducing 
the number of independent field variables. The derivation in the next two 
sections follows mostly those given by Atluri (1979, 1980a) and Atluri and 
Cazzani (1995). 

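A Four-Field Principle. Atluri (1979, 1980a) constructed the following
variational principle, which involves four fields: the displacement rate
vector $\mathbf{v}$, the spin rate tensor $\boldsymbol{\Omega}$, the stretch rate tensor $\mathbf{D}$, and the material
rate of the first Piola-Kirchhoff stress tensor $\mathring{\mathbf{T}}$ in the updated Lagrangian
description

(1)   $\Pi_1(\mathbf{v}, \boldsymbol{\Omega}, \mathbf{D}, \mathring{\mathbf{T}}) = \displaystyle\int_{V_N}\Big\{\mathring{Q}(\mathbf{D}) + \tfrac{1}{2}\boldsymbol{\tau} : (\boldsymbol{\Omega}^T\cdot\boldsymbol{\Omega}) + \boldsymbol{\tau} : (\boldsymbol{\Omega}^T\cdot\mathbf{D})$
         $+ \mathring{\mathbf{T}}^T : (\nabla_N\mathbf{v} - \boldsymbol{\Omega} - \mathbf{D}) - \rho_N\,\mathring{\mathbf{b}}\cdot\mathbf{v}\Big\}\,dV$
         $- \displaystyle\int_{S_{uN}} \mathbf{n}\cdot\mathring{\mathbf{T}}\cdot(\mathbf{v} - \bar{\mathbf{v}})\,dA - \int_{S_{\sigma N}} \bar{\mathring{\mathbf{T}}}\cdot\mathbf{v}\,dA$,

where $\bar{\mathring{\mathbf{T}}}$, $\bar{\mathbf{v}}$ are the prescribed traction and displacement rates on the
boundaries, $V_N$ is the volume of the body in the reference configuration
$D_N$, $\mathbf{n}$ is the unit outward normal to the surface $S_N$, and $S_{\sigma N}$, $S_{uN}$ are
the portions of $S_N$ where the traction and displacement increments are prescribed,
respectively. The requirements for the field variables are: $\mathbf{v}$ is continuous,
$\boldsymbol{\Omega}$ skew-symmetric and $\mathbf{D}$ symmetric. The last two requirements
are called the rigid constraint, which must be enforced in constructing a
weak form solution. In this case, $\boldsymbol{\Omega}$ has only three independent components
and $\mathbf{D}$ has six, rather than the nine of a general tensor of rank 2.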

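We shall derive the stationary condition of $\Pi_1$ for arbitrary variation
of the four field variables subjected to the rigid constraints to obtain the
appropriate field equations and natural boundary conditions. To derive the
variation of $\Pi_1$, one must take into account that

$\mathring{\mathbf{T}}^T : \delta\mathbf{D} = (\mathring{\mathbf{T}}^T)_s : \delta\mathbf{D}$,

$\mathring{\mathbf{T}}^T : \delta\boldsymbol{\Omega} = -\mathring{\mathbf{T}}^T : \delta\boldsymbol{\Omega}^T = -\mathring{\mathbf{T}} : \delta\boldsymbol{\Omega}$,

for symmetric $\delta\mathbf{D}$ and skew-symmetric $\delta\boldsymbol{\Omega}$. One also needs to use the
following trace and divergence properties:

$\boldsymbol{\tau} : (\boldsymbol{\Omega}^T\cdot\delta\mathbf{D}) = \boldsymbol{\tau} : (\delta\mathbf{D}\cdot\boldsymbol{\Omega}) = -\boldsymbol{\tau} : (\delta\mathbf{D}\cdot\boldsymbol{\Omega}^T) = -(\boldsymbol{\tau}\cdot\boldsymbol{\Omega}) : \delta\mathbf{D}$,

$\boldsymbol{\tau} : (\delta\boldsymbol{\Omega}^T\cdot\mathbf{D}) = (\boldsymbol{\tau}\cdot\mathbf{D}) : \delta\boldsymbol{\Omega}^T = (\mathbf{D}\cdot\boldsymbol{\tau}) : \delta\boldsymbol{\Omega}$,

$\boldsymbol{\tau} : \delta(\boldsymbol{\Omega}^T\cdot\boldsymbol{\Omega}) = \boldsymbol{\tau} : (\delta\boldsymbol{\Omega}^T\cdot\boldsymbol{\Omega} + \boldsymbol{\Omega}^T\cdot\delta\boldsymbol{\Omega})$
   $= (\boldsymbol{\tau}\cdot\boldsymbol{\Omega}^T) : \delta\boldsymbol{\Omega}^T + (\boldsymbol{\Omega}\cdot\boldsymbol{\tau}) : \delta\boldsymbol{\Omega} = 2(\boldsymbol{\Omega}\cdot\boldsymbol{\tau}) : \delta\boldsymbol{\Omega}$,

$\displaystyle\int_{V_N} \mathring{\mathbf{T}}^T : \nabla_N\delta\mathbf{v}\,dV = \int_{S_N} \mathbf{n}\cdot\mathring{\mathbf{T}}\cdot\delta\mathbf{v}\,dA - \int_{V_N} (\nabla_N\cdot\mathring{\mathbf{T}})\cdot\delta\mathbf{v}\,dV$.

The first variation of $\Pi_1$ is

$\delta\Pi_1 = \displaystyle\int_{V_N}\Big\{\Big[\dfrac{\partial\mathring{Q}}{\partial\mathbf{D}} - (\mathring{\mathbf{T}} + \boldsymbol{\tau}\cdot\boldsymbol{\Omega})_s\Big] : \delta\mathbf{D} + \delta\mathring{\mathbf{T}}^T : (\nabla_N\mathbf{v} - \boldsymbol{\Omega} - \mathbf{D})$
   $+ (\mathring{\mathbf{T}} + \mathbf{D}\cdot\boldsymbol{\tau} + \boldsymbol{\Omega}\cdot\boldsymbol{\tau}) : \delta\boldsymbol{\Omega} - (\nabla_N\cdot\mathring{\mathbf{T}} + \rho_N\,\mathring{\mathbf{b}})\cdot\delta\mathbf{v}\Big\}\,dV$
   $- \displaystyle\int_{S_{\sigma N}} (\bar{\mathring{\mathbf{T}}} - \mathbf{n}\cdot\mathring{\mathbf{T}})\cdot\delta\mathbf{v}\,dA - \int_{S_{uN}} \mathbf{n}\cdot\delta\mathring{\mathbf{T}}\cdot(\mathbf{v} - \bar{\mathbf{v}})\,dA$,

with the rigid constraints $(\delta\mathbf{D})_a = \mathbf{0}$ and $(\delta\boldsymbol{\Omega})_s = \mathbf{0}$. The stationary condition
of $\Pi_1$ gives the following incremental field equations,

(2)   $\dfrac{\partial\mathring{Q}}{\partial\mathbf{D}} = (\mathring{\mathbf{T}} + \boldsymbol{\tau}\cdot\boldsymbol{\Omega})_s$   (constitutive law in $V_N$),

(3)   $\nabla_N\mathbf{v} = \mathbf{D} + \boldsymbol{\Omega}$   (compatibility condition in $V_N$),

(4)   $(\mathring{\mathbf{T}} + \mathbf{D}\cdot\boldsymbol{\tau} + \boldsymbol{\Omega}\cdot\boldsymbol{\tau})_a = \mathbf{0}$   (angular momentum balance in $V_N$),

(5)   $\nabla_N\cdot\mathring{\mathbf{T}} + \rho_N\,\mathring{\mathbf{b}} = \mathbf{0}$   (linear momentum balance in $V_N$),

and the following natural boundary conditions,

(6)   $\mathbf{n}\cdot\mathring{\mathbf{T}} = \bar{\mathring{\mathbf{T}}}$   (traction boundary condition on $S_{\sigma N}$),

(7)   $\mathbf{v} = \bar{\mathbf{v}}$   (displacement boundary condition on $S_{uN}$).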

One can establish different variational principles by imposing a priori 
conditions to reduce the number of field variables in Eq. (1). One can also 
derive additional variational principles by using the Lagrangian multiplier 
to introduce new field variables. We will discuss several of such principles 
in the following subsections. 


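A Three-Field Principle. A variational principle with a functional involving
three fields $\mathbf{v}$, $\boldsymbol{\Omega}$ and $\mathring{\mathbf{T}}$ can be obtained by imposing

(8)   $\mathbf{D} = (\nabla_N\mathbf{v})_s$

a priori in Eq. (1) to give

(9)   $\Pi_2(\mathbf{v}, \boldsymbol{\Omega}, \mathring{\mathbf{T}}) = \displaystyle\int_{V_N}\Big\{\mathring{Q}[(\nabla_N\mathbf{v})_s] + \tfrac{1}{2}\boldsymbol{\tau} : (\boldsymbol{\Omega}^T\cdot\boldsymbol{\Omega}) + \boldsymbol{\tau} : [\boldsymbol{\Omega}^T\cdot(\nabla_N\mathbf{v})_s]$
         $+ \mathring{\mathbf{T}}^T : [(\nabla_N\mathbf{v})_a - \boldsymbol{\Omega}] - \rho_N\,\mathring{\mathbf{b}}\cdot\mathbf{v}\Big\}\,dV$
         $- \displaystyle\int_{S_{uN}} (\mathbf{n}\cdot\mathring{\mathbf{T}})\cdot(\mathbf{v} - \bar{\mathbf{v}})\,dA - \int_{S_{\sigma N}} \bar{\mathring{\mathbf{T}}}\cdot\mathbf{v}\,dA$.

The admissibility requirements for the field variables are: $\mathbf{v}$ is continuous and
$\boldsymbol{\Omega}$ is skew-symmetric. The stationary condition of $\Pi_2$ gives the same set of
equations as Eqs. (2)-(7) except that $\mathbf{D}$ is replaced by $(\nabla_N\mathbf{v})_s$ in Eqs. (2)
and (4) and that Eq. (3) becomes

(10)   $(\nabla_N\mathbf{v})_a = \boldsymbol{\Omega}$.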


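A Purely Kinematic Principle. A variational principle with a functional
involving $\mathbf{v}$ only can be derived by requiring, a priori,

(11)   $\nabla_N\mathbf{v} = \mathbf{D} + \boldsymbol{\Omega} = \mathbf{L}$,   $\mathbf{D} = (\nabla_N\mathbf{v})_s$,   $\boldsymbol{\Omega} = (\nabla_N\mathbf{v})_a$

and $\mathbf{v} = \bar{\mathbf{v}}$ on $S_{uN}$. Equation (1) reduces to

(12)   $\Pi_3(\mathbf{v}) = \displaystyle\int_{V_N}\{\mathring{E}(\mathbf{v}) - \rho_N\,\mathring{\mathbf{b}}\cdot\mathbf{v}\}\,dV - \int_{S_{\sigma N}} \bar{\mathring{\mathbf{T}}}\cdot\mathbf{v}\,dA$,

where $\mathring{E}$ is given in Eq. (17.5:14). The specific expression depends on the
choice of rate potential. In Eq. (12), $\mathbf{v}$ should be continuous and equal to $\bar{\mathbf{v}}$
on $S_{uN}$.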
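The reduction of the four-field integrand to the purely kinematic one rests on an algebraic identity between the stress-dependent terms. The plain-Python sketch below checks it numerically for one sample state. The function names and data are hypothetical, and the identity requires a symmetric Cauchy stress.

```python
def matmul(a, b):
    """Product a . b of two 3x3 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def transpose(a):
    return [[a[j][i] for j in range(3)] for i in range(3)]


def comb(*terms):
    """Linear combination sum(c * M) over (c, M) pairs of 3x3 matrices."""
    return [[sum(c * m[i][j] for c, m in terms) for j in range(3)]
            for i in range(3)]


def ddot(a, b):
    """Double contraction a : b = sum over i, j of a_ij * b_ij."""
    return sum(a[i][j] * b[i][j] for i in range(3) for j in range(3))


def four_field_terms(tau, L):
    """0.5 tau : (Om^T . Om) + tau : (Om^T . D), as in the four-field functional."""
    Lt = transpose(L)
    D = comb((0.5, L), (0.5, Lt))
    Om = comb((0.5, L), (-0.5, Lt))
    Om_t = transpose(Om)
    return 0.5 * ddot(tau, matmul(Om_t, Om)) + ddot(tau, matmul(Om_t, D))


def kinematic_terms(tau, L):
    """0.5 tau : (L^T . L) - 0.5 tau : (D . D), the terms added to Q(D) in E(v)."""
    Lt = transpose(L)
    D = comb((0.5, L), (0.5, Lt))
    return 0.5 * ddot(tau, matmul(Lt, L)) - 0.5 * ddot(tau, matmul(D, D))


# Hypothetical data used only for illustration (TAU must be symmetric).
TAU = [[100.0, 20.0, 0.0], [20.0, 50.0, -10.0], [0.0, -10.0, 30.0]]
L_EXAMPLE = [[0.10, 0.30, 0.00], [-0.20, 0.05, 0.10], [0.00, 0.40, -0.02]]
```

Both functions return the same number for a symmetric stress, so once the compatibility condition is imposed a priori the four-field integrand collapses to the kinematic one expressed through the rate potential of the Biot-Lur’e stress.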

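If $\mathring{E}$ is in terms of $\mathring{W}$, Eq. (12) is the second variation of the functional
of Eq. (16.13:16) (Pian and Tong 1971) derived by using Eqs. (17.2:20)-
(17.2:22). One can show that

$\mathring{E}(\mathbf{v}) = \mathring{W}(\mathbf{D}) + \tfrac{1}{2}\boldsymbol{\tau} : [(\nabla_N\mathbf{v})^T\cdot(\nabla_N\mathbf{v})]$,

where $\mathring{W}(\mathbf{D})$ is given in Eq. (17.5:25). Equation (12) then becomes

(12a)   $\Pi_3(\mathbf{v}) = \displaystyle\int_{V_N}\Big\{\mathring{W}[(\nabla_N\mathbf{v})_s] + \tfrac{1}{2}\boldsymbol{\tau} : [(\nabla_N\mathbf{v})^T\cdot(\nabla_N\mathbf{v})] - \rho_N\,\mathring{\mathbf{b}}\cdot\mathbf{v}\Big\}\,dV - \int_{S_{\sigma N}} \bar{\mathring{\mathbf{T}}}\cdot\mathbf{v}\,dA$.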

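One can introduce $\boldsymbol{\Omega}$ as an additional independent field variable by
modifying $\Pi_3$ as

(13)   $\Pi_3^*(\mathbf{v}, \boldsymbol{\Omega}) = \displaystyle\int_{V_N}\Big\{\mathring{W}[(\nabla_N\mathbf{v})_s] + \tfrac{1}{2}\boldsymbol{\tau} : [(\nabla_N\mathbf{v})^T\cdot(\nabla_N\mathbf{v})]$
          $+ \dfrac{c}{2}\,[(\nabla_N\mathbf{v})_a - \boldsymbol{\Omega}]^2 - \rho_N\,\mathring{\mathbf{b}}\cdot\mathbf{v}\Big\}\,dV - \displaystyle\int_{S_{\sigma N}} \bar{\mathring{\mathbf{T}}}\cdot\mathbf{v}\,dA$,

where $c$ is a given positive constant, and

$[(\nabla_N\mathbf{v})_a - \boldsymbol{\Omega}]^2 = [(\nabla_N\mathbf{v})_a - \boldsymbol{\Omega}] : [(\nabla_N\mathbf{v})_a - \boldsymbol{\Omega}]$.

The admissibility requirements for the field variables are: $\mathbf{v}$ is continuous
and equals $\bar{\mathbf{v}}$ on $S_{uN}$, and $\boldsymbol{\Omega}$ is skew-symmetric. Equation (13) is equivalent
to the penalty function approach. One usually chooses $c$ to be of the order
of the elastic modulus of the material.

A Complementary Three-Field Principle. We shall derive a complementary
principle with a functional involving three fields $\mathbf{v}$, $\boldsymbol{\Omega}$ and $\mathring{\mathbf{T}}$,
which is a variational principle based on the complementary strain energy
density (a function of stresses). The concepts of complementary work and
complementary strain energy are explained in Sec. 10.9. We shall use the
same terminology and call the variational principle complementary when it is
based on a rate potential that is a function of a stress-rate tensor. The gradient
of the rate potential with respect to the stress-rate is a deformation-rate
tensor. We introduce the contact transformation

(14)   $-\mathring{Q}_c[(\mathring{\mathbf{T}} + \boldsymbol{\tau}\cdot\boldsymbol{\Omega})_s] = \mathring{Q}(\mathbf{D}) + \boldsymbol{\tau} : (\boldsymbol{\Omega}^T\cdot\mathbf{D}) - (\mathring{\mathbf{T}}^T)_s : \mathbf{D}$,

to eliminate the explicit dependence on $\mathbf{D}$ in Eq. (1). Now $\mathring{Q}_c$ is a function
of a stress rate $(\mathring{\mathbf{T}} + \boldsymbol{\tau}\cdot\boldsymbol{\Omega})_s$ and is called the complementary rate potential.
We first show that the gradient of $\mathring{Q}_c$ with respect to $(\mathring{\mathbf{T}} + \boldsymbol{\tau}\cdot\boldsymbol{\Omega})_s$ is the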
J Js„ N 
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stretch rate tensor D. Consider the variation of Q 

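$$\delta\mathring{Q}_c = -\Big[\frac{\partial\mathring{Q}(\mathbf{D})}{\partial\mathbf{D}} - (\boldsymbol{\tau}\cdot\boldsymbol{\Omega})_s\Big]:\delta\mathbf{D} - \boldsymbol{\tau}:(\delta\boldsymbol{\Omega}^T\cdot\mathbf{D}) + (\mathring{\mathbf{T}}^T)_s:\delta\mathbf{D} + (\delta\mathring{\mathbf{T}}^T)_s:\mathbf{D} .$$

Taking into account that

$$\frac{\partial\mathring{Q}}{\partial\mathbf{D}} - (\mathring{\mathbf{T}}^T + \boldsymbol{\tau}\cdot\boldsymbol{\Omega})_s = 0 , \qquad \boldsymbol{\tau}:(\delta\boldsymbol{\Omega}^T\cdot\mathbf{D}) = (\mathbf{D}\cdot\boldsymbol{\tau}):\delta\boldsymbol{\Omega} ,$$

one finds

$$\delta\mathring{Q}_c = \mathbf{D}:(\delta\mathring{\mathbf{T}}^T)_s + \mathbf{D}:(\boldsymbol{\tau}\cdot\delta\boldsymbol{\Omega})_s = \mathbf{D}:\delta(\mathring{\mathbf{T}} + \boldsymbol{\tau}\cdot\boldsymbol{\Omega})_s .$$

In other words, $\mathring{Q}_c$ is a function of $(\mathring{\mathbf{T}} + \boldsymbol{\tau}\cdot\boldsymbol{\Omega})_s$ only and

$$\frac{\partial\mathring{Q}_c}{\partial(\mathring{\mathbf{T}} + \boldsymbol{\tau}\cdot\boldsymbol{\Omega})_s} = \mathbf{D} . \tag{15}$$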

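A substitution of Eq. (14) into Eq. (1) yields

$$\begin{aligned}
\Pi_4(\mathbf{v},\boldsymbol{\Omega},\mathring{\mathbf{T}}) = {} & \int_{V_N}\Big\{-\mathring{Q}_c[(\mathring{\mathbf{T}} + \boldsymbol{\tau}\cdot\boldsymbol{\Omega})_s] + \tfrac{1}{2}\boldsymbol{\tau}:(\boldsymbol{\Omega}^T\cdot\boldsymbol{\Omega}) \\
& \qquad - \rho_N\mathring{\mathbf{b}}\cdot\mathbf{v} + \mathring{\mathbf{T}}^T:(\nabla_N\mathbf{v} - \boldsymbol{\Omega})\Big\}\,dV \\
& - \int_{S_{uN}}\mathbf{n}\cdot\mathring{\mathbf{T}}\cdot(\mathbf{v}-\bar{\mathbf{v}})\,dA - \int_{S_{\sigma N}}\mathring{\bar{\mathbf{T}}}\cdot\mathbf{v}\,dA .
\end{aligned} \tag{16}$$

The admissibility requirements for the field variables are: $\mathbf{v}$ is continuous, $\mathring{\mathbf{T}}$ is piecewise continuous and $\boldsymbol{\Omega}$ is skew-symmetric. It can be shown that the field equations associated with $\Pi_4$ are

$$\nabla_N\mathbf{v} = \frac{\partial\mathring{Q}_c}{\partial(\mathring{\mathbf{T}} + \boldsymbol{\tau}\cdot\boldsymbol{\Omega})_s} + \boldsymbol{\Omega} \quad \text{(compatibility condition)}, \tag{17}$$

$$\Big[\mathring{\mathbf{T}} + \frac{\partial\mathring{Q}_c}{\partial(\mathring{\mathbf{T}} + \boldsymbol{\tau}\cdot\boldsymbol{\Omega})_s}\cdot\boldsymbol{\tau} + \boldsymbol{\Omega}\cdot\boldsymbol{\tau}\Big]_a = 0 \quad \text{(angular momentum balance)}, \tag{18}$$

and the equilibrium equation Eq. (5) in $V_N$. The natural boundary conditions are Eqs. (6) and (7).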

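One can independently enforce the condition Eq. (18) also by adding a term $c\,[\mathring{\mathbf{T}} + (\nabla_N\mathbf{v})\cdot\boldsymbol{\tau}]_a^2/2$ to the volume integral in Eq. (16). The new functional is

$$\begin{aligned}
\Pi_4^*(\mathbf{v},\boldsymbol{\Omega},\mathring{\mathbf{T}}) = {} & \int_{V_N}\Big\{-\mathring{Q}_c[(\mathring{\mathbf{T}} + \boldsymbol{\tau}\cdot\boldsymbol{\Omega})_s] + \tfrac{1}{2}\boldsymbol{\tau}:(\boldsymbol{\Omega}^T\cdot\boldsymbol{\Omega}) + \mathring{\mathbf{T}}^T:(\nabla_N\mathbf{v} - \boldsymbol{\Omega}) \\
& \qquad + \tfrac{c}{2}[\mathring{\mathbf{T}} + (\nabla_N\mathbf{v})\cdot\boldsymbol{\tau}]_a^2 - \rho_N\mathring{\mathbf{b}}\cdot\mathbf{v}\Big\}\,dV \\
& - \int_{S_{uN}}\mathbf{n}\cdot\mathring{\mathbf{T}}\cdot(\mathbf{v}-\bar{\mathbf{v}})\,dA - \int_{S_{\sigma N}}\mathring{\bar{\mathbf{T}}}\cdot\mathbf{v}\,dA ,
\end{aligned} \tag{19}$$

where $c$ is a positive constant of the penalty function approach. In Eq. (19), $c$ is of order $1/|\boldsymbol{\tau}^N|$ or larger.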

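17.7. INCREMENTAL VARIATIONAL PRINCIPLES IN TERMS OF $\mathring{\boldsymbol{\tau}}^{*}$

One can derive incremental variational principles in terms of $\mathring{\boldsymbol{\tau}}^{*}$ simply by substituting

$$\mathring{\mathbf{T}} = \mathring{\boldsymbol{\tau}}^{*} - \boldsymbol{\tau}\cdot\boldsymbol{\Omega}$$

into Eq. (17.6:1) and using the following properties:

$$\begin{aligned}
-(\boldsymbol{\Omega}^T\cdot\boldsymbol{\tau}):(\nabla_N\mathbf{v} - \boldsymbol{\Omega} - \mathbf{D}) & = (\boldsymbol{\Omega}\cdot\boldsymbol{\tau}):(\nabla_N\mathbf{v} - \boldsymbol{\Omega} - \mathbf{D}) \\
& = \boldsymbol{\tau}:(\boldsymbol{\Omega}^T\cdot\nabla_N\mathbf{v}) - \boldsymbol{\tau}:(\boldsymbol{\Omega}^T\cdot\boldsymbol{\Omega}) - \boldsymbol{\tau}:(\boldsymbol{\Omega}^T\cdot\mathbf{D}) .
\end{aligned}$$

One finds

$$\begin{aligned}
\Pi_5(\mathbf{v},\boldsymbol{\Omega},\mathbf{D},\mathring{\boldsymbol{\tau}}^{*}) = {} & \int_{V_N}\Big\{\mathring{Q}(\mathbf{D}) - \tfrac{1}{2}\boldsymbol{\tau}:(\boldsymbol{\Omega}^T\cdot\boldsymbol{\Omega}) + \boldsymbol{\tau}:(\boldsymbol{\Omega}^T\cdot\nabla_N\mathbf{v}) \\
& \qquad + \mathring{\boldsymbol{\tau}}^{*T}:(\nabla_N\mathbf{v} - \boldsymbol{\Omega} - \mathbf{D}) - \rho_N\mathring{\mathbf{b}}\cdot\mathbf{v}\Big\}\,dV \\
& - \int_{S_{uN}}\mathbf{n}\cdot(\mathring{\boldsymbol{\tau}}^{*T} - \boldsymbol{\tau}\cdot\boldsymbol{\Omega})\cdot(\mathbf{v}-\bar{\mathbf{v}})\,dA - \int_{S_{\sigma N}}\mathring{\bar{\mathbf{T}}}\cdot\mathbf{v}\,dA .
\end{aligned} \tag{1}$$

The admissibility requirements for the field variables are: $\mathbf{v}$ is continuous, $\boldsymbol{\Omega}$ skew-symmetric, $\mathbf{D}$ symmetric, and $\mathring{\boldsymbol{\tau}}^{*}$ piecewise continuous.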

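By requiring $\mathbf{D} = (\nabla_N\mathbf{v})_s$ a priori in Eq. (1), one obtains a three-field principle in terms of $\mathbf{v}$, $\boldsymbol{\Omega}$ and $\mathring{\boldsymbol{\tau}}^{*}$. The functional is

$$\begin{aligned}
\Pi_6(\mathbf{v},\boldsymbol{\Omega},\mathring{\boldsymbol{\tau}}^{*}) = {} & \int_{V_N}\Big\{\mathring{Q}[(\nabla_N\mathbf{v})_s] - \tfrac{1}{2}\boldsymbol{\tau}:(\boldsymbol{\Omega}^T\cdot\boldsymbol{\Omega}) + \boldsymbol{\tau}:(\boldsymbol{\Omega}^T\cdot\nabla_N\mathbf{v}) \\
& \qquad + \mathring{\boldsymbol{\tau}}^{*T}:[(\nabla_N\mathbf{v})_a - \boldsymbol{\Omega}] - \rho_N\mathring{\mathbf{b}}\cdot\mathbf{v}\Big\}\,dV \\
& - \int_{S_{uN}}\mathbf{n}\cdot(\mathring{\boldsymbol{\tau}}^{*T} - \boldsymbol{\tau}\cdot\boldsymbol{\Omega})\cdot(\mathbf{v}-\bar{\mathbf{v}})\,dA - \int_{S_{\sigma N}}\mathring{\bar{\mathbf{T}}}\cdot\mathbf{v}\,dA .
\end{aligned} \tag{2}$$

The admissibility requirements for the field variables are the same as those for the functional $\Pi_5$, except that $\mathbf{D}$ does not appear here. If we further require that $\boldsymbol{\Omega} = (\nabla_N\mathbf{v})_a$ and $\mathbf{v} = \bar{\mathbf{v}}$ on $S_{uN}$ are satisfied a priori, $\Pi_6$ reduces to $\Pi_3$ as given in Eq. (17.6:12).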

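As in the previous subsection, the use of the contact transformation

$$\mathring{Q}(\mathbf{D}) - (\mathring{\boldsymbol{\tau}}^{*})_s:\mathbf{D} = -\mathring{Q}_c[(\mathring{\boldsymbol{\tau}}^{*})_s] \tag{3}$$

in Eq. (1) leads to an alternate three-field principle with the functional

$$\begin{aligned}
\Pi_7(\mathbf{v},\boldsymbol{\Omega},\mathring{\boldsymbol{\tau}}^{*}) = {} & \int_{V_N}\Big\{-\mathring{Q}_c[(\mathring{\boldsymbol{\tau}}^{*})_s] - \tfrac{1}{2}\boldsymbol{\tau}:(\boldsymbol{\Omega}^T\cdot\boldsymbol{\Omega}) + \boldsymbol{\tau}:(\boldsymbol{\Omega}^T\cdot\nabla_N\mathbf{v}) \\
& \qquad + \mathring{\boldsymbol{\tau}}^{*T}:(\nabla_N\mathbf{v} - \boldsymbol{\Omega}) - \rho_N\mathring{\mathbf{b}}\cdot\mathbf{v}\Big\}\,dV \\
& - \int_{S_{uN}}\mathbf{n}\cdot(\mathring{\boldsymbol{\tau}}^{*T} - \boldsymbol{\tau}\cdot\boldsymbol{\Omega})\cdot(\mathbf{v}-\bar{\mathbf{v}})\,dA - \int_{S_{\sigma N}}\mathring{\bar{\mathbf{T}}}\cdot\mathbf{v}\,dA .
\end{aligned} \tag{4}$$

The admissibility requirements for the field variables are also the same as those for the functional $\Pi_5$, without the dependence on $\mathbf{D}$.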


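Problem 17.3. Show that a four-field principle involving $\mathbf{v}$, $\boldsymbol{\Omega}$, $\mathbf{D}$ and $\mathring{\mathbf{S}}$ can be written as

$$\begin{aligned}
\Pi_1(\mathbf{v},\boldsymbol{\Omega},\mathbf{D},\mathring{\mathbf{S}}) = {} & \int_{V_N}\Big\{\mathring{W}(\mathbf{D}) - \tfrac{1}{2}\boldsymbol{\tau}:[(\mathbf{D}+\boldsymbol{\Omega})^T\cdot(\mathbf{D}+\boldsymbol{\Omega})] + \boldsymbol{\tau}:[(\nabla_N\mathbf{v})^T\cdot(\mathbf{D}+\boldsymbol{\Omega})] \\
& \qquad + \mathring{\mathbf{S}}:(\nabla_N\mathbf{v} - \boldsymbol{\Omega} - \mathbf{D}) - \rho_N\mathring{\mathbf{b}}\cdot\mathbf{v}\Big\}\,dV \\
& - \int_{S_{uN}}\mathbf{n}\cdot(\mathring{\mathbf{S}} + \boldsymbol{\tau}\cdot\mathbf{L}^T)\cdot(\mathbf{v}-\bar{\mathbf{v}})\,dA - \int_{S_{\sigma N}}\mathring{\bar{\mathbf{T}}}\cdot\mathbf{v}\,dA .
\end{aligned}$$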


17.8. INCOMPRESSIBLE AND NEARLY INCOMPRESSIBLE 
MATERIALS 

For materials approaching incompressibility, the formulation in 
Secs. 17.6 and 17.7 breaks down because the equivalent bulk modulus tends 
to infinity. To circumvent the difficulty, we shall modify the variational 
principles discussed before. We first introduce the deviatoric variables 


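$$\mathbf{L}' = \mathbf{L} - \mathbf{I}\,\mathring{J}/3 , \tag{1}$$

$$\mathbf{D}' = \mathbf{D} - \mathbf{I}\,\mathring{J}/3 , \tag{2}$$

in which $\mathring{J}$ is, from Eq. (17.2:10),

$$\mathring{J} = \mathrm{tr}(\mathbf{L}) = \mathrm{tr}(\mathbf{D}) . \tag{3}$$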
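The split in Eqs. (1)-(3) can be sketched numerically as follows. This is only an illustration under assumed data: the function names `trace` and `deviatoric_split` and the sample velocity gradient are hypothetical and do not come from the text.

```python
def trace(a):
    """Trace of a 3x3 matrix stored as nested lists."""
    return a[0][0] + a[1][1] + a[2][2]


def deviatoric_split(a):
    """Return (a_dev, j_rate) such that a = a_dev + I * j_rate / 3."""
    j_rate = trace(a)
    a_dev = [[a[i][j] - (j_rate / 3.0 if i == j else 0.0) for j in range(3)]
             for i in range(3)]
    return a_dev, j_rate


# Illustrative velocity gradient (arbitrary numbers, not from the text).
L = [[0.3, 0.1, 0.0],
     [-0.2, 0.5, 0.4],
     [0.0, 0.1, -0.2]]
L_dev, J_rate = deviatoric_split(L)

# D is the symmetric part of L; it has the same trace, as Eq. (3) states.
D = [[0.5 * (L[i][j] + L[j][i]) for j in range(3)] for i in range(3)]
D_dev, J_rate_from_D = deviatoric_split(D)
```

Both `L_dev` and `D_dev` are traceless, and the volumetric rate is the same whether it is computed from `L` or from `D`.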

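Denoting the rate potentials given in Sec. 17.5 as follows:

$$\mathring{Q}^*(\mathbf{D}',\mathring{J}) = \mathring{Q}(\mathbf{D}), \qquad \mathring{W}^*(\mathbf{D}',\mathring{J}) = \mathring{W}(\mathbf{D}), \qquad \mathring{U}^*(\mathbf{L}',\mathring{J}) = \mathring{U}(\mathbf{L}), \tag{4}$$

one has the constitutive laws

$$\mathring{\mathbf{S}}' = \frac{\partial\mathring{W}^*(\mathbf{D}',\mathring{J})}{\partial\mathbf{D}'}, \qquad (\mathring{\boldsymbol{\tau}}'^{*})_s = \frac{\partial\mathring{Q}^*(\mathbf{D}',\mathring{J})}{\partial\mathbf{D}'}, \qquad \mathring{\mathbf{T}}'^{T} = \frac{\partial\mathring{U}^*(\mathbf{L}',\mathring{J})}{\partial\mathbf{L}'}, \tag{5}$$

$$-\mathring{p}_s = \frac{\partial\mathring{W}^*(\mathbf{D}',\mathring{J})}{\partial\mathring{J}} = \tfrac{1}{3}\mathrm{tr}(\mathring{\mathbf{S}}), \qquad -\mathring{p}_r = \frac{\partial\mathring{Q}^*(\mathbf{D}',\mathring{J})}{\partial\mathring{J}} = \tfrac{1}{3}\mathrm{tr}(\mathring{\boldsymbol{\tau}}^{*}), \qquad -\mathring{p}_t = \frac{\partial\mathring{U}^*(\mathbf{L}',\mathring{J})}{\partial\mathring{J}} = \tfrac{1}{3}\mathrm{tr}(\mathring{\mathbf{T}}), \tag{6}$$

which relate deformation rates to different measures of stress rates. One can show that

$$\begin{aligned}
(\mathring{\boldsymbol{\tau}}^{*})_s:\delta\mathbf{D} = \delta\mathring{Q} & = \frac{\partial\mathring{Q}^*}{\partial\mathbf{D}'}:\delta\mathbf{D}' + \frac{\partial\mathring{Q}^*}{\partial\mathring{J}}\,\delta\mathring{J} \\
& = (\mathring{\boldsymbol{\tau}}'^{*})_s:(\delta\mathbf{D} - \mathbf{I}\,\delta\mathring{J}/3) - \mathring{p}_r\,\delta\mathring{J} \\
& = (\mathring{\boldsymbol{\tau}}'^{*})_s:\delta\mathbf{D} - [(\mathring{\boldsymbol{\tau}}'^{*})_s:\mathbf{I}](\mathbf{I}:\delta\mathbf{D})/3 - \mathring{p}_r\,\mathbf{I}:\delta\mathbf{D} ,
\end{aligned}$$

which implies

$$(\mathring{\boldsymbol{\tau}}^{*})_s = (\mathring{\boldsymbol{\tau}}'^{*})_s - [(\mathring{\boldsymbol{\tau}}'^{*})_s:\mathbf{I}]\,\mathbf{I}/3 - \mathring{p}_r\,\mathbf{I} . \tag{7}$$

Similarly, one can establish that

$$\mathring{\mathbf{S}} = \mathring{\mathbf{S}}' - (\mathring{\mathbf{S}}':\mathbf{I})\,\mathbf{I}/3 - \mathring{p}_s\,\mathbf{I} , \tag{8}$$

$$\mathring{\mathbf{T}} = \mathring{\mathbf{T}}' - (\mathring{\mathbf{T}}':\mathbf{I})\,\mathbf{I}/3 - \mathring{p}_t\,\mathbf{I} . \tag{9}$$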


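The relations between various updated Lagrangian pressure rates can be obtained by differentiating Eq. (16.7:11) with respect to $t$ with all quantities referring to the current configuration in $D_N$:

$$\begin{aligned}
-\dot{p} = \dot{\boldsymbol{\tau}}:\mathbf{I}/3 & = -\mathring{p}_t + \mathrm{tr}(\mathbf{D})\,p + \boldsymbol{\tau}:\mathbf{D}/3 \\
& = -\mathring{p}_s + \mathrm{tr}(\mathbf{D})\,p + 2\boldsymbol{\tau}:\mathbf{D}/3 = -\mathring{p}_r + \mathrm{tr}(\mathbf{D})\,p + \boldsymbol{\tau}:\mathbf{D}/3 ,
\end{aligned} \tag{10}$$

where $\dot{p}$ is the hydrostatic pressure rate and $p\,[= -(\boldsymbol{\tau}:\mathbf{I})/3]$ is the hydrostatic pressure. In general, $\mathring{p}_t = \mathring{p}_r \neq \mathring{p}_s \neq \dot{p}$.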
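Equation (10) can be checked numerically. The sketch below assumes the stress-rate relations listed in Sec. 17.9 as Eqs. (13)-(15), solved for the rates of $\mathbf{T}$, $\mathbf{S}$ and $\boldsymbol{\tau}^*$; the helper names and the sample tensors are hypothetical and chosen only for illustration.

```python
def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def transpose(a):
    return [[a[j][i] for j in range(3)] for i in range(3)]


def trace(a):
    return a[0][0] + a[1][1] + a[2][2]


def ddot(a, b):
    """Double contraction a : b = a_ij b_ij."""
    return sum(a[i][j] * b[i][j] for i in range(3) for j in range(3))


def combine(*terms):
    """Linear combination sum(c * m) of 3x3 matrices given as (c, m) pairs."""
    return [[sum(c * m[i][j] for c, m in terms) for j in range(3)]
            for i in range(3)]


# Illustrative data: symmetric Cauchy stress and stress rate, arbitrary L.
tau = [[2.0, 0.5, 0.1], [0.5, -1.0, 0.3], [0.1, 0.3, 0.7]]
tau_rate = [[0.4, -0.2, 0.0], [-0.2, 0.9, 0.1], [0.0, 0.1, -0.3]]
L = [[0.3, 0.1, 0.0], [-0.2, 0.5, 0.4], [0.0, 0.1, -0.2]]
Lt = transpose(L)
D = combine((0.5, L), (0.5, Lt))      # stretch rate
Om = combine((0.5, L), (-0.5, Lt))    # spin rate

# Rates of T, S and tau* recovered from the Cauchy stress rate.
T_rate = combine((1.0, tau_rate), (-1.0, matmul(L, tau)), (trace(L), tau))
S_rate = combine((1.0, T_rate), (-1.0, matmul(tau, Lt)))
tau_star_rate = combine((1.0, T_rate), (1.0, matmul(tau, Om)))

p = -trace(tau) / 3.0
p_rate = -trace(tau_rate) / 3.0
p_t = -trace(T_rate) / 3.0
p_s = -trace(S_rate) / 3.0
p_r = -trace(tau_star_rate) / 3.0

# The three right-hand sides of Eq. (10); each should equal -p_rate.
rhs_t = -p_t + trace(D) * p + ddot(tau, D) / 3.0
rhs_s = -p_s + trace(D) * p + 2.0 * ddot(tau, D) / 3.0
rhs_r = -p_r + trace(D) * p + ddot(tau, D) / 3.0
```

With these data the three expressions agree with `-p_rate`, `p_t` equals `p_r`, and `p_s` differs from both, in line with the closing remark of the paragraph above.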
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If one replaces 

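$$\mathring{Q} \to \mathring{Q}^*, \quad \mathring{W} \to \mathring{W}^*, \quad \mathring{U} \to \mathring{U}^*, \quad \mathbf{D} \to \mathbf{D}' + \mathring{J}\,\mathbf{I}/3, \quad \mathbf{L} \to \mathbf{L}' + \mathring{J}\,\mathbf{I}/3,$$

by substituting Eqs. (1)-(4) and (7)-(9) into the functionals defined in Secs. 17.6 and 17.7, one can obtain various new variational principles in terms of the new field variables. We shall illustrate the process with the following example. A substitution of Eqs. (2)-(4) and (7) into Eq. (17.7:1) yields

$$\begin{aligned}
\Pi(\mathbf{v},\boldsymbol{\Omega},\mathbf{D}',\mathring{J},\mathring{\boldsymbol{\tau}}'^{*},\mathring{p}_r) = {} & \int_{V_N}\Big\{\mathring{Q}^*(\mathbf{D}',\mathring{J}) - \tfrac{1}{2}\boldsymbol{\tau}:(\boldsymbol{\Omega}^T\cdot\boldsymbol{\Omega}) + \boldsymbol{\tau}:(\boldsymbol{\Omega}^T\cdot\nabla_N\mathbf{v}) \\
& \qquad - \mathring{p}_r[\mathrm{tr}(\nabla_N\mathbf{v}) - \mathring{J}] + \mathring{\boldsymbol{\tau}}'^{*T}:[\nabla_N\mathbf{v} - \boldsymbol{\Omega} - \mathbf{D}' - \mathrm{tr}(\nabla_N\mathbf{v})\,\mathbf{I}/3] \\
& \qquad - \rho_N\mathring{\mathbf{b}}\cdot\mathbf{v}\Big\}\,dV - \int_{S_{\sigma N}}\mathring{\bar{\mathbf{T}}}\cdot\mathbf{v}\,dA \\
& - \int_{S_{uN}}\mathbf{n}\cdot\Big[\mathring{\boldsymbol{\tau}}'^{*} - \tfrac{1}{3}\mathrm{tr}(\mathring{\boldsymbol{\tau}}'^{*})\,\mathbf{I} - \mathring{p}_r\,\mathbf{I} - \boldsymbol{\tau}\cdot\boldsymbol{\Omega}\Big]\cdot(\mathbf{v}-\bar{\mathbf{v}})\,dA .
\end{aligned} \tag{11}$$

The functional holds for all materials for continuous $\mathbf{v}$, skew-symmetric $\boldsymbol{\Omega}$, symmetric deviatoric $\mathbf{D}'$, positive $\mathring{J}$, piecewise continuous deviatoric $\mathring{\boldsymbol{\tau}}'^{*}$ and piecewise continuous $\mathring{p}_r$. This functional is slightly different from that derived by Seki and Atluri (1994). In a similar fashion, one can derive the variational functionals using $\mathbf{v}$, $\boldsymbol{\Omega}$, $\mathbf{L}'$, $\mathring{J}$, $\mathring{\mathbf{T}}'$, $\mathring{p}_t$ or $\mathbf{v}$, $\boldsymbol{\Omega}$, $\mathbf{D}'$, $\mathring{J}$, $\mathring{\mathbf{S}}'$, $\mathring{p}_s$ as the independent field variables. The details are left to readers.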

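One can derive different variational principles from Eq. (11) using the contact transformation

$$-\mathring{Q}_c[(\mathring{\boldsymbol{\tau}}'^{*})_s,\mathring{p}_r] = \mathring{Q}^*(\mathbf{D}',\mathring{J}) - (\mathring{\boldsymbol{\tau}}'^{*})_s:\mathbf{D}' + \mathring{p}_r\,\mathring{J} . \tag{12}$$

One can show that

$$\frac{\partial\mathring{Q}_c[(\mathring{\boldsymbol{\tau}}'^{*})_s,\mathring{p}_r]}{\partial(\mathring{\boldsymbol{\tau}}'^{*})_s} = \mathbf{D}', \qquad \frac{\partial\mathring{Q}_c[(\mathring{\boldsymbol{\tau}}'^{*})_s,\mathring{p}_r]}{\partial\mathring{p}_r} = -\mathring{J} .$$

A substitution of Eq. (12) into Eq. (11) yields

$$\begin{aligned}
\Pi_8(\mathbf{v},\boldsymbol{\Omega},\mathring{\boldsymbol{\tau}}'^{*},\mathring{p}_r) = {} & \int_{V_N}\Big\{-\mathring{Q}_c[(\mathring{\boldsymbol{\tau}}'^{*})_s,\mathring{p}_r] - \tfrac{1}{2}\boldsymbol{\tau}:(\boldsymbol{\Omega}^T\cdot\boldsymbol{\Omega}) + \boldsymbol{\tau}:(\boldsymbol{\Omega}^T\cdot\nabla_N\mathbf{v}) \\
& \qquad + \mathring{\boldsymbol{\tau}}'^{*T}:[\nabla_N\mathbf{v} - \boldsymbol{\Omega} - \mathrm{tr}(\nabla_N\mathbf{v})\,\mathbf{I}/3] \\
& \qquad - \mathring{p}_r\,\mathrm{tr}(\nabla_N\mathbf{v}) - \rho_N\mathring{\mathbf{b}}\cdot\mathbf{v}\Big\}\,dV - \int_{S_{\sigma N}}\mathring{\bar{\mathbf{T}}}\cdot\mathbf{v}\,dA \\
& - \int_{S_{uN}}\mathbf{n}\cdot\Big[\mathring{\boldsymbol{\tau}}'^{*} - \tfrac{1}{3}\mathrm{tr}(\mathring{\boldsymbol{\tau}}'^{*})\,\mathbf{I} - \mathring{p}_r\,\mathbf{I} - \boldsymbol{\tau}\cdot\boldsymbol{\Omega}\Big]\cdot(\mathbf{v}-\bar{\mathbf{v}})\,dA ,
\end{aligned} \tag{13}$$

which is a functional with the four field variables $\mathbf{v}$, $\boldsymbol{\Omega}$, $\mathring{\boldsymbol{\tau}}'^{*}$, $\mathring{p}_r$. The stationary condition of $\Pi_8$ under the variation of the field variables subjected to the rigid condition $(\boldsymbol{\Omega})_s = 0$ gives the appropriate field equations and natural boundary conditions. The derivation is straightforward and is left to the reader.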

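One can also introduce a mixed contact transformation

$$\mathring{Q}_m(\mathbf{D}',\mathring{p}_r) = \mathring{Q}^*(\mathbf{D}',\mathring{J}) + \mathring{p}_r\,\mathring{J} \tag{14}$$

such that

$$\frac{\partial\mathring{Q}_m(\mathbf{D}',\mathring{p}_r)}{\partial\mathbf{D}'} = (\mathring{\boldsymbol{\tau}}'^{*})_s, \qquad \frac{\partial\mathring{Q}_m(\mathbf{D}',\mathring{p}_r)}{\partial\mathring{p}_r} = \mathring{J} .$$

A substitution of Eq. (14) into Eq. (11) yields

$$\begin{aligned}
\Pi_9(\mathbf{v},\boldsymbol{\Omega},\mathbf{D}',\mathring{\boldsymbol{\tau}}'^{*},\mathring{p}_r) = {} & \int_{V_N}\Big\{\mathring{Q}_m(\mathbf{D}',\mathring{p}_r) + \boldsymbol{\tau}:(\boldsymbol{\Omega}^T\cdot\nabla_N\mathbf{v}) - \tfrac{1}{2}\boldsymbol{\tau}:(\boldsymbol{\Omega}^T\cdot\boldsymbol{\Omega}) \\
& \qquad + \mathring{\boldsymbol{\tau}}'^{*T}:[\nabla_N\mathbf{v} - \boldsymbol{\Omega} - \mathbf{D}' - \mathrm{tr}(\nabla_N\mathbf{v})\,\mathbf{I}/3] \\
& \qquad - \mathring{p}_r\,\mathrm{tr}(\nabla_N\mathbf{v}) - \rho_N\mathring{\mathbf{b}}\cdot\mathbf{v}\Big\}\,dV - \int_{S_{\sigma N}}\mathring{\bar{\mathbf{T}}}\cdot\mathbf{v}\,dA \\
& - \int_{S_{uN}}\mathbf{n}\cdot\Big[\mathring{\boldsymbol{\tau}}'^{*} - \tfrac{1}{3}\mathrm{tr}(\mathring{\boldsymbol{\tau}}'^{*})\,\mathbf{I} - \mathring{p}_r\,\mathbf{I} - \boldsymbol{\tau}\cdot\boldsymbol{\Omega}\Big]\cdot(\mathbf{v}-\bar{\mathbf{v}})\,dA ,
\end{aligned} \tag{15}$$

which is a functional with the five field variables $\mathbf{v}$, $\boldsymbol{\Omega}$, $\mathbf{D}'$, $\mathring{\boldsymbol{\tau}}'^{*}$, $\mathring{p}_r$.

In Eqs. (14) and (15), $\mathring{J}$ does not appear explicitly and both $\mathring{Q}_c$ and $\mathring{Q}_m$ are well behaved as the material approaches incompressibility. Thus both functionals $\Pi_8$ and $\Pi_9$ are suitable for incompressible or nearly incompressible materials.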

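One can derive additional functionals by imposing constraint conditions a priori. For example, if the compatibility conditions are satisfied a priori,

$$\mathbf{D}' = (\nabla_N\mathbf{v})_s - \tfrac{1}{3}\mathrm{tr}(\nabla_N\mathbf{v})\,\mathbf{I} \quad \text{in } V_N , \tag{16}$$

$$\boldsymbol{\Omega} = (\nabla_N\mathbf{v})_a \quad \text{in } V_N , \tag{17}$$

$$\mathbf{v} = \bar{\mathbf{v}} \quad \text{on } S_{uN} , \tag{18}$$

then Eq. (15) is reduced to

$$\begin{aligned}
\Pi_{10}(\mathbf{v},\mathring{p}_r) = {} & \int_{V_N}\Big\{\mathring{Q}_m\big[(\nabla_N\mathbf{v})_s - \tfrac{1}{3}\mathrm{tr}(\nabla_N\mathbf{v})\,\mathbf{I},\ \mathring{p}_r\big] - \tfrac{1}{2}\boldsymbol{\tau}:[(\nabla_N\mathbf{v})_a^T\cdot(\nabla_N\mathbf{v})_a] \\
& \qquad + \boldsymbol{\tau}:[(\nabla_N\mathbf{v})_a^T\cdot\nabla_N\mathbf{v}] - \mathring{p}_r\,\mathrm{tr}(\nabla_N\mathbf{v}) - \rho_N\mathring{\mathbf{b}}\cdot\mathbf{v}\Big\}\,dV - \int_{S_{\sigma N}}\mathring{\bar{\mathbf{T}}}\cdot\mathbf{v}\,dA ,
\end{aligned} \tag{19}$$

which is a functional of the two fields, piecewise continuous $\mathring{p}_r$ and continuous $\mathbf{v}$. Tong (1969) first derived $\Pi_{10}$ for infinitesimal deformation.

When $\boldsymbol{\Omega}$ is re-introduced as part of a penalty function, we can extend the two-field functional $\Pi_{10}$ to the following three-field functional:

$$\begin{aligned}
\Pi_{10}^*(\mathbf{v},\boldsymbol{\Omega},\mathring{p}_r) = {} & \int_{V_N}\Big\{\mathring{Q}_m\big[(\nabla_N\mathbf{v})_s - \tfrac{1}{3}\mathrm{tr}(\nabla_N\mathbf{v})\,\mathbf{I},\ \mathring{p}_r\big] - \tfrac{1}{2}\boldsymbol{\tau}:[(\nabla_N\mathbf{v})_a^T\cdot(\nabla_N\mathbf{v})_a] \\
& \qquad + \boldsymbol{\tau}:[(\nabla_N\mathbf{v})_a^T\cdot\nabla_N\mathbf{v}] - \mathring{p}_r\,\mathrm{tr}(\nabla_N\mathbf{v}) + \tfrac{c}{2}[(\nabla_N\mathbf{v})_a - \boldsymbol{\Omega}]^2 \\
& \qquad - \rho_N\mathring{\mathbf{b}}\cdot\mathbf{v}\Big\}\,dV - \int_{S_{\sigma N}}\mathring{\bar{\mathbf{T}}}\cdot\mathbf{v}\,dA ,
\end{aligned} \tag{20}$$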


where $c$ is the constant factor of the penalty function. The functional $\Pi_{10}^*$, with the condition Eq. (16) satisfied a priori, is different from $\Pi_9$ in which $\mathring{\boldsymbol{\tau}}'^{*}$ is an independent field variable.

Having $\boldsymbol{\Omega}$ as an independent field rather than computing it from $(\nabla_N\mathbf{v})_a$ is important from the computational point of view. As demonstrated by Cazzani and Atluri (1992), and Seki and Atluri (1994), the solution accuracy for large deformation problems depends strongly on the accuracy of $\boldsymbol{\Omega}$. The spin rate is needed to compute the updated solution. The formulation of Eq. (20) is generally more accurate as it gives $\boldsymbol{\Omega}$ directly without having to differentiate the numerical solution of $\mathbf{v}$.

17.9. UPDATED SOLUTION 

Depending on the solution approach, during each increment, one solves directly for the stress increment rates $\mathring{\mathbf{T}}$, $\mathring{\boldsymbol{\tau}}^{*}$ or $\mathring{\mathbf{S}}$, the deformation rates $\mathbf{D}$, $\boldsymbol{\Omega}$ and the displacement rate $\mathbf{v}$. One then evaluates the Cauchy stress increment rate $\dot{\boldsymbol{\tau}}$ from Eq. (17.3:7), or Eqs. (17.3:9)-(17.3:12). In solving for the stress rates, the information needed includes: the Cauchy stress $\boldsymbol{\tau}\,(= \boldsymbol{\tau}^N)$, the increment rate of boundary traction $\mathring{\bar{\mathbf{T}}}$ on $S_{\sigma N}$, the increment rate of the body force $\mathring{\mathbf{b}}$, the increment rate of boundary displacement $\bar{\mathbf{v}}$ on $S_{uN}$, the deformed geometry of the body in $D_N$, and the constitutive laws that relate the stress rates to deformation rates. The constitutive law generally depends on the total deformation gradient $\mathbf{F}_0^N$ and the Cauchy stress $\boldsymbol{\tau}$. If a constitutive relation is known in the form of the gradient of a rate potential in terms of a specific strain rate, e.g., Eq. (17.5:9) or (17.5:15), one can derive other rate potentials through Eqs. (17.5:11)-(17.5:13) or Eqs. (17.5:16)-(17.5:19) and determine the corresponding constitutive laws accordingly. Hill (1967) and Atluri (1979, 1980a) show that a rate potential $\mathring{V}_J(\mathbf{D})$ can be expressed in terms of the strain rate tensor $\mathbf{D}$. Pian and Tong (1971) show that the rate potential $\mathring{W}(\mathbf{D})$ can be derived from the strain energy density $W_0(\mathbf{C})$. Equations (17.5:24) and (17.5:25) show the dependence of $\mathring{W}(\mathbf{D})$ on the total deformation gradient and the related deformation matrices.

The updated Cauchy stress is simply $\boldsymbol{\tau}^{N+1} = \boldsymbol{\tau} + \dot{\boldsymbol{\tau}}\,\Delta t$, which is used to determine the solution of the next increment. The current Cauchy stress is simply the sum of all previous increments, i.e.,

$$\boldsymbol{\tau}^{N+1} = \sum_{i}\dot{\boldsymbol{\tau}}^{i}\,\Delta t_i , \tag{1}$$

where $\dot{\boldsymbol{\tau}}^{i}\,\Delta t_i$ is the Cauchy stress increment of the $i$th step and the sum runs over all steps taken so far. The direct summation of the increments of other stress measures such as $\mathring{\mathbf{T}}$, $\mathring{\boldsymbol{\tau}}^{*}$ and $\mathring{\mathbf{S}}$ does not have any physical meaning, because each increment refers to a different configuration.

The updated total displacement vector and the updated position vector of a material point after each increment are

$$\mathbf{u}^{N+1} = \mathbf{u}^{N} + \mathbf{v}\,\Delta t , \tag{2}$$

$$\mathbf{X}^{N+1} = \mathbf{X}^{N} + \mathbf{v}\,\Delta t = \mathbf{X}^{0} + \mathbf{u}^{N+1} . \tag{3}$$

Based on the updated $\mathbf{u}^{N+1}$ and $\mathbf{X}^{N+1}$, one can evaluate the updated total deformation gradient and the related deformation matrices. Together with the updated $\boldsymbol{\tau}$, one can update the rate potential and the constitutive equation, e.g., Eqs. (17.5:25) and (17.5:26), for the solution of the next increment.
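The bookkeeping of Eqs. (1)-(3) at a single material point can be sketched as below. This is a minimal illustration, assuming the Cauchy stress rate and velocity for each step have already been obtained from the incremental solution; the names `advance` and `position` and all numbers are hypothetical.

```python
def advance(state, tau_rate, v, dt):
    """Apply one increment: tau += tau_rate * dt and u += v * dt.

    state is a dict holding the Cauchy stress 'tau' (3x3), the total
    displacement 'u' (3-vector) and the initial position 'X0' (3-vector).
    Only Cauchy stress increments are accumulated here; increments of
    other stress measures would first have to be converted.
    """
    tau = [[state["tau"][i][j] + tau_rate[i][j] * dt for j in range(3)]
           for i in range(3)]
    u = [state["u"][i] + v[i] * dt for i in range(3)]
    return {"tau": tau, "u": u, "X0": state["X0"]}


def position(state):
    """Current position X = X0 + u, as in Eq. (3)."""
    return [state["X0"][i] + state["u"][i] for i in range(3)]


# Two illustrative increments starting from a stress-free state.
state = {"tau": [[0.0] * 3 for _ in range(3)],
         "u": [0.0, 0.0, 0.0],
         "X0": [1.0, 2.0, 3.0]}
rate_1 = [[1.0, 0.0, 0.0], [0.0, 0.5, 0.0], [0.0, 0.0, 0.0]]
rate_2 = [[0.5, 0.2, 0.0], [0.2, 0.0, 0.0], [0.0, 0.0, 1.0]]
state = advance(state, rate_1, [0.1, 0.0, 0.0], 0.5)
state = advance(state, rate_2, [0.0, 0.2, 0.0], 0.25)
X_now = position(state)
```

After each call the updated position would be used to rebuild the deformation gradient before the next increment is solved, which the sketch does not attempt.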

In Secs. 16.9 and 17.5, we discuss the need of maintaining objectivity 
in establishing the constitutive laws. There is also a need of maintaining 
objectivity in computation. The rate of change of stress depends not only
on the strain rate $\mathbf{D}$ but also on the spin rate $\boldsymbol{\Omega}$ of the material element.
An objective constitutive law gives the relation between the strain rate and 
the objective stress rate in the coordinate frame rigidly spinning with the 
material element. The relation between the strain rate, the spin rate, and 
the rates of other stress measures can be obtained by transformation of the 
objective stress rate tensor and the strain rate from the spinning coordinate 
system. 

How well objectivity is maintained in the computation depends on the accuracy of the computed spin rate and spinning coordinates. The covariant base vector $\mathbf{g}_M$ of a rigid frame spinning at a rate of $\boldsymbol{\omega}(t)$ satisfies the equation

$$\frac{d\mathbf{g}_M}{dt} = \boldsymbol{\omega}\cdot\mathbf{g}_M , \tag{4}$$

at any loading stage. The solution is

$$\mathbf{g}_M(t) = \mathbf{R}(t)\cdot\mathbf{g}_M(t_N) ,$$

where $\mathbf{g}_M(t_N)$ is the initial value at $t = t_N$, and $\mathbf{R}(t)$ is the rotation tensor of the deformation if $\boldsymbol{\omega}(t)$ is as defined in Eq. (16.5:11). Thus to maintain objectivity, we should ensure that the computed $\mathbf{R}(t)$ is a rotation tensor (Hughes and Winget 1981, Rubinstein and Atluri 1983).

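It can be shown that the linearized solution of Eq. (16.5:11) is

$$\mathbf{R}(t_N + \Delta t) = [\mathbf{I} + \boldsymbol{\Omega}(t_N)\,\Delta t]\cdot\mathbf{R}(t_N) ,$$

where $\boldsymbol{\Omega}(t_N)$ is the spin rate of the deformation in the updated Lagrangian description. Then, obviously,

$$\begin{aligned}
\mathbf{R}(t_N + \Delta t)^T\cdot\mathbf{R}(t_N + \Delta t) & = \mathbf{R}(t_N)^T\cdot[\mathbf{I} + \boldsymbol{\Omega}(t_N)^T\Delta t]\cdot[\mathbf{I} + \boldsymbol{\Omega}(t_N)\,\Delta t]\cdot\mathbf{R}(t_N) \\
& = \mathbf{I} + \mathbf{R}(t_N)^T\cdot\boldsymbol{\Omega}(t_N)^T\cdot\boldsymbol{\Omega}(t_N)\cdot\mathbf{R}(t_N)\,\Delta t^2 ,
\end{aligned} \tag{5}$$

since $\mathbf{R}(t_N)^T\cdot\mathbf{R}(t_N) = \mathbf{I}$ and $\boldsymbol{\Omega}(t_N)^T + \boldsymbol{\Omega}(t_N) = 0$. The error in rotation is of the order $\Delta t^2$. Objectivity is maintained only in the limit of $\Delta t \to 0$.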
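The loss of orthogonality in Eq. (5) is easy to observe numerically. The following sketch applies one linearized update to the identity rotation with a constant, arbitrarily chosen spin rate; the function names are hypothetical and the numbers are for illustration only.

```python
def identity():
    return [[1.0 if i == j else 0.0 for j in range(3)] for i in range(3)]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def transpose(a):
    return [[a[j][i] for j in range(3)] for i in range(3)]


def skew(w):
    """Skew-symmetric spin tensor whose axial vector is w."""
    return [[0.0, -w[2], w[1]],
            [w[2], 0.0, -w[0]],
            [-w[1], w[0], 0.0]]


def euler_rotation_update(R, W, dt):
    """Linearized update R(t + dt) = (I + W dt) . R(t)."""
    I3 = identity()
    step = [[I3[i][j] + W[i][j] * dt for j in range(3)] for i in range(3)]
    return matmul(step, R)


def orthogonality_defect(R):
    """Largest entry of |R^T R - I|; zero for an exact rotation."""
    RtR = matmul(transpose(R), R)
    I3 = identity()
    return max(abs(RtR[i][j] - I3[i][j]) for i in range(3) for j in range(3))


W = skew([0.3, -0.2, 0.5])  # illustrative constant spin rate
defect_dt = orthogonality_defect(euler_rotation_update(identity(), W, 0.1))
defect_half = orthogonality_defect(euler_rotation_update(identity(), W, 0.05))
```

Halving the step reduces the defect by a factor of four, which is the $\Delta t^2$ behavior stated after Eq. (5).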

In practical computation, the increment $\Delta t$ is finite. In order to improve the preservation of objectivity in the incremental solution, we modify the computational algorithm to minimize the error caused by finite incremental steps. The accuracy of integrating Eq. (4) is improved if an appropriate value of $\boldsymbol{\Omega}^*$ [$= \boldsymbol{\Omega}(t^*)$, where $t_N < t^* < t_N + \Delta t$] is used instead of $\boldsymbol{\Omega}(t_N)$ in the Euler approximation given above. Hughes and Winget (1981), Rubinstein and Atluri (1983), Key (1980), Courant and Hilbert (1953) and others have shown that the midpoint rule of integration (i.e., use $t^* = t_N + \Delta t/2$) provides an accuracy of the order $(\Delta t)^2$.

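The midpoint rule can be easily implemented by evaluating $\boldsymbol{\Omega}(t)$ at the midpoint of each increment, i.e., $\boldsymbol{\Omega}(t_N + \Delta t/2)$. In other words, in determining the solution in the $D_{N+1}$ configuration, instead of using the operator $\nabla_{N+1}$, we use

$$\nabla_{N+1/2} = \mathbf{e}_i\,\frac{\partial}{\partial X_i^{N+1/2}} , \tag{6}$$

where $\mathbf{X}^{N+1/2} = \mathbf{X}^{N} + \mathbf{v}\,\Delta t/2$. Then, we define the rotation and the deformation gradient as

$$\mathbf{R}(t_N + \Delta t) = [\mathbf{I} + \boldsymbol{\Omega}(t_N + \Delta t/2)\,\Delta t]\cdot\mathbf{R}(t_N) , \tag{7}$$

$$\mathbf{F} = \mathbf{F}_0^{N+1} = \frac{\partial\mathbf{X}^{N+1}}{\partial\mathbf{X}^{0}} , \tag{8}$$

where

$$\boldsymbol{\Omega}(t_N + \Delta t/2) = \tfrac{1}{2}\big[\nabla_{N+1/2}\mathbf{v} - (\nabla_{N+1/2}\mathbf{v})^T\big] . \tag{9}$$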


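We may also compute the orthogonal rotation by

$$\mathbf{R}(t_N + \Delta t) = \exp(\boldsymbol{\Omega}\,\Delta t)\cdot\mathbf{R}(t_N) , \tag{10}$$

which can further be approximated, by truncating the exponential series after the quadratic term, as

$$\mathbf{R}(t_N + \Delta t) = [\mathbf{I} + \boldsymbol{\Omega}\,\Delta t + \boldsymbol{\Omega}\cdot\boldsymbol{\Omega}\,(\Delta t)^2/2]\cdot\mathbf{R}(t_N) . \tag{11}$$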
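For a constant spin rate the exponential in Eq. (10) has a closed form (the Rodrigues formula), which gives a convenient reference for the truncated update of Eq. (11). The sketch below is an illustration only: it assumes a constant spin over the step, and the function names and numbers are hypothetical.

```python
import math


def identity():
    return [[1.0 if i == j else 0.0 for j in range(3)] for i in range(3)]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def transpose(a):
    return [[a[j][i] for j in range(3)] for i in range(3)]


def skew(w):
    return [[0.0, -w[2], w[1]], [w[2], 0.0, -w[0]], [-w[1], w[0], 0.0]]


def series_update(w, dt, a, b):
    """Return I + a * W + b * W.W for the spin tensor W built from w."""
    W = skew(w)
    W2 = matmul(W, W)
    I3 = identity()
    return [[I3[i][j] + a * W[i][j] + b * W2[i][j] for j in range(3)]
            for i in range(3)]


def exp_spin(w, dt):
    """Closed-form exp(W dt) for constant spin (Rodrigues formula)."""
    omega = math.sqrt(sum(c * c for c in w))
    if omega < 1e-14:
        return series_update(w, dt, dt, 0.5 * dt * dt)
    theta = omega * dt
    return series_update(w, dt, math.sin(theta) / omega,
                         (1.0 - math.cos(theta)) / omega ** 2)


def second_order_spin(w, dt):
    """Truncated series I + W dt + W.W dt^2 / 2, as in Eq. (11)."""
    return series_update(w, dt, dt, 0.5 * dt * dt)


def orthogonality_defect(R):
    RtR = matmul(transpose(R), R)
    I3 = identity()
    return max(abs(RtR[i][j] - I3[i][j]) for i in range(3) for j in range(3))


w, dt = [0.3, -0.2, 0.5], 0.1
R_exact = exp_spin(w, dt)
R_trunc = second_order_spin(w, dt)
defect_exact = orthogonality_defect(R_exact)
defect_trunc = orthogonality_defect(R_trunc)
gap = max(abs(R_exact[i][j] - R_trunc[i][j]) for i in range(3) for j in range(3))
```

The exponential update is orthogonal to rounding error, while the truncated form leaves a defect of fourth order in $\Delta t$, much smaller than the second-order defect of the linearized update.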

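We calculate $\mathbf{C}_0^{N+1}$, $\mathbf{B}_0^{N+1}$ and $\mathbf{U}_0^{N+1}$ from $\mathbf{F}_0^{N+1}$ and $\mathbf{R}_0^{N+1}$. Depending on the approach, one has the solution in terms of $\mathring{\mathbf{T}}$, $\mathring{\boldsymbol{\tau}}^{*}$, $\mathring{\mathbf{S}}$ or $\mathring{\boldsymbol{\sigma}}_r$. Then one of the following relations is used to calculate $\dot{\boldsymbol{\tau}}$:

$$\dot{\boldsymbol{\tau}} = \mathring{\boldsymbol{\sigma}}_r + \boldsymbol{\Omega}\cdot\boldsymbol{\tau} - \boldsymbol{\tau}\cdot\boldsymbol{\Omega} - \mathrm{tr}(\mathbf{L})\,\boldsymbol{\tau} , \tag{12}$$

$$\dot{\boldsymbol{\tau}} = \mathring{\mathbf{S}} + \mathbf{L}\cdot\boldsymbol{\tau} + \boldsymbol{\tau}\cdot\mathbf{L}^T - \mathrm{tr}(\mathbf{L})\,\boldsymbol{\tau} , \tag{13}$$

$$\dot{\boldsymbol{\tau}} = \mathring{\boldsymbol{\tau}}^{*} + \mathbf{L}\cdot\boldsymbol{\tau} - \mathrm{tr}(\mathbf{L})\,\boldsymbol{\tau} - \boldsymbol{\tau}\cdot\boldsymbol{\Omega} , \tag{14}$$

$$\dot{\boldsymbol{\tau}} = \mathring{\mathbf{T}} + \mathbf{L}\cdot\boldsymbol{\tau} - \mathrm{tr}(\mathbf{L})\,\boldsymbol{\tau} , \tag{15}$$

where $\mathbf{L}$ and $\boldsymbol{\Omega}$ are the half-step values defined in Eqs. (6) and (9). Incremental numerical solutions are numerous, e.g., Pian and Tong (1971), Murakawa and Atluri (1978, 1979) and Seki and Atluri (1994).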

17.10. INCREMENTAL LOADS 

The value of the load increment at each increment step depends on the nature of the applied loads. Normally the applied loads are of either a deadweight or pressure type. The direction of a deadweight load does not change (e.g., the gravitational force per unit mass) and its magnitude is given in terms of unit undeformed volume or surface. On the other hand, a pressure load is given in terms of unit deformed area, and both its direction and magnitude can change at each increment.

We are going to determine the applied traction rate $\mathring{\bar{\mathbf{T}}}$ in the updated Lagrangian description in the $N$ configuration. The rate depends on the nature of the applied loads. From Eqs. (17.6:6) and (17.3:7), we obtain

$$\mathring{\bar{\mathbf{T}}} = \mathbf{N}\cdot\mathring{\mathbf{T}} = \mathbf{N}\cdot\dot{\boldsymbol{\tau}} + \mathbf{N}\cdot[\mathrm{tr}(\mathbf{L})\,\mathbf{I} - \mathbf{L}]\cdot\boldsymbol{\tau} , \tag{1}$$

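where $\mathbf{N}$ is a unit normal to $d\mathbf{A}^N$ such that $d\mathbf{A}^N = \mathbf{N}\,dS^N$, $\mathbf{L}$ is the rate of deformation gradient and $\mathring{\mathbf{T}}$, $\dot{\boldsymbol{\tau}}$ are, respectively, the Lagrangian and the Cauchy stress rates in the updated Lagrangian description. The first term of the rightmost expression above reflects the change in the loads. The second term $[\mathrm{tr}(\mathbf{L})\,\mathbf{I} - \mathbf{L}]\cdot\boldsymbol{\tau}$ results from the deformation of the boundary surface.

For applied pressure loads (without shear), to first order in $\Delta t$,

$$\begin{aligned}
p^{N+1}\,d\mathbf{A}^{N+1} & = (p^N + \mathring{p}\,\Delta t)[1 + \mathrm{tr}(\mathbf{D})\,\Delta t](\mathbf{I} - \mathbf{L}^T\Delta t)\cdot d\mathbf{A}^N \\
& = p^N\,d\mathbf{A}^N + \{\mathring{p}\,\mathbf{I} + [\mathrm{tr}(\mathbf{D})\,\mathbf{I} - \mathbf{L}^T]\,p^N\}\,\Delta t\cdot d\mathbf{A}^N ,
\end{aligned}$$

where $\mathring{p}$ is the prescribed incremental rate of pressure. Note that, from Eq. (16.2:18), we have

$$d\mathbf{A}^{N+1} = J_N^{N+1}\,[(\mathbf{F}^{-1})^T]_N^{N+1}\cdot d\mathbf{A}^N .$$

Then, from $\mathring{\bar{\mathbf{T}}}\,\Delta t\,dS^N = p^{N+1}\,d\mathbf{A}^{N+1} - p^N\,d\mathbf{A}^N$, the applied load increment is

$$\mathring{\bar{\mathbf{T}}} = \mathring{p}\,\mathbf{N} + [\mathrm{tr}(\mathbf{D})\,\mathbf{N} - \mathbf{L}^T\cdot\mathbf{N}]\,p^N , \tag{2}$$

for the rate of traction increment on the deformed traction-prescribed surfaces in the updated Lagrangian description.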
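Equation (2) can be compared with a finite-difference evaluation that uses the exact area change $d\mathbf{A}^{N+1} = J\,\mathbf{F}^{-T}\cdot d\mathbf{A}^N$ with $\mathbf{F} = \mathbf{I} + \mathbf{L}\,\Delta t$. The sketch below is illustrative only: the function names and numbers are hypothetical, and it relies on the identity that the cofactor matrix of $\mathbf{F}$ equals $J\,\mathbf{F}^{-T}$.

```python
def cofactor(F):
    """Cofactor matrix of a 3x3 matrix; equals det(F) * inverse(F)^T."""
    return [[F[(i + 1) % 3][(j + 1) % 3] * F[(i + 2) % 3][(j + 2) % 3]
             - F[(i + 1) % 3][(j + 2) % 3] * F[(i + 2) % 3][(j + 1) % 3]
             for j in range(3)] for i in range(3)]


def pressure_traction_rate(p, p_rate, L, N):
    """Traction rate of Eq. (2): p_rate N + [tr(D) N - L^T . N] p."""
    tr_D = L[0][0] + L[1][1] + L[2][2]
    Lt_N = [sum(L[k][i] * N[k] for k in range(3)) for i in range(3)]
    return [p_rate * N[i] + (tr_D * N[i] - Lt_N[i]) * p for i in range(3)]


def finite_difference_rate(p, p_rate, L, N, dt):
    """[p_new dA_new - p dA] / dt for a unit area with normal N."""
    F = [[(1.0 if i == j else 0.0) + L[i][j] * dt for j in range(3)]
         for i in range(3)]
    C = cofactor(F)
    area_new = [sum(C[i][j] * N[j] for j in range(3)) for i in range(3)]
    return [((p + p_rate * dt) * area_new[i] - p * N[i]) / dt
            for i in range(3)]


# Illustrative data (not from the text).
L = [[0.3, 0.1, 0.0], [-0.2, 0.5, 0.4], [0.0, 0.1, -0.2]]
N = [0.0, 0.0, 1.0]
rate_formula = pressure_traction_rate(2.0, 0.5, L, N)
rate_fd = finite_difference_rate(2.0, 0.5, L, N, 1e-7)
```

The two results agree to within the first-order truncation error of the finite difference, which shows that the bracketed term in Eq. (2) accounts for both the change of area and the rotation of the normal.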

For a deadweight type of applied loads, the forces are normally referred to the undeformed configuration. Let $d\bar{\mathbf{T}}^{N+1}$ be the applied force vector acting on an infinitesimal surface $d\mathbf{A}^{N+1}$ on the boundary in the $D_{N+1}$ configuration. We have

$$\begin{aligned}
d\bar{\mathbf{T}}^{N+1} & = \mathbf{T}_0^{N+1}\cdot d\mathbf{A}^0 = (\mathbf{T}_0^{N} + \Delta\mathbf{T})\cdot d\mathbf{A}^0 \\
& = \boldsymbol{\tau}^N\cdot d\mathbf{A}^N + \Delta\bar{\mathbf{T}}\,dS^0 = \bar{\mathbf{T}}^N\,dS^N + \Delta\bar{\mathbf{T}}\,dS^0 ,
\end{aligned} \tag{3}$$

where the superscript "0" refers to the undeformed state, and $\mathbf{T}_0^{N+1}$, $\mathbf{T}_0^{N}$ are the first Piola-Kirchhoff stresses at the $N+1$ and $N$ deformation states referred to the undeformed state. Thus $\Delta\bar{\mathbf{T}}$ is the given traction increment. The force vector $d\bar{\mathbf{T}}^{N+1}$ also relates to the traction $\bar{\mathbf{T}}^N$ and the surface area $dS^N = |d\mathbf{A}^N|$ in the $D_N$ configuration by the equation

$$d\bar{\mathbf{T}}^{N+1} = \bar{\mathbf{T}}^{N+1}\,dS^N = (\bar{\mathbf{T}}^N + \mathring{\bar{\mathbf{T}}}\,\Delta t)\,dS^N . \tag{4}$$

Comparing Eqs. (3) and (4) gives

$$\mathring{\bar{\mathbf{T}}} = \frac{\Delta\bar{\mathbf{T}}}{\Delta t}\,\frac{dS^0}{dS^N} .$$

Using Eqs. (16.2:14), one finds

$$\mathring{\bar{\mathbf{T}}} = \frac{\Delta\bar{\mathbf{T}}}{\Delta t}\,\frac{\sqrt{\mathbf{N}\cdot\mathbf{B}_0^N\cdot\mathbf{N}}}{J_0^N} \tag{5}$$

for the rate of traction increment on the deformed traction-prescribed surfaces in the updated Lagrangian description, where $\mathbf{B}_0^N\,[= \mathbf{F}_0^N(\mathbf{F}_0^N)^T]$ is the Almansi tensor in terms of the deformation gradient $\mathbf{F}_0^N$ given in Eq. (16.2:8), and $J_0^N$ is the Jacobian referred to the undeformed state. Note that lower case $\mathbf{n}$ is used as the unit normal to the deformed surface $d\mathbf{a}$ in Eq. (16.2:8); presently they are denoted by $\mathbf{N}$ and $d\mathbf{A}^N$, respectively.

For body forces proportional to the mass per unit undeformed volume, the rate of body force increment in the updated Lagrangian description is

$$\mathring{\mathbf{b}} = \frac{\Delta\mathbf{b}}{\Delta t}\,\frac{dV^0}{dV^N} = \frac{\Delta\mathbf{b}}{\Delta t}\,\frac{1}{J_0^N} ,$$

where $\Delta\mathbf{b}$ is the body force increment per unit mass at the $N$th step of increment.
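The area ratio used in Eq. (5) can be cross-checked against the area change formula $\mathbf{N}\,dS^N = J\,\mathbf{F}^{-T}\cdot\mathbf{N}^0\,dS^0$. The sketch below is an illustration under assumed data; the function names, the deformation gradient and the load increments are hypothetical.

```python
import math


def cofactor(F):
    """Cofactor matrix of a 3x3 matrix; equals det(F) * inverse(F)^T."""
    return [[F[(i + 1) % 3][(j + 1) % 3] * F[(i + 2) % 3][(j + 2) % 3]
             - F[(i + 1) % 3][(j + 2) % 3] * F[(i + 2) % 3][(j + 1) % 3]
             for j in range(3)] for i in range(3)]


def det3(F):
    C = cofactor(F)
    return sum(F[0][j] * C[0][j] for j in range(3))


def area_ratio(F, N):
    """dS0 / dSN = sqrt(N . B . N) / J with B = F F^T, as used in Eq. (5)."""
    Ft_N = [sum(F[k][i] * N[k] for k in range(3)) for i in range(3)]
    return math.sqrt(sum(c * c for c in Ft_N)) / det3(F)


def deadweight_traction_rate(delta_T, dt, F, N):
    """Traction rate of Eq. (5) for a prescribed deadweight increment."""
    ratio = area_ratio(F, N)
    return [c / dt * ratio for c in delta_T]


def body_force_rate(delta_b, dt, F):
    """Body force rate: (delta_b / dt) / J."""
    J = det3(F)
    return [c / (dt * J) for c in delta_b]


# Illustrative total deformation gradient and undeformed unit normal.
F = [[1.2, 0.1, 0.0], [0.0, 0.9, 0.2], [0.1, 0.0, 1.1]]
N0 = [0.0, 0.0, 1.0]
C = cofactor(F)
a = [sum(C[i][j] * N0[j] for j in range(3)) for i in range(3)]  # N dSN / dS0
dS_N = math.sqrt(sum(c * c for c in a))
N = [c / dS_N for c in a]                                        # deformed unit normal
ratio = area_ratio(F, N)
T_rate = deadweight_traction_rate([0.0, 0.0, -1.0], 0.5, F, N)
b_rate = body_force_rate([0.0, 0.0, -9.81], 0.5, F)
```

Here `ratio` reproduces `1 / dS_N`, i.e. the deadweight increment per unit undeformed area is simply rescaled to the current area.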

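We summarize the rate potentials and the constitutive laws developed in this chapter as follows:

Rate potential: $\mathring{W}(\mathbf{D}) = \mathring{V}_0(\mathbf{D}) = d_{ijkl}D_{ij}D_{kl}/2$
Constitutive law: $\mathring{\mathbf{S}} = \partial\mathring{W}(\mathbf{D})/\partial\mathbf{D}$

Rate potential: $\mathring{Q}(\mathbf{D}) = \mathring{V}_0(\mathbf{D}) + \tfrac{1}{2}\boldsymbol{\tau}:(\mathbf{D}\cdot\mathbf{D})$
Constitutive law: $(\mathring{\boldsymbol{\tau}}^{*})_s = \partial\mathring{Q}(\mathbf{D})/\partial\mathbf{D}$

Rate potential: $\mathring{U}(\mathbf{L}) = \mathring{V}_0(\mathbf{D}) + \tfrac{1}{2}\boldsymbol{\tau}:(\mathbf{L}^T\cdot\mathbf{L})$
Constitutive law: $\mathring{\mathbf{T}}^T = \partial\mathring{U}(\mathbf{L})/\partial\mathbf{L}$

Rate potential: $\mathring{P}(\mathbf{D}) = \mathring{V}_0(\mathbf{D}) + \boldsymbol{\tau}:(\mathbf{D}\cdot\mathbf{D})$
Constitutive law: $\mathring{\boldsymbol{\sigma}}_r = \partial\mathring{P}(\mathbf{D})/\partial\mathbf{D}$

Rate potential: $\mathring{Q}_c[(\mathring{\mathbf{T}} + \boldsymbol{\tau}\cdot\boldsymbol{\Omega})_s] = (\mathring{\mathbf{T}}^T)_s:\mathbf{D} - \mathring{Q}(\mathbf{D}) + \boldsymbol{\tau}:(\boldsymbol{\Omega}\cdot\mathbf{D})$
Constitutive law: $\mathbf{D} = \partial\mathring{Q}_c/\partial(\mathring{\mathbf{T}} + \boldsymbol{\tau}\cdot\boldsymbol{\Omega})_s$

Rate potential: $\mathring{W}^*(\mathbf{D}',\mathring{J}) = \mathring{W}(\mathbf{D})$
Constitutive law: $\mathring{\mathbf{S}}' = \partial\mathring{W}^*(\mathbf{D}',\mathring{J})/\partial\mathbf{D}'$, $\mathring{p}_s = -\partial\mathring{W}^*(\mathbf{D}',\mathring{J})/\partial\mathring{J}$

Rate potential: $\mathring{Q}^*(\mathbf{D}',\mathring{J}) = \mathring{Q}(\mathbf{D})$
Constitutive law: $(\mathring{\boldsymbol{\tau}}'^{*})_s = \partial\mathring{Q}^*(\mathbf{D}',\mathring{J})/\partial\mathbf{D}'$, $\mathring{p}_r = -\partial\mathring{Q}^*(\mathbf{D}',\mathring{J})/\partial\mathring{J}$

Rate potential: $\mathring{U}^*(\mathbf{L}',\mathring{J}) = \mathring{U}(\mathbf{L})$
Constitutive law: $\mathring{\mathbf{T}}'^{T} = \partial\mathring{U}^*(\mathbf{L}',\mathring{J})/\partial\mathbf{L}'$, $\mathring{p}_t = -\partial\mathring{U}^*(\mathbf{L}',\mathring{J})/\partial\mathring{J}$

Note: $\mathring{V}_0(\mathbf{D}) = \mathring{V}_J(\mathbf{D}) + \boldsymbol{\tau}:(\mathbf{D}\cdot\mathbf{D})$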


17.11. INFINITESIMAL STRAIN THEORY 

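The incremental variational principles previously formulated can be reduced to the case of the infinitesimal strain theory by simply regarding the current configuration $D_N$ as the initial configuration and assuming no initial stress, i.e., $\boldsymbol{\tau}^N = 0$. In this case, all stress measures become the same:

$$\boldsymbol{\tau}^{N+1} = \mathbf{T}^{N+1} = \boldsymbol{\tau}^{*N+1} = \mathbf{S}^{N+1} = \dot{\boldsymbol{\tau}}\,\Delta t = \mathring{\mathbf{T}}\,\Delta t = \mathring{\boldsymbol{\tau}}^{*}\Delta t = \mathring{\mathbf{S}}\,\Delta t ,$$

$$\mathring{\mathbf{T}}' = \mathring{\boldsymbol{\tau}}'^{*} = \mathring{\mathbf{S}}' = \dot{\boldsymbol{\tau}}' = \dot{\boldsymbol{\tau}} - \mathrm{tr}(\dot{\boldsymbol{\tau}})\,\mathbf{I}/3 ,$$

$$\mathring{p} = \mathring{p}_r = \mathring{p}_t = \mathring{p}_s .$$

The following rate potentials are also equal:

$$\mathring{V}_J(\mathbf{D}) = \mathring{Q}(\mathbf{D}) = \mathring{W}(\mathbf{D}) = \mathring{U}(\mathbf{L}) = \mathring{P}(\mathbf{D}) .$$

For linear elastic isotropic material, one has

$$\mathring{Q}(\mathbf{D}) = \tfrac{1}{2}\lambda D_{ii}D_{jj} + G\,D_{ij}D_{ij} ,$$

$$\begin{aligned}
\mathring{Q}^*(\mathbf{D}',\mathring{J}) & = \tfrac{1}{2}\lambda\mathring{J}^2 + G\big(D'_{ij} + \tfrac{1}{3}\mathring{J}\delta_{ij}\big)\big(D'_{ij} + \tfrac{1}{3}\mathring{J}\delta_{ij}\big) \\
& = \tfrac{1}{2}\big(\lambda + \tfrac{2}{3}G\big)\mathring{J}^2 + G\,D'_{ij}D'_{ij} = \tfrac{1}{2}K\mathring{J}^2 + G\,D'_{ij}D'_{ij} ,
\end{aligned}$$

$$\mathring{Q}_m(\mathbf{D}',\mathring{p}) = \mathring{Q}^*(\mathbf{D}',\mathring{J}) + \mathring{p}\,\mathring{J} = G\,D'_{ij}D'_{ij} - \frac{\mathring{p}^2}{2K} ,$$

$$\mathring{Q}_c(\mathring{\boldsymbol{\tau}}',\mathring{p}) = -\mathring{Q}^*(\mathbf{D}',\mathring{J}) + \mathring{\tau}'_{ij}D'_{ij} - \mathring{p}\,\mathring{J} = \frac{\mathring{\tau}'_{ij}\mathring{\tau}'_{ij}}{4G} + \frac{\mathring{p}^2}{2K} ,$$

where $\lambda$ is the Lamé constant, $G$ the shear modulus and $K$ the bulk modulus.

For materials approaching incompressibility, $\lambda$ and $K$ tend to infinity while $\mathring{J}$ approaches zero. Therefore $\mathring{Q}$ and $\mathring{Q}^*$ are not suitable for numerical computation. However, both $\mathring{Q}_m$ and $\mathring{Q}_c$ are well behaved since $\mathring{p}$ is finite.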
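The closed forms above can be reproduced by carrying out the two contact transformations numerically for the linear isotropic case, using $-\mathring{p} = K\mathring{J}$ and $\mathring{\boldsymbol{\tau}}' = 2G\,\mathbf{D}'$. The sketch below is only an illustration; the function names and the material and strain-rate values are hypothetical.

```python
def ddot(a, b):
    """Double contraction a : b = a_ij b_ij."""
    return sum(a[i][j] * b[i][j] for i in range(3) for j in range(3))


def q_star(D_dev, J, G, K):
    """Q*(D', J) = K J^2 / 2 + G D':D' for a linear isotropic material."""
    return 0.5 * K * J * J + G * ddot(D_dev, D_dev)


def q_mixed(D_dev, p, G, K):
    """Q_m(D', p) = Q* + p J with J eliminated through -p = K J."""
    J = -p / K
    return q_star(D_dev, J, G, K) + p * J


def q_comp(tau_dev, p, G, K):
    """Q_c(tau', p) = -Q* + tau':D' - p J with D' = tau'/(2G), J = -p/K."""
    D_dev = [[tau_dev[i][j] / (2.0 * G) for j in range(3)] for i in range(3)]
    J = -p / K
    return -q_star(D_dev, J, G, K) + ddot(tau_dev, D_dev) - p * J


# Illustrative values: nearly incompressible material, K much larger than G.
G, K, p = 1.0, 1.0e9, 0.5
D_dev = [[0.01, 0.002, 0.0], [0.002, -0.004, 0.0], [0.0, 0.0, -0.006]]
tau_dev = [[2.0 * G * D_dev[i][j] for j in range(3)] for i in range(3)]

qm = q_mixed(D_dev, p, G, K)
qc = q_comp(tau_dev, p, G, K)
# For a fixed nonzero J the volumetric part of Q* grows in proportion to K.
growth = q_star(D_dev, 1.0e-3, G, 10.0 * K) / q_star(D_dev, 1.0e-3, G, K)
```

With these numbers `qm` and `qc` stay of the order of $G\,\mathbf{D}':\mathbf{D}'$ however large $K$ becomes, whereas `growth` is close to 10, which illustrates why $\mathring{Q}^*$ is poorly suited to nearly incompressible materials.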


PROBLEMS 

17.4. Derive a variational principle using $\mathbf{v}$, $\boldsymbol{\Omega}$, $\mathbf{L}'$, $\mathring{J}$, $\mathring{\mathbf{T}}'$, $\mathring{p}_t$ as the field variables, where $\mathring{\mathbf{T}}'$, $\mathring{p}_t$ are defined in Eqs. (17.8:5) and (17.8:6). Also, derive the field equations and the natural boundary conditions associated with the stationary condition of the functional.

17.5. Derive the field equations and the natural boundary conditions for the functionals $\Pi_8$, $\Pi_9$, $\Pi_{10}$, $\Pi_{10}^*$ defined in Sec. 17.8.


18. FINITE ELEMENT METHODS

A numerical approach is a “must” for establishing satisfactory approximate solutions for most practical problems. There are many numerical methods. A treatise on the entire subject is beyond the scope of this book. In this chapter, we focus on the fundamentals of the finite element method. We will explore the finite element concept, the approximations involved, the construction of the finite element equations and the use of the incremental approach to nonlinear problems.

The finite element method is one of the most important developments 
in numerical analysis. It is versatile and powerful for solving linear as 
well as nonlinear complex problems. The method was developed in the 
1950’s by engineers as an outgrowth of the so-called matrix method (Argyris 
1955, 1966, Argyris and Kelsey 1960) for systematically analyzing complex 
structures containing a large number of components. Over the years, the 
finite element method has spread to applications in all fields of engineering, 
science and medicine. 

The finite element method establishes approximate solutions in terms of 
unknown parameters in sub-regions called “elements” and then deduces an 
approximate solution for the whole domain by enforcing certain relations 
among the solutions of all elements. For structural analysis the procedure 
is to express the relations between the displacements and internal forces at 
the selected nodal points of individual structural components in the form of 
a system of algebraic equations. The unknowns for the equations are nodal 
displacements, nodal internal forces, or both. Depending on the unknowns
and dependent variables selected, the method is qualified with words like
displacement, force, hybrid or mixed. The system of equations is written
most conveniently in matrix notation, and the solution of these equations 
is obtained efficiently by high-speed computation. 

The foundations for many finite element methods have been established. Formulations of methods with variational principles or principles of virtual work were developed by Courant (1943), Gallagher et al. (1962), Besseling (1963), Jones (1964), Pian (1964), Fraeijs de Veubeke (1964), Herrmann (1965), Prager (1967, 1968), Tong and Pian (1967), Pian and Tong (1969), Oden (1969), Tong (1970), Zienkiewicz (1971), Atluri (1980, 1995), Hughes (1987), and Pian (1995). Tong (1966) developed the early finite element method for fluid mechanics. The method has become a major tool for computational fluid dynamics (Hughes and Franca 1987, Tezduyar et al. 1988, Gunzburger 1989, Shakib et al. 1991, Capon and Jimack 1995), magnetohydrodynamics, radiation analysis, and heat transfer. A major branch of the finite element method is the boundary element method (Cruse 1969, Cruse and Rizzo, eds. 1975, Brebbia 1984, Chen and Zhou 1992, Camp and Gipson 1992, Hall 1994), in which one transforms the problem so that all the unknown field quantities appear on the boundaries only, to avoid dealing with the entire domain. One then establishes the algebraic equations for the unknowns at selected discrete locations. The accuracy of the approximate solution depends on the smoothness of the field variable. When a physical problem is analyzed using the finite element method, it involves approximation in the geometry as well as in the local solutions. The errors can be estimated from the approximate solutions. If the errors are large, then the finite element model is refined by improving the geometrical approximation and/or the representation of the local solutions. The new model is re-analyzed until the estimated errors fall below the specified limits. This method is called the adaptive finite element method. A recent review of the methods for error estimation and the adaptive refinement processes is given by Li and Bettess (1997). The topics of fluid mechanics, magnetohydrodynamics, etc. (topics other than solid mechanics), the boundary element methods, and adaptive finite element methods are beyond the scope of this book. Interested readers are referred to the published literature.

There are many publications on the finite element method in journals, proceedings and books. An up-to-date source of references is the journal Applied Mechanics Reviews, which periodically publishes review articles in different areas of the finite element method. For example, in its October 1998 issue, Mackerle gave a bibliographical review of the finite element method applied to biomechanics published from 1987 through 1997.

Numerous finite-element computer programs are commercially available for specific or general applications. To list a few, they include PINTO (written by the present junior author and not commonly available), DYNA3D (Hallquist 1992, Hallquist et al. 1993), ANSYS, ABAQUS, ADINA, STRUDL and NASTRAN.

The finite element method has been extensively applied to complicated problems. The description of the problem itself can be very involved and prone to errors, e.g., the detailed geometry of the brain, a nuclear reactor, the complete structure of an aircraft. Results of analysis are usually voluminous. A new field has evolved specifically to ease the burden of preparing the complex input information and processing the voluminous output. This includes techniques for automatic mesh generation, contour plots, and graphics for input and output. Commercially available software in the area includes INGRID (Stillman 1993), PATRAN, HYPERMESH, etc. All software can be run interactively or in batch mode. Detailed information on the commercially available software can be found on their web sites on the internet.

To lay down the foundation for all aspects of the finite element method 
with such extensive applications is of course not easy; the learning of it will 
demand great attention. But this is the task we set for ourselves for this 
chapter. In the subsequent discussion, we will use bold letters to denote 
matrices, unless specified otherwise. 

18.1. BASIC APPROACH 

Turner et al. (1956) in their celebrated paper first applied the displacement method to plane stress problems. They divided a structure into triangular and rectangular subdomains called “elements” and designated the vertices of the elements as nodes. The behavior of each element was represented by the nodal displacements and an element-stiffness matrix that related the forces at the nodal points of the element to the nodal displacements. They assembled the element-stiffness matrices to form a system of algebraic equations and solved them using high-speed computers. There are two key aspects in the finite element method. One is the determination of the element-stiffness matrices. The other is the assembly of element-stiffness matrices to form a system of algebraic equations, called the global system, for the selected unknowns. The former involves the local approximation for establishing the relations between element nodal forces and nodal displacements. The latter enforces the appropriate relations between the nodal quantities of adjacent elements at the inter-element boundaries.
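The two aspects named above can be illustrated with the simplest possible case, a uniform bar divided into two-node axial elements, fixed at one end and loaded at the other. This is a minimal sketch, not the procedure of Turner et al.; the function names, the dense solver and all numerical values are hypothetical choices made for the example.

```python
def element_stiffness(EA, length):
    """Stiffness matrix of a two-node axial bar element."""
    k = EA / length
    return [[k, -k], [-k, k]]


def assemble(n_nodes, elements):
    """Add each element matrix into the global matrix at its node numbers."""
    K = [[0.0] * n_nodes for _ in range(n_nodes)]
    for node_i, node_j, ke in elements:
        dofs = (node_i, node_j)
        for a in range(2):
            for b in range(2):
                K[dofs[a]][dofs[b]] += ke[a][b]
    return K


def solve(A, b):
    """Gaussian elimination with partial pivoting for a small dense system."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        x[r] = (M[r][n] - sum(M[r][c] * x[c] for c in range(r + 1, n))) / M[r][r]
    return x


# Three equal elements, node 0 fixed, axial load P at the last node.
EA, total_length, n_el, P = 2.0, 3.0, 3, 0.5
ke = element_stiffness(EA, total_length / n_el)
elements = [(e, e + 1, ke) for e in range(n_el)]
K = assemble(n_el + 1, elements)

# Impose u_0 = 0 by deleting the first row and column, then solve.
K_free = [row[1:] for row in K[1:]]
loads = [0.0] * (n_el - 1) + [P]
u = [0.0] + solve(K_free, loads)
```

Sharing node numbers between neighboring elements during assembly is what enforces displacement continuity at the inter-element boundaries; for this problem the computed tip displacement reproduces the exact value $PL/EA$.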

Equally important, if not more so, in the finite element method is its implementation. The success of the finite element method lies largely in the development of efficient pre- and post-processors, and algorithms for solving large systems of equations. The pre-processor enables users to describe efficiently and in a relatively error-free manner a complex problem in terms of its geometry, configuration, material properties, loading conditions, etc., through the input. The post-processor handles the voluminous output, making it easily understandable through interactive graphics, tables, charts, and summaries. The pre- and post-processors also enable users to examine with ease any specific location of the domain or aspect of the solution prior to, during, or after the analysis. The need for efficient solution algorithms for large systems goes without saying. The finite element method often deals with hundreds of thousands, even millions, of equations. Even with the advent of computers today, efficient algorithms make it possible to obtain the solution, especially for nonlinear problems, at a reasonable cost and in a reasonable time. The concept of the finite element method dates back to 1943 when Courant used a triangular mesh to solve a two-dimensional Laplace equation, or earlier when mathematicians formulated differential calculus with piecewise smooth functions. But the method did not catch on until the availability of high-speed computers to solve the large number of algebraic equations and to handle the tedious input and the voluminous output.

In this book, due to scope limitation, we will not address the issues of 
pre- and post-processors. We will focus on the finite element formulation 
and will touch upon lightly the solution algorithms for large systems. We 
will discuss methods for determining element stiffness matrices, which are 
normally symmetric, and the process to assemble the element matrices into 
a global system of algebraic equations for numerical solution. 

Fig. 18.1:1. Two-dimensional elements in a continuum.

In the finite element method, the continuum is divided into a finite number
of sub-regions, $V_e$ in three-dimensional or $A_e$ in two-dimensional
domains, called elements (see Fig. 18.1:1). The boundary between two adjacent
elements, called the inter-element boundary, is a point for a
one-dimensional, a line or a curve for a two-dimensional, and a plane or a
curved surface for a three-dimensional continuum. One may visualize that two
adjacent elements are connected at a number of discrete points, called nodes,
on their common boundaries. The nodes are usually the vertices of an element,
the midpoint, the one-third and two-third points along an element boundary
line, the centroid or some specific points of the element boundary surfaces.
Points within an element may also be selected as nodes, which are called
interior nodes of the element. The finite element method approximates the
field variables within
an element by interpolating their values at the nodes by shape or interpo¬ 
lation functions. The interpolated functions approximate the behaviors of 
the element through an integral formulation , which will be defined later. 
The nodal values are parameters, called degrees-of-freedom or generalized 
coordinates , to be determined. These two names are used interchangeably 
throughout the remaining chapters. One may also include the nodal values 
of the spatial derivatives of the field variables for this purpose, especially 
when dealing with higher order differential equations. Each node and each 
element can have a different number of degrees-of-freedom. The element
behavior is characterized by these parameters (such as the nodal values of
the field variable) in the form of element matrices such as the element
stiffness matrix and the element applied force matrix. The local solution of
a field variable must satisfy certain continuity requirements within the
elements and along the common boundaries of adjacent elements. This is to
assure that the finite-element solution will converge to the exact solution
as the size of the elements reduces to zero or as the number of
degrees-of-freedom per element increases to infinity. One usually constructs
the element matrices from a functional associated with a variational
principle or
from an integrally weighted-average of the governing differential 
equations. Then, one derives the equations relating the nodal pa¬ 
rameters of all elements based on a stationary condition of the 
variational functional, or the weighted-average of the governing 
differential equations. This process is called an integral formulation. 

In the finite element method, the interpolation functions, the as¬ 
sembling process and the solution methods for a large system of 
equations are generic. They can be applied to almost any type of
problem. In the subsequent sections, we shall discuss the shape functions, 
the element matrices, the assembly of element matrices, and methods for 
solving large systems of algebraic equations. For efficient implementation 
of the finite element method, we employ a local system to sequentially label 
the quantities of each element and a global scheme to number the quantities 
of the entire system. The list identifying the global labels of the generalized 
coordinates of each element is called the assembly list whose utility will be 
discussed in detail in Sec. 18.4. 

In this chapter, we deal extensively with matrix operations. We shall
use bold letters, both lower and upper case, to denote matrices, unless
otherwise specified. Thus, $\mathbf{AB}$ denotes the product of two matrices
$\mathbf{A}$ and $\mathbf{B}$. If $\mathbf{A}$ and $\mathbf{B}$ are of the
order $m \times n$ and $n \times r$, respectively, the resulting matrix
$\mathbf{AB}$ is of the order $m \times r$. An $m \times 1$ matrix is a
column matrix of order $m$, whereas a $1 \times n$ matrix is a row matrix.
They are often called vectors in the literature. Since they do not follow
tensor transformation rules, in this book we will just call them matrices to
distinguish them from a vector, a tensor of rank 1. Bold Greek letters such
as $\boldsymbol{\alpha}$, $\boldsymbol{\beta}$, etc., and some bold lowercase
Latin letters are used to denote column or row matrices for generalized
coordinates. Specifically, we use the bold Greek letters
$\boldsymbol{\sigma}$ and $\boldsymbol{\varepsilon}$ to represent the stress
and strain column matrices (not tensors), which will be defined later. We
shall also use bold capital $\mathbf{T}$ to denote the traction column matrix
and subscripted capital $T_i$ to represent the traction components unless
otherwise specified.

In the next section, for clarity, we shall write out explicitly most of the 
components of matrices in one-dimensional problems. In later sections, we 
shall resort to matrix notations and operations to simplify the expressions 
of field variables and the derivation of equations. 

18.2. ONE-DIMENSIONAL PROBLEMS GOVERNED BY A 
SECOND ORDER DIFFERENTIAL EQUATION 

In this section, we shall discuss the construction of interpolation func¬ 
tions and element matrices for one-dimensional problems. In the process, 
we will point out features that are important to the numerical implementa¬ 
tion of the finite element process. The following notation is used to describe 
the order of continuity of interpolation functions. 

$C^p$ Interpolation Function ($p$th order continuity). A function is said
to have $C^p$ continuity if the derivatives of the function up to order $p$
are continuous. Thus, a $C^0$ function is continuous and a $C^1$ function not
only is continuous but also has continuous first derivatives.

Let u(X) be a function of the spatial coordinate X over 0 < X < 
L. The domain is divided into a finite number of small, non-overlapping 
segments, called elements (Fig. 18.2:1). The inter-element boundary of two 
adjacent elements is a point, designated as a node. Additional points within 
the element may also be selected as nodes. The behavior of an element, 
i.e., the representation of u(X) within the element, is approximated by the 
interpolation of selected parameters associated with u(X) at the nodes of 
the element. 

Fig. 18.2:1. Subdivision of a one-dimensional domain: (a) 2-node element;
(b) 3-node element.

We label each node of the whole domain with a unique number, called
the global nodal number. For instance, for the element over the region
$X_2 \le X \le X_3$ with nodes at $X = X_2, X_3$ (Fig. 18.2:1a), the
subscripts 2 and 3 are the global nodal labels denoting the 2nd and 3rd nodes
of the domain. Let the values of $u(X)$, a field variable, be $u_2, u_3$ at
$X_2, X_3$, respectively. Thus, subscripts 2 and 3 of the field variable
identify the nodal association of relevant quantities. The quantities
$u_2, u_3$, called the degrees-of-freedom or generalized coordinates, are
generally not known a priori. In other words, $u_2, u_3$ are the 2nd and 3rd
degrees-of-freedom of the problem, i.e., subscripts 2 and 3 also denote the
global labels of the degrees-of-freedom of the system. If there is only one
degree-of-freedom at each node, the global degree-of-freedom label and the
global nodal label are the same. If there is more than one field variable
and/or more than one parameter for the field variable at each node, the
global degree-of-freedom label at a node will be different from the global
nodal label of the node. Properly labeling all nodes of the whole domain
and/or all degrees-of-freedom is essential to the success of the finite
element analysis, because most practical problems involve a large number of
degrees-of-freedom.

Linear Interpolation ($C^0$ function). We seek to represent a function $u$
in an element between nodes $X_2$ and $X_3$ in terms of its values $u_2, u_3$
at the nodes (Fig. 18.2:1a). If $u$ is smooth and if $X_3 - X_2$ is much
smaller than the overall dimension, $u$ can be approximated by the first two
terms of the Taylor series as

(1)    $u(X) = a_1 + a_2 (X - X_2)$.

Expressing the $a$'s in terms of nodal values of $u$ gives

(2)    $u(X) = u_2\left(1 - \dfrac{X - X_2}{X_3 - X_2}\right) + u_3\,\dfrac{X - X_2}{X_3 - X_2}$.

Approximating $u$ in the form above for all elements yields a piecewise
linear representation for $u$ over the whole domain. This approximate
function is obviously $C^0$ continuous.

We shall write the approximate function $u$ in a form more suitable for
the finite element process. First we introduce a new coordinate $\xi$ defined
as

(3) ▲    $\xi = (2X - X_2 - X_3)/\epsilon$,

(4) ▲    $\epsilon = X_3 - X_2$,

where $\epsilon$ is the size of the element and $\xi$ is called the
normalized local coordinate, or natural coordinate, of the element. The
origin of $\xi$ is at the midpoint of the element and the range of $\xi$ is
between $\pm 1$. We write Eq. (2) as

(5) ▲    $u(\xi) = q_1 h_1(\xi) + q_2 h_2(\xi)$,

where

(6) ▲    $h_1(\xi) = (1 - \xi)/2, \quad h_2(\xi) = (1 + \xi)/2$,

(7) ▲    $u_2 = q_1, \quad u_3 = q_2$.

The $h$'s are called shape or interpolation functions satisfying the
conditions

(8)    $h_1(-1) = h_2(1) = 1, \quad h_1(1) = h_2(-1) = 0$.

The functions $h_i$ as defined are called Lagrange interpolation functions:
$h_i$ equals 1 at the $i$th node (a local label) of the element and is zero
at all other nodes. The $q$'s, like the $u_i$, have the simple physical
meaning of being the values of the field variable at the corresponding
nodes. They are called the local generalized coordinates, in contrast to the
$u_i$, whose subscripts are global labels and which are called the global
generalized coordinates. Here, the subscripts of the $h$'s and $q$'s refer to
the local label, as opposed to the global label for the $u$'s. In the present
case, the 1st and 2nd nodes (the local label) of the element are,
respectively, the 2nd and 3rd nodes (the global label) of the system.
Through Eq. (7), the expression in Eq. (5) represents a continuous function
over the whole domain.
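
As a small illustration of Eqs. (3) to (6), the following Python sketch maps a global coordinate to the natural coordinate and evaluates the linear interpolation. The function names and the numerical values of $X_2$, $X_3$, $u_2$, $u_3$ are hypothetical choices made only for this example.

```python
def natural_coordinate(X, X_left, X_right):
    """Eqs. (3)-(4): map global X in [X_left, X_right] to xi in [-1, 1]."""
    eps = X_right - X_left            # element size, Eq. (4)
    return (2.0 * X - X_left - X_right) / eps


def linear_shape_functions(xi):
    """Eq. (6): the two Lagrange shape functions h1, h2 of a 2-node element."""
    return (1.0 - xi) / 2.0, (1.0 + xi) / 2.0


def interpolate_linear(X, X_left, X_right, q1, q2):
    """Eq. (5): u = q1*h1(xi) + q2*h2(xi), with q1, q2 the local nodal values."""
    xi = natural_coordinate(X, X_left, X_right)
    h1, h2 = linear_shape_functions(xi)
    return q1 * h1 + q2 * h2


# Hypothetical element between X_2 = 1.0 and X_3 = 1.5 with u_2 = 4.0, u_3 = 6.0.
X2, X3, u2, u3 = 1.0, 1.5, 4.0, 6.0
X = 1.25                                           # element midpoint
u_mid = interpolate_linear(X, X2, X3, u2, u3)      # generic form, Eq. (5)
u_eq2 = u2 * (1 - (X - X2) / (X3 - X2)) + u3 * (X - X2) / (X3 - X2)  # Eq. (2)
```

Both expressions give the same value at any point of the element; only `natural_coordinate` needs to know where the element sits in the global domain.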

Equation (1) consists of the first two terms of the Taylor series expansion.
The error of approximation for a smooth function is therefore of the order
$O(\epsilon^2)$, which is small if the element size $\epsilon$ is small.
Equation (2) is a typical finite element representation for $u$ by linear
functions within the element, whereas Eq. (5) is a generic expression useful
in the numerical implementation of the finite element method. This last
expression has the following properties:

1. The range of the natural coordinate $\xi$ is always from $-1$ to 1 for
all elements. Equation (3) is the transformation between the global
coordinate (the coordinate for the entire system) and the natural
coordinate (the coordinate chosen to conveniently describe the
element). The use of a local coordinate is not essential here but is often
necessary for more complex problems.

2. When the interpolation function $h_i$ is expressed in terms of the
natural coordinate $\xi$, the geometric characteristics of an element,
represented by the element size $\epsilon$ here, do not appear at all.

3. We use $q_i$ to denote the value of $u$ at the $i$th node of the element.
The subscript $i$ (with $i = 1, 2$) is a label referring to the nodes of
the element and is thus a local nodal label.

The representation of a field variable in the generic form of Eq. (5) does
not depend on the specific location of the element, or the global labels
of the degrees-of-freedom of the element. The field variable in other
elements can be represented in exactly the same way, e.g., $u$ in the element
$X_n \le X \le X_{n+1}$ can be expressed in the same generic form. The
element specifics are reflected only through Eqs. (3), (4) and (7). Thus,
the generic expression allows us to represent a field variable in an element
without knowing the specifics of the variable itself. In other words, we can
approximate any continuous function in a small neighborhood in the form
of Eq. (5) with the interpolation functions of Eq. (6). This property forms
the basis of general and efficient algorithms for computing element
matrices. Equation (5) relates $u$ at different locations through the
subscripts in Eq. (7), which describes the relation between the
degrees-of-freedom $q$'s identified by the local nodal label and the nodal
values of $u$ identified by the global label. The array that relates the
local label of a degree-of-freedom to the global label of the same
degree-of-freedom for every element is called the assembly list, which is
needed for assembling the element matrices into a system of equations. We
shall further examine this point in the discussion of assembling.

Using the natural coordinate $\xi$ rather than the global spatial coordinate
$X$ can provide advantages other than just representing a function by the
generic shape functions. The details will be considered when we discuss
two-dimensional problems.

Quadratic Interpolation ($C^0$ function). One can improve the approximation
by shrinking the element, i.e., refining the subdivision, or by increasing
the order of the Taylor expansion. The latter approach means constructing
higher order elements in the finite element language. For example, a
representation of $u$ as

(9) ▲    $u = a_1 + a_2 (X - X_2) + a_3 (X - X_2)^2$

can be expressed in terms of the nodal values of $u$ at the two end nodes
and the midpoint node of the element (Fig. 18.2:1b), i.e.,

(10) ▲    $u(\xi) = q_1 h_1(\xi) + q_2 h_2(\xi) + q_3 h_3(\xi)$,

where

(11)    $h_1(\xi) = -\dfrac{\xi(1 - \xi)}{2}, \quad h_2(\xi) = \dfrac{\xi(1 + \xi)}{2}, \quad h_3(\xi) = (1 - \xi)(1 + \xi)$.

The interpolation functions $h$'s are zero at $\xi = \pm 1$ and 0 except for
$h_1(-1) = h_2(1) = h_3(0) = 1$. The functions are the Lagrange
interpolation. The midpoint node is within the element and is called the
interior node. Nodes 1 and 2 are at the two ends and are boundary nodes. The
truncation error of Eq. (9) is of the order $O(\epsilon^3)$, which represents
an improvement in the approximation over that of Eq. (1). Again, the generic
representation in Eq. (10) does not depend on element specifics. The
relationship between the local and global degrees-of-freedom is based on the
requirement that

(12)    $u_2 = q_1, \quad u_3 = q_2, \quad u_{N+3} = q_3$.

The subscripts 2 and 3 for $u$ are the global labels of the
degrees-of-freedom of the end nodes, which are labeled as the 1st and 2nd
nodes of the element. The subscript $N + 3$ is the global label of the
midpoint node at $X = (X_2 + X_3)/2$, Fig. 18.2:1(b), which is designated as
the 3rd node of the element. The representation Eq. (9) with the
interpolation function in the form of Eq. (11) can be used for any element.
Similarly, one can establish representations of higher order (Tong and
Rossettos 1977).

Problem 18.1. Derive the Lagrangian interpolation, which gives unity at
$\xi = \xi_i$ and equals zero at other $m$ points.

Element Matrices. How the $q$'s relate to other field quantities and how
the values of the $q$'s will be determined depend on the nature of the
problem and the governing differential equations or the variational
formulation for the problem. These are the roles of element matrices. We
shall discuss the construction of element matrices using the variational
functional of the element.

Consider a rod subjected to a distributed load $P(X)$ along its axis with
boundary conditions

(13)    $u = \bar{u}$  at $X = 0$ (rigid condition),

(14)    $EA(L)\,\dfrac{du}{dX} = \bar{T}$  at $X = L$ (natural condition).

The longitudinal deformation satisfies the second-order differential equation

(15)    $\dfrac{d}{dX}\left[EA(X)\,\dfrac{du}{dX}\right] - P(X) = 0$.

We subdivide the domain into elements and introduce the natural coordinate
$\xi = (2X - X_n - X_{n+1})/\epsilon$, $\epsilon = X_{n+1} - X_n$ for the
element $X_n \le X \le X_{n+1}$ (Fig. 18.2:1). We express the elastic modulus
$EA$ and the load $P$ in terms of the natural coordinate and denote them by
lower case symbols as

$ea(\xi) = EA(X), \quad p(\xi) = P(X)$.
Following the derivation of Sec. 10.1, the functional associated with the
principle of minimum potential energy is

(16)    $\Pi = \displaystyle\int_0^L \left[\frac{EA(X)}{2}\left(\frac{du}{dX}\right)^2 + P(X)\,u\right] dX - \bar{T}\,u(L) = \sum_{n=1}^{N} \Pi_n$,

where

$\Pi_n = \displaystyle\frac{\epsilon}{2}\int_{-1}^{1} \left[\frac{2\,ea(\xi)}{\epsilon^2}\frac{du}{d\xi}\frac{du}{d\xi} + p(\xi)\,u\right] d\xi - \alpha\,q_2\,\bar{T}$,

in which $\alpha$ is a constant with $\alpha = 0$ for $n < N$ and
$\alpha = 1$ for $n = N$, with node $N + 1$ at $X = L$. Thus $q_2$
[$= u(L)$] is the unknown parameter associated with the node at $X = L$ in
the last element. The admissibility condition requires that $u$ be $C^0$
continuous over $0 \le X \le L$ and satisfy the rigid boundary condition
Eq. (13) at $X = 0$ (see Sec. 10.6). The admissibility condition ensures that
the sum of the functionals of all elements equals the functional of the whole
domain, a feature of great importance to the finite element method. We call
$\Pi$ a functional and $u$ the field or field variable of the functional.

The variational principle requires that the first variation of the
functional with arbitrary variation $\delta u$ equal zero, i.e.,

(17)    $\delta\Pi = \displaystyle\int_0^L \left[EA(X)\,\frac{du}{dX}\frac{d\,\delta u}{dX} + P(X)\,\delta u\right] dX - \bar{T}\,\delta u(L) = \sum_{n=1}^{N} \delta\Pi_n = 0$,

where

(18)    $\delta\Pi_n = \displaystyle\frac{\epsilon}{2}\int_{-1}^{1} \left[\frac{4\,ea(\xi)}{\epsilon^2}\frac{du}{d\xi}\frac{d\,\delta u}{d\xi} + p(\xi)\,\delta u\right] d\xi - \alpha\,\delta q_2\,\bar{T}$.

In Eq. (18), $u$ and its variation $\delta u$ must be $C^0$ continuous for
$0 \le X \le L$, and $u = \bar{u}$, $\delta u = 0$ at $X = 0$ as required by
the admissibility condition. A function that, together with its variation,
satisfies these conditions is called an admissible function associated with
the functional. The determination of the solution based on the vanishing of
the first variation of a functional as given in Eq. (17) for arbitrary
admissible functions is called an integral formulation.
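
The passage from the global integral in Eq. (17) to the element integral in Eq. (18) is a change of variable. A short worked version, using only the definitions of $\xi$ and $\epsilon$ given above for the element between $X_n$ and $X_{n+1}$, may be helpful:

```latex
\begin{align*}
X &= \tfrac{1}{2}(X_n + X_{n+1}) + \tfrac{\epsilon}{2}\,\xi ,
\qquad dX = \tfrac{\epsilon}{2}\,d\xi ,
\qquad \frac{d}{dX} = \frac{2}{\epsilon}\,\frac{d}{d\xi} , \\[4pt]
\int_{X_n}^{X_{n+1}} EA(X)\,\frac{du}{dX}\,\frac{d\,\delta u}{dX}\,dX
&= \int_{-1}^{1} ea(\xi)\,\frac{4}{\epsilon^{2}}\,
   \frac{du}{d\xi}\,\frac{d\,\delta u}{d\xi}\;\frac{\epsilon}{2}\,d\xi
 = \frac{\epsilon}{2}\int_{-1}^{1} \frac{4\,ea(\xi)}{\epsilon^{2}}\,
   \frac{du}{d\xi}\,\frac{d\,\delta u}{d\xi}\,d\xi , \\[4pt]
\int_{X_n}^{X_{n+1}} P(X)\,\delta u\,dX
&= \frac{\epsilon}{2}\int_{-1}^{1} p(\xi)\,\delta u\,d\xi .
\end{align*}
```

Adding the two element integrals and the boundary term $-\alpha\,\delta q_2\,\bar{T}$ of the last element gives the element contribution $\delta\Pi_n$.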

The finite element method constructs an approximate solution based on
an integral formulation using a selected set of admissible functions. The
linear and the quadratic interpolation functions discussed before give
continuous $u$ in the whole region and are, therefore, admissible to the
present problem. The rigid condition is satisfied by requiring $u_1$
[$= u(0)$] to equal $\bar{u}$ and $\delta u_1 = 0$, which is equivalent to
imposing constraint(s) on the system of algebraic equations to be discussed
later. The boundary condition Eq. (14) at $X = L$ is accounted for by the
term $u(L)\bar{T}$ in $\Pi$ or $\alpha q_2 \bar{T}$ in $\Pi_n$ of the last
element. There is no restriction on $u$ and $\delta u$ at $X = L$ since the
condition is a natural boundary condition (see Sec. 10.6). We call the
boundary with prescribed natural boundary conditions a force boundary.

Substituting Eqs. (5) and (6) or Eqs. (10) and (11) into Eq. (18), and
using $u = \sum_j h_j(\xi)\,q_j$ and $\delta u = \sum_j h_j(\xi)\,\delta q_j$,
gives

(19)    $\delta\Pi_n = \displaystyle\sum_i \sum_j \delta q_i\, q_j\, \frac{2}{\epsilon}\int_{-1}^{1} ea(\xi)\,\frac{dh_i}{d\xi}\frac{dh_j}{d\xi}\,d\xi + \sum_i \delta q_i\, \frac{\epsilon}{2}\int_{-1}^{1} p(\xi)\,h_i(\xi)\,d\xi - \alpha\,\delta q_2\,\bar{T}$,

where the range of summation is from 1 to 2 for linear interpolation and
from 1 to 3 for quadratic interpolation. The rigid condition at $X = 0$ is
satisfied by requiring

(20)    $q_1 = \bar{u}, \quad \delta q_1 = 0$

in the first element, whose first node is at $X = 0$.

Equation (19) can be written in the matrix form

(21)    $\delta\Pi_n = \delta\mathbf{q}^T \mathbf{Q} = \delta\mathbf{q}^T \mathbf{k}\mathbf{q} - \delta\mathbf{q}^T \mathbf{f}$,

where

$\mathbf{q} = [q_1 \;\; q_2 \;\; \cdots]^T, \quad \mathbf{Q} = \mathbf{k}\mathbf{q} - \mathbf{f}$

are, respectively, the generalized element nodal displacement and force
matrices of the element, $\mathbf{k}$ is the element stiffness matrix and
$\mathbf{f}$ is the element external or applied force matrix from the
applied loads distributed within the element and on the element boundary.
The terminology of stiffness and force is borrowed from structural
mechanics. Note that $\mathbf{q}$, $\mathbf{Q}$, and $\mathbf{f}$ are column
matrices. In general, $\mathbf{q}$ and $\mathbf{Q}$ may not have the
dimension of displacement and force, respectively. The components of
$\mathbf{k}$ and $\mathbf{f}$ are

(22) ▲    $k_{ij} = \displaystyle\frac{2}{\epsilon}\int_{-1}^{1} ea(\xi)\,\frac{dh_i}{d\xi}\frac{dh_j}{d\xi}\,d\xi$,

(23) ▲    $f_i = \displaystyle -\frac{\epsilon}{2}\int_{-1}^{1} p(\xi)\,h_i(\xi)\,d\xi + \delta_{i2}\,\alpha\,\bar{T}$,

in which the range of $i, j$ is from one to the number of degrees-of-freedom
of the element and $\delta_{ij}$ is the Kronecker delta.

The element stiffness and external force matrices relate the element
nodal displacement $\mathbf{q}$ uniquely to an element nodal force
$\mathbf{Q}$. This unique relation is an essential requirement on the
interpolation functions: one may not assume the displacement $u$ to be a
constant within an element, because that leads to zero stiffness for the
element. However, it is legitimate for the solution $\mathbf{q}$ to have all
its components be of the same value.

We see from Eqs. (22) and (23) that the element matrices are independent
of the boundary conditions except for the element on the force boundary,
where the element applied force matrix involves the applied boundary
force. The element stiffness matrix $\mathbf{k}$ is symmetric and positive
semi-definite provided that $ea(\xi) > 0$. It has one zero eigenvalue
associated with the eigenvector $q_1 = q_2$, called the zero eigenvector.
Physically, the zero eigenvector(s) are associated with rigid body motion and
give zero strain energy. Thus, the rank of $\mathbf{k}$ equals the number of
$q$'s minus the number of zero eigenvectors, which is one in the present
case, if the shape functions can represent constant $du/dX$ exactly within
the element.


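Linear Element. For linear interpolation, there are only two
degrees-of-freedom per element

$\mathbf{q} = \begin{bmatrix} q_1 \\ q_2 \end{bmatrix}, \quad \delta\mathbf{q} = \begin{bmatrix} \delta q_1 \\ \delta q_2 \end{bmatrix}$.

If $ea$ and $p$ are or can be approximated as constant, the integration of
Eqs. (22) and (23) gives

(24)    $\mathbf{k} = \dfrac{ea}{\epsilon}\begin{bmatrix} 1 & -1 \\ -1 & 1 \end{bmatrix}$,

(25)    $\mathbf{f} = -\dfrac{p\,\epsilon}{2}\begin{bmatrix} 1 \\ 1 \end{bmatrix} + \alpha\begin{bmatrix} 0 \\ 1 \end{bmatrix}\bar{T}$.

In this case, the rank of $\mathbf{k}$ is 1. The finite element process
lumps half of the applied distributed load over the element to each node.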

Since all interpolation functions involved are linear in X, we call the 
element a linear element. 
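
The closed forms in Eqs. (24) and (25) can be reproduced by evaluating Eqs. (22) and (23) numerically. The following sketch uses a two-point Gauss-Legendre rule, which is an assumption of this example rather than a choice made in the text; `ea` and `p` are passed as functions of $\xi$, and all numerical values are hypothetical.

```python
import math

# Two-point Gauss-Legendre rule on [-1, 1]; exact for cubic integrands.
GAUSS_2 = [(-1.0 / math.sqrt(3.0), 1.0), (1.0 / math.sqrt(3.0), 1.0)]


def linear_element_matrices(eps, ea, p, alpha=0, T_bar=0.0):
    """Evaluate Eqs. (22)-(23) for the 2-node element by Gauss quadrature.

    eps   : element size
    ea, p : callables giving ea(xi) and p(xi)
    alpha : 1 for the element on the force boundary, 0 otherwise
    """
    dh = (-0.5, 0.5)                       # dh1/dxi, dh2/dxi from Eq. (6)
    k = [[0.0, 0.0], [0.0, 0.0]]
    f = [0.0, 0.0]
    for xi, w in GAUSS_2:
        h = ((1.0 - xi) / 2.0, (1.0 + xi) / 2.0)
        for i in range(2):
            f[i] -= 0.5 * eps * w * p(xi) * h[i]
            for j in range(2):
                k[i][j] += (2.0 / eps) * w * ea(xi) * dh[i] * dh[j]
    f[1] += alpha * T_bar                  # boundary force acts on local node 2
    return k, f


# Hypothetical data: eps = 0.5, constant ea = 10, constant p = 4, T_bar = 3.
k, f = linear_element_matrices(0.5, lambda xi: 10.0, lambda xi: 4.0,
                               alpha=1, T_bar=3.0)
# Eq. (24) gives (ea/eps) * [[1, -1], [-1, 1]] = 20 * [[1, -1], [-1, 1]];
# Eq. (25) gives -(p*eps/2) * [1, 1] + [0, T_bar] = [-1, 2].
```

Passing non-constant callables for `ea` and `p` uses the same routine, which is the practical appeal of numerical integration noted later in this section.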


Quadratic Element. We use higher order approximations for the field
variable to derive higher order elements. Using the quadratic interpolation
functions of Eq. (11) and linear approximation for $ea$ and $p$, i.e.,

$ea(\xi) = ea_0 + ea_1\,\epsilon\,\xi/2, \quad p(\xi) = p_0 + p_1\,\epsilon\,\xi/2$,

where $ea_0, ea_1, p_0, p_1$ are constants, we obtain from Eqs. (22) and (23)

(26)    $\mathbf{k} = \dfrac{ea_0}{3\epsilon}\begin{bmatrix} 7 & 1 & -8 \\ 1 & 7 & -8 \\ -8 & -8 & 16 \end{bmatrix} + \dfrac{2}{3}\,ea_1\begin{bmatrix} -1 & 0 & 1 \\ 0 & 1 & -1 \\ 1 & -1 & 0 \end{bmatrix}$,

(27)    $\mathbf{f} = -\dfrac{p_0\,\epsilon}{6}\begin{bmatrix} 1 \\ 1 \\ 4 \end{bmatrix} + \dfrac{p_1\,\epsilon^2}{12}\begin{bmatrix} 1 \\ -1 \\ 0 \end{bmatrix} + \alpha\begin{bmatrix} 0 \\ 1 \\ 0 \end{bmatrix}\bar{T}$.

Since the interpolation functions involve quadratic functions of $X$, we call
the element a quadratic element. If $p_1$ is zero, the finite element process
lumps 1/6 of the distributed load over the element to each of the end nodes
and 2/3 to the midpoint node. There are three degrees-of-freedom for the
element. The rank of $\mathbf{k}$ is two if $ea_0$ and $ea_1$ are both
positive. The rank remains two for a limited range of negative $ea_1$. We
leave it to the reader to show that a negative $ea_1$ outside this range is
unphysical.

One can similarly derive elements of even higher orders (Tong and
Rossettos 1977). Analytical integration of Eqs. (22) and (23) is not
essential because we can always evaluate the integrals numerically. In
certain practical applications to be discussed later, one sometimes
purposely uses a lower order approximation in integration to reduce the
rigidity of the stiffness matrix. Also, in practice, $ea$ or $p$ may be
available only in numerical form, so that Eqs. (22) and (23) can only be
integrated numerically. The formation of the system of algebraic equations
for the whole domain from the element matrices and the solution of the
system of equations will be discussed in later sections.
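
A minimal sketch of such numerical integration for the quadratic element follows. It assumes Gauss-Legendre quadrature (the text does not prescribe a rule), omits the force-boundary term, and compares the result with closed forms obtained by integrating Eqs. (22) and (23) analytically for linear $ea$ and $p$. The helper names and numerical values are hypothetical. The last lines illustrate, for this element only, how a one-point rule lowers the rank of $\mathbf{k}$.

```python
import math


def gauss_rule(n):
    """Gauss-Legendre points and weights on [-1, 1] for n = 1, 2, 3."""
    if n == 1:
        return [(0.0, 2.0)]
    if n == 2:
        g = 1.0 / math.sqrt(3.0)
        return [(-g, 1.0), (g, 1.0)]
    if n == 3:
        g = math.sqrt(0.6)
        return [(-g, 5.0 / 9.0), (0.0, 8.0 / 9.0), (g, 5.0 / 9.0)]
    raise ValueError("only 1-, 2- and 3-point rules are tabulated here")


def shape(xi):
    """Quadratic shape functions, Eq. (11); local node 3 is the midpoint."""
    return (-xi * (1 - xi) / 2, xi * (1 + xi) / 2, (1 - xi) * (1 + xi))


def dshape(xi):
    """Derivatives dh_i/dxi of the functions above."""
    return (xi - 0.5, xi + 0.5, -2.0 * xi)


def quadratic_element(eps, ea0, ea1, p0, p1, n_gauss=3):
    """Integrate Eqs. (22)-(23) numerically (force-boundary term omitted)."""
    k = [[0.0] * 3 for _ in range(3)]
    f = [0.0] * 3
    for xi, w in gauss_rule(n_gauss):
        ea = ea0 + ea1 * eps * xi / 2
        p = p0 + p1 * eps * xi / 2
        h, dh = shape(xi), dshape(xi)
        for i in range(3):
            f[i] -= 0.5 * eps * w * p * h[i]
            for j in range(3):
                k[i][j] += (2.0 / eps) * w * ea * dh[i] * dh[j]
    return k, f


def closed_form(eps, ea0, ea1, p0, p1):
    """Analytical integration of Eqs. (22)-(23) for linear ea and p."""
    A = [[7, 1, -8], [1, 7, -8], [-8, -8, 16]]
    B = [[-1, 0, 1], [0, 1, -1], [1, -1, 0]]
    k = [[ea0 / (3 * eps) * A[i][j] + 2 * ea1 / 3 * B[i][j]
          for j in range(3)] for i in range(3)]
    f = [-p0 * eps / 6 * c + p1 * eps ** 2 / 12 * d
         for c, d in zip((1, 1, 4), (1, -1, 0))]
    return k, f


def rank(matrix, tol=1e-9):
    """Rank by Gaussian elimination with partial pivoting."""
    m = [row[:] for row in matrix]
    n_rows, n_cols = len(m), len(m[0])
    r = 0
    for c in range(n_cols):
        if r == n_rows:
            break
        pivot = max(range(r, n_rows), key=lambda i: abs(m[i][c]))
        if abs(m[pivot][c]) < tol:
            continue
        m[r], m[pivot] = m[pivot], m[r]
        for i in range(r + 1, n_rows):
            factor = m[i][c] / m[r][c]
            for j in range(c, n_cols):
                m[i][j] -= factor * m[r][j]
        r += 1
    return r


data = (2.0, 3.0, 0.5, 1.0, 0.25)          # eps, ea0, ea1, p0, p1 (hypothetical)
k_num, f_num = quadratic_element(*data)     # 3-point rule, exact here
k_ref, f_ref = closed_form(*data)
k_reduced, _ = quadratic_element(*data, n_gauss=1)   # deliberately under-integrated
```

With the three-point rule the integrands (at most cubic in $\xi$) are integrated exactly, so `k_num` and `f_num` agree with the closed forms to rounding error; `rank(k_num)` is 2, while `rank(k_reduced)` drops to 1.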


Problem 18.2. Find the condition that the rank of k in Eq. (18.2:26) is 
one and explain why such a condition is unphysical. 


18.3. SHAPE FUNCTIONS AND ELEMENT MATRICES FOR 
HIGHER ORDER ORDINARY DIFFERENTIAL 
EQUATIONS 


The interpolation functions and element matrices derived up to this
point are not suitable for problems governed by differential equations of
order higher than 2. Consider the bending of a beam

(1)    $\dfrac{d^2}{dX^2}\left[EI(X)\,\dfrac{d^2u}{dX^2}\right] - P(X) = 0$

over $0 \le X \le L$, with boundary conditions

(2)    $u = \bar{u}, \quad du/dX = \bar{u}'$  at $X = 0$ (rigid condition),

(3)    $EI\,\dfrac{d^2u}{dX^2} = \bar{M}, \quad \dfrac{d}{dX}\left[EI\,\dfrac{d^2u}{dX^2}\right] = \bar{Q}$  at $X = L$ (natural condition),

where the overbar denotes that the concerned quantity is prescribed. The
functional for the principle of minimum potential energy of beams is given in
Eq. (10.8:3). The first variation of the potential energy of an element in
terms of the local coordinate is

(4)    $\delta\Pi_n = \displaystyle\frac{\epsilon}{2}\int_{-1}^{1}\left[\frac{16\,ei(\xi)}{\epsilon^4}\frac{d^2u}{d\xi^2}\frac{d^2\delta u}{d\xi^2} - p(\xi)\,\delta u\right] d\xi - \alpha\left(\frac{2\bar{M}}{\epsilon}\frac{d\,\delta u}{d\xi} - \bar{Q}\,\delta u\right)_{\xi=1}$,

where the lower case $ei(\xi)$ [$= EI(X)$] is used to denote the bending
rigidity expressed in terms of the natural coordinate $\xi$ and $\alpha$ is a
constant: $\alpha = 0$ for all elements except the one containing the node
at $X = L$, for which $\alpha = 1$. The admissibility condition requires
that $u$ be $C^1$ continuous, i.e., $u$ and $du/dX$ of adjacent elements are
equal at the common node, and that $u$ satisfy the rigid boundary conditions
Eq. (2) at $X = 0$. The boundary conditions at $X = L$ are natural conditions
associated with the applied boundary moment and shear force. The boundary
point at $X = L$ is said to be a force boundary.

Obviously, $u$ is not $C^1$ continuous over the whole domain
$0 \le X \le L$ if $u$ is defined as in Eq. (18.2:5) or (18.2:10) with the
interpolation functions given in Eq. (18.2:6) or (18.2:11) and $q$'s
satisfying Eq. (18.2:7) or (18.2:12), respectively. We need to construct new
interpolation functions to make $u$ meet the $C^1$ requirement. Let us define
$u_{2n-1} = u(X_n)$ and $u_{2n} = du(X_n)/dX$ at the node $X = X_n$ as
unknown parameters for the field variable and call them the generalized
coordinates or degrees-of-freedom. Now there are two degrees-of-freedom at
each node and they have different physical dimensions. The interpolation
functions can be constructed as follows. First we assume $u$ in the form

$u = a_1 + a_2 (X - X_n) + a_3 (X - X_n)^2 + a_4 (X - X_n)^3$

or, in terms of the local coordinate $\xi$,

(5)    $u = q_1 h_1(\xi) + q_2 h_2(\xi) + q_3 h_3(\xi) + q_4 h_4(\xi)$

for the element between $X_n$ and $X_{n+1}$, where $\xi$ has a range from
$-1$ to 1. The interpolation functions $h$'s have the properties that $h_i$
and $dh_i/d\xi$ are zero at $\xi = -1$ and 1 except for

$h_1(-1) = h_3(1) = 1, \quad \dfrac{dh_2(-1)}{d\xi} = \dfrac{dh_4(1)}{d\xi} = \dfrac{\epsilon}{2}$.

An interpolation function that satisfies these conditions is known as the
Hermite interpolation. One can show that

(6) ▲    $h_1(\xi) = (2 + \xi)(\xi - 1)^2/4, \quad h_2(\xi) = \epsilon(\xi + 1)(\xi - 1)^2/8$,

         $h_3(\xi) = (2 - \xi)(\xi + 1)^2/4, \quad h_4(\xi) = \epsilon(\xi - 1)(\xi + 1)^2/8$.

In this case, $q_1, q_3$ are the nodal values of $u$ and $q_2, q_4$ are the
values of $du/dX$ at $\xi = -1$ and 1, respectively. If we require

(7)    $u_{2n-1} = u(X_n) = q_1, \quad u_{2n} = \dfrac{du(X_n)}{dX} = q_2$,

       $u_{2n+1} = u(X_{n+1}) = q_3, \quad u_{2n+2} = \dfrac{du(X_{n+1})}{dX} = q_4$,

then $u$ in the form of Eq. (5) is $C^1$ continuous over $0 \le X \le L$.
A substitution of Eq. (5) into Eq. (4) yields

$\delta\Pi_n = \delta\mathbf{q}^T(\mathbf{k}\mathbf{q} - \mathbf{f})$

for the element, where the components of $\mathbf{k}$ and $\mathbf{f}$ are

(8)    $k_{ij} = \displaystyle\frac{8}{\epsilon^3}\int_{-1}^{1} ei(\xi)\,\frac{d^2h_i}{d\xi^2}\frac{d^2h_j}{d\xi^2}\,d\xi$,

(9)    $f_i = \displaystyle\frac{\epsilon}{2}\int_{-1}^{1} p(\xi)\,h_i(\xi)\,d\xi + \alpha\left(\delta_{i4}\,\bar{M} - \delta_{i3}\,\bar{Q}\right)$,

with $\alpha$ and $\delta$'s defined as before. The element stiffness matrix
$\mathbf{k}$ is symmetric and positive semidefinite provided that
$ei(\xi) > 0$. It has two zero eigenvectors associated with the rigid body
translation and rotation. Thus, if Eq. (8) is integrated exactly, the rank of
$\mathbf{k}$ equals the number of $q$'s minus 2. One often integrates Eq. (8)
approximately by numerical means. The approximation may lower the rank of
$\mathbf{k}$ and introduce spurious deformation mode(s), i.e., nonrigid body
motion with no contribution to energy.

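If $ei$ and $p$ are treated as constant within the element, $\mathbf{k}$ and
$\mathbf{f}$ can be written as

(10)    $\mathbf{k} = \dfrac{ei}{\epsilon^3}\begin{bmatrix} 12 & 6\epsilon & -12 & 6\epsilon \\ 6\epsilon & 4\epsilon^2 & -6\epsilon & 2\epsilon^2 \\ -12 & -6\epsilon & 12 & -6\epsilon \\ 6\epsilon & 2\epsilon^2 & -6\epsilon & 4\epsilon^2 \end{bmatrix}$,

(11)    $\mathbf{f} = \dfrac{p\,\epsilon}{12}\begin{bmatrix} 6 \\ \epsilon \\ 6 \\ -\epsilon \end{bmatrix} + \alpha\begin{bmatrix} 0 \\ 0 \\ -\bar{Q} \\ \bar{M} \end{bmatrix}$.

Each element involves 4 degrees-of-freedom and each node has two. For an
element with its 1st and 2nd nodes being the $I_1$th and $I_2$th global
nodes, the relationship between the global and the local labels is given in
Table 18.3:1 as an array. The arrays for all elements, denoted by I, are
called the assembly list; each array of the list specifies the global labels
of an element. The list is needed in the assembly process to form
$\mathbf{K}$ and $\mathbf{F}$.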


Table 18.3:1. Global and local degree-of-freedom and nodal labels.

Global degree    Global nodal    Local degree    Local nodal
label            label           label           label

$2I_1 - 1$       $I_1$           1               1
$2I_1$           $I_1$           2               1
$2I_2 - 1$       $I_2$           3               2
$2I_2$           $I_2$           4               2
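
The labeling rule of Table 18.3:1 can be written as a short routine. The sketch below assumes 1-based global node labels as in the table and a hypothetical mesh of three beam elements joining consecutive nodes; the function name is illustrative only.

```python
def beam_assembly_array(I1, I2):
    """Global degree-of-freedom labels of a 2-node beam element.

    I1, I2 are the (1-based) global labels of the element's 1st and 2nd
    nodes. Entry i of the result is the global label of local
    degree-of-freedom i + 1, following Table 18.3:1.
    """
    return [2 * I1 - 1, 2 * I1, 2 * I2 - 1, 2 * I2]


# Hypothetical mesh: element n joins global nodes n and n + 1, n = 1, 2, 3.
assembly_list = [beam_assembly_array(n, n + 1) for n in range(1, 4)]
```

Adjacent elements share the two labels of their common node, which is how the displacement and slope are made continuous across the inter-element boundary.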


From Eqs. (10) and (18.2:24), we see that $\mathbf{k}$ is independent of
loads and boundary conditions. Equations (11) and (18.2:25) show that
interior loads contribute to the element applied force matrix $\mathbf{f}$.
The boundary loads contribute to $\mathbf{f}$ of the elements on the force
boundary only.
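
As a numerical cross-check of the beam element, the sketch below evaluates Eqs. (8) and (9) with the Hermite functions of Eq. (6) for constant $ei$ and $p$, using a three-point Gauss-Legendre rule (an assumption of this example). It then applies the resulting $\mathbf{k}$ to a rigid translation and a rigid rotation. All numerical values and helper names are hypothetical.

```python
import math


def hermite(xi, eps):
    """Hermite shape functions h1..h4 of Eq. (6)."""
    return (
        (2 + xi) * (xi - 1) ** 2 / 4,
        eps * (xi + 1) * (xi - 1) ** 2 / 8,
        (2 - xi) * (xi + 1) ** 2 / 4,
        eps * (xi - 1) * (xi + 1) ** 2 / 8,
    )


def hermite_d2(xi, eps):
    """Second derivatives d2h_i/dxi2 of the functions above."""
    return (1.5 * xi, eps * (3 * xi - 1) / 4, -1.5 * xi, eps * (3 * xi + 1) / 4)


def beam_element(eps, ei, p, alpha=0, M_bar=0.0, Q_bar=0.0):
    """Eqs. (8)-(9) for constant ei and p, by 3-point Gauss quadrature."""
    g = math.sqrt(0.6)
    rule = [(-g, 5.0 / 9.0), (0.0, 8.0 / 9.0), (g, 5.0 / 9.0)]
    k = [[0.0] * 4 for _ in range(4)]
    f = [0.0] * 4
    for xi, w in rule:
        h, d2h = hermite(xi, eps), hermite_d2(xi, eps)
        for i in range(4):
            f[i] += 0.5 * eps * w * p * h[i]
            for j in range(4):
                k[i][j] += (8.0 / eps ** 3) * w * ei * d2h[i] * d2h[j]
    f[2] -= alpha * Q_bar      # shear force on local degree-of-freedom 3
    f[3] += alpha * M_bar      # moment on local degree-of-freedom 4
    return k, f


def matvec(k, q):
    """Product k q for a square list-of-lists matrix."""
    return [sum(kij * qj for kij, qj in zip(row, q)) for row in k]


eps, ei, p = 2.0, 3.0, 1.2                     # hypothetical element data
k, f = beam_element(eps, ei, p)

# Closed forms for constant ei and p, cf. Eqs. (10) and (11) with alpha = 0.
c = ei / eps ** 3
k_ref = [[12 * c, 6 * eps * c, -12 * c, 6 * eps * c],
         [6 * eps * c, 4 * eps ** 2 * c, -6 * eps * c, 2 * eps ** 2 * c],
         [-12 * c, -6 * eps * c, 12 * c, -6 * eps * c],
         [6 * eps * c, 2 * eps ** 2 * c, -6 * eps * c, 4 * eps ** 2 * c]]
f_ref = [p * eps / 12 * v for v in (6, eps, 6, -eps)]

# Rigid translation (u = 1) and rigid rotation (u = X - X_n) in terms of q.
Q_translation = matvec(k, [1.0, 0.0, 1.0, 0.0])
Q_rotation = matvec(k, [0.0, 1.0, eps, 1.0])
```

Both `Q_translation` and `Q_rotation` vanish to rounding error, consistent with the two zero eigenvectors mentioned after Eq. (9).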

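One can use Eqs. (8) and (9) to evaluate element matrices for higher
order elements provided higher order shape functions are used. For an
element with a total of $m$ nodes at $\xi_j$, $j = 1, \ldots, m$, the
interpolated function is in the form

$u = \displaystyle\sum_{i=1}^{2m} q_i\,h_i(\xi)$,

where

$q_{2j-1} = u(\xi_j), \quad q_{2j} = \dfrac{du(\xi_j)}{dX}$,

(12)    $h_{2j-1} = \left[1 - \dfrac{\psi_j'(\xi_j)\,(\xi - \xi_j)}{\psi_j(\xi_j)}\right]\dfrac{\psi_j(\xi)}{\psi_j(\xi_j)}, \quad h_{2j} = \dfrac{\epsilon\,(\xi - \xi_j)\,\psi_j(\xi)}{2\,\psi_j(\xi_j)}$,

in which $j$'s are not summed and

$\psi_j(\xi) = \displaystyle\prod_{i=1,\; i \ne j}^{m} (\xi - \xi_i)^2$.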

Obviously, the interpolation functions given in Eq. (6) or (12) are
applicable to a differential equation such as

(13)    $\dfrac{d^2}{dX^2}\left[EI(X)\,\dfrac{d^2u}{dX^2}\right] - \dfrac{d}{dX}\left[N(X)\,\dfrac{du}{dX}\right] + K(X)\,u - P(X) = 0$

with the first variation of the potential energy in the form

$\delta\Pi_n = \displaystyle\frac{\epsilon}{2}\int_{-1}^{1}\left[\frac{16\,ei(\xi)}{\epsilon^4}\frac{d^2u}{d\xi^2}\frac{d^2\delta u}{d\xi^2} + \frac{4\,n(\xi)}{\epsilon^2}\frac{du}{d\xi}\frac{d\,\delta u}{d\xi} + k(\xi)\,u\,\delta u - p(\xi)\,\delta u\right] d\xi + B$,

where $ei(\xi) = EI(X)$, $n(\xi) = N(X)$, $k(\xi) = K(X)$, $p(\xi) = P(X)$
within the element, and $B$ involves terms related to natural boundary
conditions if the element has a force boundary. The element matrices can be
computed in a similar fashion and assembled into a system of simultaneous
equations for numerical solution by computers. The finite element process
reduces the task of solving differential equations to numerically solving
algebraic equations. The complexity of constructing analytical solutions is
replaced by routine integration, which can be performed numerically, in
evaluating the element matrices.

Problem 18.3. Show that the rank of $\mathbf{k}$ given in Eq. (18.3:10) is
two.

Problem 18.4. Using the interpolation functions defined in Eqs. (18.2:6)
and (18.3:6), determine the matrices associated with
$k(\xi)\,u\,\delta u\,d\xi$ for $k = k_0 + k_1\,\epsilon\,\xi/2$.

Problem 18.5. Consider the bending of a beam as defined in Eq. (18.3:1)
with prescribed boundary conditions

$u = \bar{u}, \quad du/dX = \bar{u}'$  at $X = L$.

(a) Show that the first variation of the potential energy of an element is

$\delta\Pi_n = \displaystyle\frac{\epsilon}{2}\int_{-1}^{1}\left[\frac{16\,ei(\xi)}{\epsilon^4}\frac{d^2u}{d\xi^2}\frac{d^2\delta u}{d\xi^2} - p(\xi)\,\delta u\right] d\xi - \alpha\left(\bar{Q}\,\delta u - \frac{2\bar{M}}{\epsilon}\frac{d\,\delta u}{d\xi}\right)_{\xi=-1\,(X=0)}$,

where $\alpha = 0$ for all elements except the one containing the node
$X = 0$, for which $\alpha = 1$. The equation above has the same form as
Eq. (18.3:4) except for the last term, which has a sign difference. This
shows clearly that the element stiffness matrix does not depend on the
boundary conditions.

(b) Explain the reason for the sign difference.

(c) Derive the element force matrix for the element with the node at $X = 0$.

Problem 18.6. Repeat the previous problem with prescribed boundary
conditions 


mx)^% = m 



at X = 0 , 
at X = L. 


18.4. ASSEMBLING AND CONSTRAINING GLOBAL 
MATRICES 

We need to determine matrices $\mathbf{K}$ and $\mathbf{F}$ to solve for the
unknown $q$'s. It is $\mathbf{K}$ and $\mathbf{F}$ that relate the nodal
generalized coordinates of one node to those of all other nodes. The finite
element method divides the solution process into three steps:

1. Assemble the element matrices into the global matrices based on
Eq. (18.2:17) with $\delta\Pi_n$ in the form of Eq. (18.2:21). $\delta\Pi_n$
is given in Eq. (18.2:19) for rods and Eq. (18.3:4) for beams.

2. Impose rigid constraints on the assembled matrices, i.e., enforce the
rigid condition such as Eq. (18.2:20) or (18.3:2).

3. Solve the system of algebraic equations by computers.

The process of assembling element matrices to form K and F can be
symbolized as

$$\mathbf{K} = \sum_m \mathbf{k}_m\,, \qquad \mathbf{F} = \sum_m \mathbf{f}_m\,,$$

where the subscript m denotes the m-th element. Assembling is actually a
numerical summing process to carry out the summation of Eq. (18.2:17)
for all elements. Starting with null matrices K and F, we add to them the
contributions from k and f of each element. When the last element has
been added, the global K and F are formed. We have implicitly assumed
that all elements have the same definition of generalized coordinates at
their common node(s) to allow the direct addition of the components of k
and f from different elements. If the generalized coordinates of an element
are different from those of the global system, a transformation of k and
f is necessary before addition. Such a transformation will be discussed in a
later section.

Let the components of K and F be $K_{IJ}$ and $F_I$, respectively, and
those of $\mathbf{k}_m$ and $\mathbf{f}_m$ of the m-th element be $(k_{ij})_m$ and $(f_i)_m$, where I and J
denote global degree-of-freedom labels and i and j denote local labels. The
key in assembling is to know the global degree-of-freedom labels, I and J,
of the i-th and j-th degrees-of-freedom of the m-th element being assembled.
This information is contained in the assembly list. Then, we simply add
$(k_{ij})_m$ and $(f_i)_m$ to the corresponding $K_{IJ}$ and $F_I$, respectively.

For example, for the linear element considered in Sec. 18.2, the relation
between the global and local degree-of-freedom labels is described in
Eq. (18.2:7), i.e., the global degrees-of-freedom 2 and 3 are, respectively, the
1st and the 2nd degrees-of-freedom of the element. Therefore, assembling
the element is simply to

    add $k_{11}$ to $K_{22}$,    add $k_{12}$ to $K_{23}$,
    add $k_{21}$ to $K_{32}$,    add $k_{22}$ to $K_{33}$,
    add $f_1$ to $F_2$,          add $f_2$ to $F_3$.

Another example is the quadratic element shown in Fig. 18.2:1. The relation
between the global and local labels of the element of Eq. (18.2:12) is
as follows:

    Global label    Local label
         2               1
         3               2
       N + 3             3

where N is a known integer depending on how the global nodes are numbered.
Assembling the element is simply to

    add $k_{11}$ to $K_{22}$,
    add $k_{21}$ to $K_{32}$,      add $k_{22}$ to $K_{33}$,
    add $k_{31}$ to $K_{N+3,2}$,   add $k_{32}$ to $K_{N+3,3}$,   add $k_{33}$ to $K_{N+3,N+3}$,
    add $f_1$ to $F_2$,            add $f_2$ to $F_3$,            add $f_3$ to $F_{N+3}$.

Only the assembling of the lower triangle of k is illustrated above because
k and K are symmetric.

In general, the assembly procedure can be described as follows: Let $I_r$
be the global degree-of-freedom label of the r-th degree-of-freedom of the
element being assembled and p be the number of degrees-of-freedom of the
element. Then the assembly of the element is simply

(1)    add $k_{rs}$ to $K_{I_r I_s}$,  add $f_r$ to $F_{I_r}$,  for r, s = 1, 2, ..., p.

The rest of the components of K and F do not change in assembling this
element. The array I with components $I_r$, r = 1, 2, ..., p, represents the
transformation from the local degree label r to the global degree label $I_r$
for the element being assembled.

In many practical problems, k and K are symmetric, and we therefore 
only have to assemble the lower (or the upper) triangle of K using the lower 
(or upper) triangle of k. In this case the process will have to be modified 
slightly:

(2)    add $k_{rs}$ to $K_{MN}$,  add $f_r$ to $F_{I_r}$,  for r = 1, 2, ..., p; s = 1, ..., r,

where

    M = Max($I_r$, $I_s$),    N = Min($I_r$, $I_s$).

After we proceed through all the elements, we obtain the final global
matrices K and F.

Fig. 18.4:1. Local and global labels for a three-finite-element division of a domain; (c) and
(d) are two alternate global label schemes.




Consider the problem described by Eqs. (18.2:13)-(18.2:15) with a
three-element division. The global labels of the generalized coordinates of the
label scheme are shown in Fig. 18.4:1(a). The assembly list, i.e., the global
labels of the generalized coordinates of all elements, is given in Table 18.4:1
below.


Table 18.4:1. Assembly list.

    Element 1    1    2    -
    Element 2    2    3    -
    Element 3    3    5    4


Note that the assembly list is a two-dimensional array. It is important to
note that for element 3 the global label of its 2nd degree-of-freedom is 5 and
that of its 3rd degree-of-freedom [the one associated with the interior node
of the element as defined in Eq. (18.2:10)] is 4. This information is essential
for the computer to assemble the element matrices correctly. The first two
elements have two degrees-of-freedom each with element matrices given in
Eqs. (18.2:24) and (18.2:25). Element 3 has three degrees-of-freedom and
its element matrices are given in Eqs. (18.2:26) and (18.2:27).
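
The assembly rules (1) and (2), applied to the assembly list of Table 18.4:1, can be sketched as below. This is only an illustration under stated assumptions: the element matrices of Eqs. (18.2:24)-(18.2:27) are not reproduced here, so the sketch uses the standard uniform-rod forms with EA = 1, ε = 1, p = 1 in every element, and the function names `assemble` and `assemble_lower` are hypothetical. The boundary term T that enters $F_5$ is left out.

```python
import numpy as np

def assemble(n_dof, elements):
    """Rule (1): add k_rs to K[I_r, I_s] and f_r to F[I_r] for all r, s."""
    K = np.zeros((n_dof, n_dof))
    F = np.zeros(n_dof)
    for k, f, labels in elements:
        idx = [I - 1 for I in labels]        # 1-based labels -> 0-based indices
        for r, Ir in enumerate(idx):
            F[Ir] += f[r]
            for s, Is in enumerate(idx):
                K[Ir, Is] += k[r, s]
    return K, F

def assemble_lower(n_dof, elements):
    """Rule (2): use only the lower triangle of each symmetric k."""
    K = np.zeros((n_dof, n_dof))
    F = np.zeros(n_dof)
    for k, f, labels in elements:
        idx = [I - 1 for I in labels]
        for r in range(len(idx)):
            F[idx[r]] += f[r]
            for s in range(r + 1):
                M, N = max(idx[r], idx[s]), min(idx[r], idx[s])
                K[M, N] += k[r, s]
    return K, F

# Assumed element data (not taken from the text): EA = 1, eps = 1, p = 1.
k_lin = np.array([[1.0, -1.0], [-1.0, 1.0]])
f_lin = -0.5 * np.ones(2)
# Quadratic element, local order (end node, end node, interior node).
k_quad = np.array([[7.0, 1.0, -8.0],
                   [1.0, 7.0, -8.0],
                   [-8.0, -8.0, 16.0]]) / 3.0
f_quad = -np.array([1.0, 1.0, 4.0]) / 6.0

assembly_list = [[1, 2], [2, 3], [3, 5, 4]]   # rows of Table 18.4:1
elements = [(k_lin, f_lin, assembly_list[0]),
            (k_lin, f_lin, assembly_list[1]),
            (k_quad, f_quad, assembly_list[2])]

K, F = assemble(5, elements)
K_low, _ = assemble_lower(5, elements)
print(K)
print(F)
```

Storing the labels per element, rather than deriving them from node order, is what lets element 3 place its interior degree-of-freedom at global label 4 and its second end node at label 5.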

Consider the problem described by Eqs. (18.2:13)—(18.2:15). Applying 
the assembly procedure to the three-element division of a domain as shown 
in Fig. 18.4:1(a) yields 



(4)    $\mathbf{F} = \begin{bmatrix} -(p\varepsilon/2)_1 \\ -(p\varepsilon/2)_1 - (p\varepsilon/2)_2 \\ -(p\varepsilon/2)_2 - (p\varepsilon/6)_3 \\ -(2p\varepsilon/3)_3 \\ -(p\varepsilon/6)_3 + T \end{bmatrix},$

in which the subscript $(\cdot)_j$ signifies that the quantity in the parenthesis is
from the j-th element. In Eqs. (3) and (4), for simplicity, we have set $a_1$, $p_1$
of Eqs. (18.2:26) and (18.2:27) to zero.

At the beginning of the assembly process, all entries of K and F are set
to zero. In assembling the first element, the quantities with subscript 1, i.e.,
$(\cdot)_1$, are added to K and F in Eqs. (3) and (4). After the second element is
assembled, the quantities with subscripts 1 and 2 have been added; and so
on. In this case, when the third element is assembled, the term associated
with T is added to $F_5$ to account for the contribution from the natural
boundary condition at the 2nd node of the last element. The global label
for the degree-of-freedom associated with this node is 5.

Up to this point we have not discussed the rigid boundary conditions.
The K as assembled is in general singular,¹ i.e., there exists at least one
non-zero column matrix, say $\mathbf{u}_0$, such that $\mathbf{K}\mathbf{u}_0 = \mathbf{0}$. For example, the
column matrix with all components being the same is obviously one such
column matrix. We must constrain K before we can solve the equation
Ku = F for u. Constraints are related to enforcing the rigid boundary
conditions. In the finite element method, the rigid boundary condition(s)
is enforced as part of the admissibility requirements whereas the natural
boundary condition(s) is automatically accounted for through the integral
formulation. For the present problem $u = \bar{u}$ at X = 0 is a rigid boundary
condition, which must be enforced, i.e., one must have $\delta u_1 = 0$ and $u_1 = \bar{u}$.
Therefore, of the five nodal parameters ($u_1$, $u_2$, $u_3$, $u_4$, $u_5$), only four are
actually unknowns. Constraining means using just the last four equations
of Ku = F together with $u_1 = \bar{u}$ to solve for the four unknowns.

¹ Except for the case of Eq. (18.3:13) with N(X) and K(X) > 0.

The actual procedure of enforcing the rigid boundary conditions in the
finite element method is quite simple. Let us call the components of u
associated with rigid boundary conditions the constrained degrees-of-freedom.
The constraining procedure is as follows. We multiply the components of
the columns of K associated with the constrained degrees-of-freedom by
the corresponding constrained values, and subtract them from F. Then
we remove the rows of the resulting F and the rows and columns of K
associated with the constrained degrees-of-freedom to obtain the reduced
set of equations denoted as

(5)    K*u* = F*,

where u* is the column matrix u excluding the constrained
degrees-of-freedom. This is called the compact process, which physically reduces the
size of the global matrices. For the present case, $u_1$ is constrained to be $\bar{u}$.
One modifies F by subtracting from it the first column of K multiplied by
$\bar{u}$, and then removes the first row of the resulting F to form F*. One also
removes the first row and the first column of K to form K*. The column matrix u*
involves the four components ($u_2$, $u_3$, $u_4$, $u_5$). The resulting K* and F* are



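$$\mathbf{F}^* = \begin{bmatrix} -(p\varepsilon/2)_1 - (p\varepsilon/2)_2 + (ea/\varepsilon)_1\,\bar{u} \\ -(p\varepsilon/2)_2 - (p\varepsilon/6)_3 \\ -(2p\varepsilon/3)_3 \\ -(p\varepsilon/6)_3 + T \end{bmatrix}.$$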
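
The compact process can be sketched in a few lines. This is a hedged illustration rather than the book's program: the 5 × 5 system below is an assumed uniform-rod instance of the three-element example (EA = 1, ε = 1, p = 1, T = 2, $\bar{u}$ = 0.1, standard linear and quadratic rod elements), and the function name `compact` is hypothetical.

```python
import numpy as np

def compact(K, F, constrained):
    """constrained maps a 0-based degree-of-freedom index to its prescribed value."""
    fixed = sorted(constrained)
    free = [i for i in range(len(F)) if i not in constrained]
    values = np.array([constrained[i] for i in fixed])
    F_mod = F - K[:, fixed] @ values      # subtract known columns times known values
    return K[np.ix_(free, free)], F_mod[free], free

# Assumed data: EA = 1, eps = 1, p = 1, T = 2 (labels as in Table 18.4:1).
K = np.array([[ 1.0, -1.0,  0.0,  0.0,  0.0],
              [-1.0,  2.0, -1.0,  0.0,  0.0],
              [ 0.0, -1.0, 10/3, -8/3,  1/3],
              [ 0.0,  0.0, -8/3, 16/3, -8/3],
              [ 0.0,  0.0,  1/3, -8/3,  7/3]])
F = np.array([-0.5, -1.0, -2/3, -2/3, -1/6 + 2.0])
u_bar = 0.1

K_star, F_star, free = compact(K, F, {0: u_bar})
u_star = np.linalg.solve(K_star, F_star)

# Closed-form solution of (EA u')' = P, u(0) = u_bar, EA u'(L) = T for this data,
# evaluated at the positions of u_2, u_3, u_4 (interior node) and u_5.
X = np.array([1.0, 2.0, 2.5, 3.0])
u_exact = u_bar + 0.5 * X**2 + (2.0 - 3.0) * X
print(u_star)
print(u_exact)
```

For this uniform load the nodal values coincide with the closed-form solution, which makes the example a convenient check on the sign conventions of F.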

From a computational point of view, reducing the number of equations to
enforce the rigid constraints may not be convenient for a large number of
equations. The process involves altering the data structure for storing K
and F, even though it reduces the number of equations to be solved. A
practical way to enforce the constraints is to change the numerical values of
some components of K and F as shown below

$$\mathbf{F}^* = \begin{bmatrix} \bar{u} \\ -(p\varepsilon/2)_1 - (p\varepsilon/2)_2 + (ea/\varepsilon)_1\,\bar{u} \\ -(p\varepsilon/2)_2 - (p\varepsilon/6)_3 \\ -(2p\varepsilon/3)_3 \\ -(p\varepsilon/6)_3 + T \end{bmatrix}.$$


The process involves first multiplying the components of the columns of
K associated with the constrained degrees-of-freedom by the corresponding
constrained values, and subtracting them from F. One then sets the
rows and the columns of K associated with rigidly constrained
degrees-of-freedom to zero except for the diagonal term, which is set to 1. At the same
time, the rows of F associated with the constrained degrees-of-freedom are
set to the corresponding constrained values. The solution of the modified
equations

(7)    K*u = F*

automatically gives the correct values for the constrained
degrees-of-freedom. This is called the constraining process.

In the present case, only the first degree-of-freedom is constrained with
the value of $\bar{u}$. We set the first row and the first column of K to be zero
except for $K_{11}$, which is set to 1. We modify F and set $F_1$ to $\bar{u}$. The process
may look cumbersome but can be implemented easily and efficiently in
high-speed computation because K, F and K*, F* have the same data structure.
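
A minimal sketch of the constraining process follows. It is an illustration only: the system is the same assumed uniform-rod instance (EA = 1, ε = 1, p = 1, T = 2, $\bar{u}$ = 0.1), and the function name `constrain` is hypothetical. Unlike the compact process, the matrices keep their size.

```python
import numpy as np

def constrain(K, F, constrained):
    """Return modified copies of K and F; constrained maps 0-based index -> value."""
    K = np.array(K, dtype=float)          # work on copies, same size as the input
    F = np.array(F, dtype=float)
    for i, value in constrained.items():
        F -= K[:, i] * value              # subtract the constrained column from F
        K[i, :] = 0.0                     # zero the row ...
        K[:, i] = 0.0                     # ... and the column,
        K[i, i] = 1.0                     # keep a unit diagonal term
        F[i] = value                      # the i-th equation now reads u_i = value
    return K, F

# Assumed data: EA = 1, eps = 1, p = 1, T = 2 (labels as in Table 18.4:1).
K = np.array([[ 1.0, -1.0,  0.0,  0.0,  0.0],
              [-1.0,  2.0, -1.0,  0.0,  0.0],
              [ 0.0, -1.0, 10/3, -8/3,  1/3],
              [ 0.0,  0.0, -8/3, 16/3, -8/3],
              [ 0.0,  0.0,  1/3, -8/3,  7/3]])
F = np.array([-0.5, -1.0, -2/3, -2/3, -1/6 + 2.0])

K_mod, F_mod = constrain(K, F, {0: 0.1})
u = np.linalg.solve(K_mod, F_mod)
print(u)
```

Because a constrained row and column are zeroed before the next constraint is processed, the loop also handles several constrained degrees-of-freedom in any order.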

Constraints can be imposed as each individual element is being assembled
or after the assembling of all elements has been completed. For
convenience, we will use the notation

(8)    $\sum \mathbf{k}\,\mathbf{q} = \sum \mathbf{f}$

to represent symbolically the assembled and constrained finite element
equations Eq. (5) or (7).

No special effort is required to enforce the natural boundary conditions.
As mentioned before, they are accounted for by the integral formulation
and, therefore, are satisfied automatically in an approximate sense in the
finite element method. This makes it easy to meet natural boundary
conditions. The influence of natural boundary conditions is in fact reflected in
the generalized force matrix, e.g., in the element applied force matrix for
elements including a boundary with prescribed natural boundary conditions.
In the present case, EA(L)du/dX = T at X = L is a natural boundary
condition. The term involving T in the last element accounts for the
condition in the integral formulation.

The ease of handling boundary conditions, rigid as well as 
natural, for any boundaries is an important advantage of the finite 
element method over other approximation methods such as the finite 
difference method. 

The matrix K shows the banding or sparseness characteristics typical
in the finite element method. One can determine the bandwidth of any
degree-of-freedom from Eq. (18.2:17). The bandwidth of a row of a matrix
is defined as the number of components, also called entries, from
the first to the last nonzero component of the row. Another often-used
term is the semi-bandwidth, which is defined to be the number of components
from the first nonzero entry of the row to the diagonal. For the
one-dimensional example, since $u_j$ appears at most in two elements, if we
sequentially label the nodes and parameters in the global system, the
semi-bandwidth is at most two for nodes associated with linear elements as given
in Eq. (18.2:6). For nodes associated with quadratic elements as given in
Eq. (18.2:11), the semi-bandwidth is three. The semi-bandwidth depends
strongly on the way the global degrees-of-freedom are labeled. For the label
scheme shown in Fig. 18.4:1(c), the semi-bandwidths for $u_1, u_2, u_3, u_4, u_5$
are 1, 2, 2, 2, 3, respectively, while the corresponding semi-bandwidths are
1, 2, 2, 4, 4 for Fig. 18.4:1(d).
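
The semi-bandwidth of each row can be read off the assembly list without forming K. The sketch below is an illustration with a hypothetical function name, `semi_bandwidths`. It uses the list of Table 18.4:1, which happens to reproduce the values 1, 2, 2, 2, 3 quoted above, and a second, purely hypothetical relabeling (not one of the schemes in Fig. 18.4:1) to show how the labels change the result.

```python
def semi_bandwidths(n_dof, assembly_list):
    """Semi-bandwidth of each row, i - C(i) + 1, with 1-based labels."""
    first = list(range(1, n_dof + 1))          # C(i) starts at the diagonal
    for labels in assembly_list:
        lowest = min(labels)                   # smallest label coupled in this element
        for I in labels:
            first[I - 1] = min(first[I - 1], lowest)
    return [i - first[i - 1] + 1 for i in range(1, n_dof + 1)]

table_labels = [[1, 2], [2, 3], [3, 5, 4]]     # Table 18.4:1
other_labels = [[1, 4], [4, 2], [2, 5, 3]]     # hypothetical alternative labeling

bw_table = semi_bandwidths(5, table_labels)
bw_other = semi_bandwidths(5, other_labels)
print(bw_table, sum(bw_table) / 5)
print(bw_other, sum(bw_other) / 5)
```

The average semi-bandwidth, printed alongside each list, is the quantity that a labeling scheme tries to keep small.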

The semi-bandwidth strongly affects the computer storage of the assembled
matrices and the computer time needed for solving the system of
equations. This is important for large systems of equations. For the variable
bandwidth solution algorithm (Tong and Rossettos 1977), the optimum
labeling scheme is the one that approximately gives the minimum average
semi-bandwidth. Obviously for the simple problem considered, the labeling
scheme shown in Fig. 18.4:1(d) does not give a minimum semi-bandwidth.
However, for more complicated 2- or 3-dimensional problems, determining
the optimum labeling scheme that minimizes the average semi-bandwidth
of the global stiffness matrix is non-trivial. Readers are referred to the
extensive literature on this subject (e.g., George and Liu 1981).

One can construct K and F in a similar fashion for more elements.
Thus one can refine the analysis in a systematic manner to improve the
accuracy of the solution. One may also construct K and F for higher
order elements such as the three-node element with the element matrices
defined in Eqs. (18.2:26) and (18.2:27). Each element has three parameters,
i.e., three degrees-of-freedom. If all elements have three nodes, K
and F will involve 7 degrees-of-freedom for the 3-element domain. The
semi-bandwidth is 3 for the degrees-of-freedom associated with the element
boundary nodes and is 2 for those associated with the interior nodes of
elements if the global degrees-of-freedom are labeled sequentially. For the
same number of elements, the system associated with higher order elements
has more degrees-of-freedom and higher semi-bandwidth for K. At the same
time, the higher order system generally gives a more accurate solution.

Problem 18.7. Construct the four-node Lagrange interpolation functions
over 0 < x < ε. The four nodes are the two end nodes and two equally spaced
interior nodes.

Problem 18.8. Consider the problem defined by Eqs. (18.2:13)-(18.2:15)
with EA being constant over the entire domain. (a) Show that the finite element
solution using linear elements always gives the exact solution at the nodes for
any loading distribution p(X) and any number of elements of non-uniform size.
(Hint: see Tong 1970a.) (b) Do you expect the same to be true if quadratic elements
are used? Prove your result.

18.5. EQUATION SOLVING 

For convenience we shall drop the superscript '*' of K and F in the
subsequent discussion. The finite element analysis inevitably ends up solving a
large set of linear algebraic equations like Eq. (18.4:7). The feasibility of the
application of the finite element method hinges on the ability to solve a large
system of equations by computers in a reasonable time and with acceptable
accuracy. The second point is associated with round-off error, which can
render the numerical solution meaningless. The first point depends on the
operation counts involved in the solution algorithm for a linear system of
equations. Usually the resulting K from the finite element method is symmetric
and sparse with many of its nonzero entries lying within a narrow
band of the diagonal. Sparse means that only a few components, or
entries, of K are nonzero. Both the sparseness and the semi-bandwidth
of each row play important roles affecting the operation counts of a solution
algorithm.

There are two kinds of methods for solving a large system of equations:
(1) direct methods, and (2) iterative methods. A direct method is one
which, after a finite number of operations, if all computations were carried
out without roundoff, would lead to the "exact" solution of the algebraic
system. An iterative method usually requires an infinite number of iterations
to converge to the "exact" solution. Detailed discussion of solution
methods is beyond the scope of this book. We shall just briefly outline
the concept of direct methods and identify a number of iterative schemes.
Readers are referred to the literature for details.

The most commonly used direct methods of solution are the Gauss
elimination and the triple factoring method. The idea underlying the Gauss
elimination is simple. For a system of n algebraic equations, we use the first
equation to eliminate the "first" variable $u_1$ from the last n - 1 equations,
then use the second equation to eliminate the second variable $u_2$ from the
updated last n - 2 equations, and so on. After n - 1 such eliminations,
the resulting matrix equation is triangular. This process is termed forward
elimination. At this stage the last equation involves only one unknown $u_n$,
which can be determined. Then one can determine $u_{n-1}$ from the (n - 1)th
equation, and so on until all unknowns are obtained. The last procedure is
called backward substitution.
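
Forward elimination and backward substitution can be sketched as follows. This is a minimal illustration without pivoting, which is adequate only when the pivots stay nonzero, as for a symmetric positive definite constrained K. The 4 × 4 system is the assumed uniform-rod example (EA = 1, ε = 1, p = 1, T = 2, $\bar{u}$ = 0.1) after the compact process, and the function name `gauss_solve` is hypothetical.

```python
import numpy as np

def gauss_solve(K, F):
    """Solve K u = F by forward elimination and backward substitution."""
    A = np.array(K, dtype=float)
    b = np.array(F, dtype=float)
    n = len(b)
    for j in range(n - 1):                 # eliminate variable j ...
        for i in range(j + 1, n):          # ... from the equations below it
            if A[i, j] == 0.0:
                continue                   # nothing to eliminate in this row
            m = A[i, j] / A[j, j]
            A[i, j:] -= m * A[j, j:]
            b[i] -= m * b[j]
    u = np.zeros(n)
    for i in range(n - 1, -1, -1):         # the last equation has one unknown
        u[i] = (b[i] - A[i, i + 1:] @ u[i + 1:]) / A[i, i]
    return u

# Assumed constrained system (unknowns u_2, u_3, u_4, u_5).
K_star = np.array([[ 2.0, -1.0,  0.0,  0.0],
                   [-1.0, 10/3, -8/3,  1/3],
                   [ 0.0, -8/3, 16/3, -8/3],
                   [ 0.0,  1/3, -8/3,  7/3]])
F_star = np.array([-0.9, -2/3, -2/3, 11/6])

u = gauss_solve(K_star, F_star)
print(u)
```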

Many time-saving techniques can be applied to reduce the operation
counts of this method. The main one is to recognize the zero entries of
K in the elimination procedure. During the elimination of the j-th variable
from an equation, if the coefficient associated with this variable is zero,
all operations to eliminate this variable from the equation can be skipped.
For a sparse matrix, this step provides great time savings. Recognizing zero
entries in K efficiently and skipping the unnecessary
operations are keys to the successful implementation of the finite element
method in practical applications.

The essential information enabling us to recognize the unnecessary
operations in both the forward elimination and backward substitution processes
is the column number of the first nonzero entry of each row, denoted by C(i)
for the i-th row. This means $K_{ij} = 0$ for j < C(i). The semi-bandwidth of
the i-th row is simply i - C(i) + 1. All operations to eliminate the variables 1
through C(i) - 1 from the i-th row can be skipped in forward elimination. In
addition, there is no need to store $K_{ij}$ for all j < C(i). This helps greatly in
reducing the data storage requirement in practice, as the semi-bandwidth
of K usually varies greatly. Therefore, it is important to label the nodes in
such a way as to give a minimum average semi-bandwidth.
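
One way to use C(i) for storage is to keep only the entries from column C(i) to the diagonal of each row in a single array. The sketch below is an illustration of that idea, not the storage scheme of Tong and Rossettos (1977). It uses 0-based indices, an assumed 5 × 5 uniform-rod K, and the hypothetical names `first_nonzero_columns`, `to_skyline` and `entry`.

```python
import numpy as np

def first_nonzero_columns(K):
    """C(i): 0-based column of the first nonzero entry of row i."""
    return [int(np.flatnonzero(K[i, :i + 1])[0]) for i in range(K.shape[0])]

def to_skyline(K):
    """Pack the lower triangle row by row, from column C(i) to the diagonal."""
    C = first_nonzero_columns(K)
    start, values = [0], []
    for i, c in enumerate(C):
        values.extend(K[i, c:i + 1])
        start.append(len(values))          # start[i + 1] marks the end of row i
    return C, start, np.array(values)

def entry(C, start, values, i, j):
    """Recover K[i, j] from the packed storage, using symmetry."""
    if j > i:
        i, j = j, i
    if j < C[i]:
        return 0.0                         # outside the stored profile
    return values[start[i] + j - C[i]]

# Assumed data: EA = 1, eps = 1 (labels as in Table 18.4:1).
K = np.array([[ 1.0, -1.0,  0.0,  0.0,  0.0],
              [-1.0,  2.0, -1.0,  0.0,  0.0],
              [ 0.0, -1.0, 10/3, -8/3,  1/3],
              [ 0.0,  0.0, -8/3, 16/3, -8/3],
              [ 0.0,  0.0,  1/3, -8/3,  7/3]])

C, start, values = to_skyline(K)
semi = [i - C[i] + 1 for i in range(5)]
rebuilt = np.array([[entry(C, start, values, i, j) for j in range(5)]
                    for i in range(5)])
print(C, semi, len(values))
```

Here 10 numbers are stored instead of the 15 of the full lower triangle; the saving grows quickly with the size of the system.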

For symmetric K, the first nonzero row of the i-th column of the resulting
matrix after forward elimination is also C(i). This allows us to use the
same computer storage for the lower triangle of K before the forward
elimination process and for the resulting upper triangle after the process. This
helps further reduce computer storage requirements. The process for
determining C(i) and utilizing this information was discussed by Tong and
Rossettos (1977).

Another commonly used direct method is the triple factoring method.
The basic idea is to express a symmetric K in the form

$$\mathbf{K} = \mathbf{L}\mathbf{D}\mathbf{L}^T\,,$$

where L is a lower triangular matrix with all diagonal terms being unity
and the entries of the upper triangle being zero and D is a diagonal matrix.
The process of determining L and D is named factorization. We can then
easily solve the equation

$$\mathbf{L}\mathbf{u}^* = \mathbf{F}\,,$$

because L is a lower triangular matrix. The procedure of solving for u* is
called forward substitution. We can solve for u from

$$\mathbf{L}^T\mathbf{u} = \mathbf{D}^{-1}\mathbf{u}^*\,,$$

by backward substitution. One can show that L and K have the same C(i).
Therefore one can use the same computer storage for K and for L and D. One
can also use the same storage for F, u*, and u as they are generated sequentially.
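
The three stages (factorization, forward substitution, backward substitution) can be sketched as below. This is a dense, unpivoted illustration that ignores the profile storage discussed above. The 4 × 4 system is the assumed constrained uniform-rod example, and the names `ldlt` and `ldlt_solve` are hypothetical.

```python
import numpy as np

def ldlt(K):
    """Factor a symmetric K as L D L^T with unit lower triangular L."""
    n = K.shape[0]
    L = np.eye(n)
    d = np.zeros(n)
    for j in range(n):
        d[j] = K[j, j] - (L[j, :j] ** 2) @ d[:j]
        for i in range(j + 1, n):
            L[i, j] = (K[i, j] - (L[i, :j] * L[j, :j]) @ d[:j]) / d[j]
    return L, d

def ldlt_solve(L, d, F):
    n = len(F)
    y = np.zeros(n)
    for i in range(n):                     # forward substitution: L y = F
        y[i] = F[i] - L[i, :i] @ y[:i]
    z = y / d                              # apply D^{-1}
    u = np.zeros(n)
    for i in range(n - 1, -1, -1):         # backward substitution: L^T u = z
        u[i] = z[i] - L[i + 1:, i] @ u[i + 1:]
    return u

# Assumed constrained system (unknowns u_2, u_3, u_4, u_5).
K_star = np.array([[ 2.0, -1.0,  0.0,  0.0],
                   [-1.0, 10/3, -8/3,  1/3],
                   [ 0.0, -8/3, 16/3, -8/3],
                   [ 0.0,  1/3, -8/3,  7/3]])
F_star = np.array([-0.9, -2/3, -2/3, 11/6])

L, d = ldlt(K_star)
u = ldlt_solve(L, d, F_star)
print(u)
```

In this example the zeros of K* to the left of C(i) stay zero in L, which is the property that allows K to be overwritten by its factors.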

Factorization in the triple factoring method or forward elimination in
the Gauss elimination method is the most time-consuming part of the
solution procedure. For many practical problems, 85 to 95 percent of the
total computational effort is for factorization or forward elimination. The
multiplication counts for both the triple factoring and the Gauss elimination
methods are of the order $n^3$ for a fully populated matrix of order
n. The count is reduced to the order $nB^2$ for a banded matrix, where B
is the average semi-bandwidth. This count is further reduced if the variable
bandwidth of the matrix is accounted for. In the process of forward
elimination or factorization of a highly sparse matrix, only some of the
zero entries between the first nonzero entry and the diagonal of a row
will be filled in, i.e., become nonzero, while most of the zero entries remain
untouched. Recognizing those unaltered entries and avoiding the operations
(i.e., multiplication by zero) to update them can reduce the operation
count even further. Therefore, it is important to recognize the number of
'fill-ins' in order to reduce both memory requirements and the number
of arithmetic operations. Algorithms and software have been developed to
reorder the original matrix K for this purpose. Unfortunately, an algorithm
that is best for minimizing the average semi-bandwidth is in general not the
best for minimizing 'fill-in'. The reverse Cuthill-McKee and
Gibbs-Poole-Stockmeyer reordering algorithms (George and Liu 1981, Cuthill and McKee
1969, Crane et al. 1976) are effective schemes for variable bandwidth (also
called skyline) equation solution algorithms. The modified minimum degree
and the nested dissection reordering algorithms (George and Liu 1981, Lewis
et al. 1989, Liu 1987, Gilbert and Zmijewski 1987, Ng and Peyton 1993)
are more suitable for large sparse matrices. The details of efficient computer
implementation of the triple factoring method based on the variable
bandwidth strategy can be found in Tong and Rossettos (1977). The
development of efficient equation solvers for sparse matrices is an important
area of on-going research. Work on the subject includes Chen et al. (1996)
and Nguyen et al. (1996).

Many advances in the direct method are in its adaptation for parallel
computation. This means having many central processing units (CPUs) working
together, with each CPU doing part of the work of solving the equations.
The computation process can be very involved. Since the direct method is
inherently a sequential process, many operations require information from
the results of earlier steps, which may be computed by different CPUs.
Also, the assembled global stiffness matrix is often so huge that no single
computer can store all the information. When a large number of CPUs is
involved, communication between different CPUs and synchronization of the
processing² can consume both CPU time and clock time. The speedup in
computer time is not linearly proportional to the number of CPUs used.
Balancing the work loads among CPUs and minimizing communications among
them are essential for efficient parallel computation. Efficient parallel
computation algorithms are under intensive development (Noor 1987, Belytschko
et al. 1990, John et al. 1992, Law and Mackay 1993, Zheng and Chang 1995,
Duff and Reid 1995, Golub and Van Loan 1996, Nguyen et al. 1996).

There are numerous iterative schemes. The methods proceed from an
initial guessed solution $\mathbf{u}^{(0)}$ and define a sequence of successive
approximations $\mathbf{u}^{(r)}$, which is required to converge to the exact solution. The procedure
can be symbolized in the form

$$\mathbf{N}\mathbf{u}^{(r)} = \mathbf{P}\mathbf{u}^{(r-1)} + \mathbf{F}\,,$$

where N and P are matrices derived from K. The matrix N is usually
diagonal or triangular so that $\mathbf{u}^{(r)}$ can be readily solved. In each step, most
of the operations required are the multiplication of P and $\mathbf{u}^{(r-1)}$. One
advantage of some of the iterative methods is that the product can be
computed at the element level. This makes it unnecessary to physically form
the global P, saves computer storage, and is easily adaptable
for parallel computation. The commonly used methods include the Jacobi
iteration, the Gauss-Seidel iteration, the conjugate gradient method, and
the variable metric method. Interested readers are referred to the literature
(Isaacson and Keller 1966).
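
The splitting form above can be sketched for two of the named schemes. The choices below are the usual ones and are stated here as assumptions: N is the diagonal of K for the Jacobi iteration and the lower triangle of K (including the diagonal) for the Gauss-Seidel iteration, with P = N - K in both cases. The function name `splitting_iteration` and the test matrices are hypothetical. For brevity N is passed to a general solver, whereas a real program would exploit its diagonal or triangular form.

```python
import numpy as np

def splitting_iteration(K, F, scheme="gauss-seidel", tol=1e-12, max_iter=10000):
    """Iterate N u_r = P u_{r-1} + F with P = N - K, starting from u_0 = 0."""
    K = np.asarray(K, dtype=float)
    F = np.asarray(F, dtype=float)
    if scheme == "jacobi":
        N = np.diag(np.diag(K))
    elif scheme == "gauss-seidel":
        N = np.tril(K)
    else:
        raise ValueError("unknown scheme: " + scheme)
    P = N - K
    u = np.zeros_like(F)
    for r in range(1, max_iter + 1):
        u_new = np.linalg.solve(N, P @ u + F)
        if np.linalg.norm(u_new - u) <= tol * max(1.0, np.linalg.norm(u_new)):
            return u_new, r
        u = u_new
    return u, max_iter

# Assumed constrained rod system (unknowns u_2, u_3, u_4, u_5).
K_star = np.array([[ 2.0, -1.0,  0.0,  0.0],
                   [-1.0, 10/3, -8/3,  1/3],
                   [ 0.0, -8/3, 16/3, -8/3],
                   [ 0.0,  1/3, -8/3,  7/3]])
F_star = np.array([-0.9, -2/3, -2/3, 11/6])
u_gs, n_gs = splitting_iteration(K_star, F_star, "gauss-seidel")

# Hypothetical three-equation system from linear elements only, for Jacobi.
K_lin = np.array([[2.0, -1.0, 0.0], [-1.0, 2.0, -1.0], [0.0, -1.0, 1.0]])
F_lin = np.array([1.0, 0.0, 0.0])
u_j, n_j = splitting_iteration(K_lin, F_lin, "jacobi")
print(u_gs, n_gs)
print(u_j, n_j)
```

The Gauss-Seidel iteration converges for any symmetric positive definite K. The Jacobi iteration does not share that guarantee, which is why it is demonstrated on a different matrix here.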

² Often the needed information is generated by a different CPU. The task to generate
the information may not have been completed when it is needed by another CPU.

The accuracy of the solution by the direct method and the number of
iterations required by the iterative methods to reach a converged solution
depend strongly on the condition number, the ratio of the highest to the
lowest eigenvalue of the constrained K. The condition number of the finite
element equations can be estimated from the element stiffness matrices (Tong
1970b, Hughes 1987). Reconditioning and reordering (including pivoting)
K are often necessary in order to obtain a numerical solution of acceptable
accuracy (George and Liu 1981, Chen et al. 1996) for very large systems.
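
For a small system the condition number can simply be computed from the eigenvalues. The sketch below does this for the assumed uniform-rod example (EA = 1, ε = 1) and also confirms that the unconstrained K has a zero eigenvalue. It illustrates the definition only, and is not the element-based estimate of Tong (1970b).

```python
import numpy as np

# Assumed unconstrained 5 x 5 stiffness matrix (labels as in Table 18.4:1).
K = np.array([[ 1.0, -1.0,  0.0,  0.0,  0.0],
              [-1.0,  2.0, -1.0,  0.0,  0.0],
              [ 0.0, -1.0, 10/3, -8/3,  1/3],
              [ 0.0,  0.0, -8/3, 16/3, -8/3],
              [ 0.0,  0.0,  1/3, -8/3,  7/3]])
K_star = K[1:, 1:]                         # constrained: first row and column removed

evals_free = np.linalg.eigvalsh(K)         # ascending eigenvalues, unconstrained K
evals = np.linalg.eigvalsh(K_star)         # ascending eigenvalues, constrained K
cond = evals[-1] / evals[0]                # highest / lowest eigenvalue

print("lowest eigenvalue of unconstrained K:", evals_free[0])
print("condition number of constrained K:", cond)
```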

Within a tolerable error, there is no clear-cut answer as to which method 
is best. Generally, the iterative methods are better for very large and 
highly sparse systems, usually over one hundred thousand equations. It is 
also easier to implement an iterative scheme for parallel computation if the 
scheme does not require the assembled global matrices. The direct methods 
are often faster, if they can be used at all. The practical considerations of 
adopting a specific method are which of the two methods is more familiar 
to the user and what computer programs are available. 

The finite element process of assembling, constraining, and solving
algebraic equations is very general and applicable to one-, two-, or
three-dimensional problems. In assembling an element, one needs to know the
assembly list, which identifies the global labels of the generalized
coordinates of each element. In constraining the assembled matrices, one needs
the information on which degree-of-freedom is to be constrained and its
constrained value. Constraining can be performed as an element is being
assembled or after assembling has been completed. This is a matter of
choice in computer programming. In solving the algebraic equations by a
direct method, one needs to know the semi-bandwidth of each
degree-of-freedom in the assembled stiffness matrix for efficient computation. This is
extremely important not only for saving computer time but also for saving
the computer storage for the global stiffness matrix.

18.6. TWO-DIMENSIONAL PROBLEMS BY 
ONE-DIMENSIONAL ELEMENTS 

A class of two-dimensional problems solvable by one-dimensional elements
is the plane truss. A plane truss assembly consists of members describable
by one-dimensional elements. Consider a two-member plane truss as
shown in Fig. 18.6:1. Each member can be approximated by one or more
one-dimensional elements of Eqs. (18.2:24) and (18.2:25) or Eqs. (18.2:26)
and (18.2:27). (More than one element or higher order elements are needed
for a truss member only if the truss member has a nonuniform cross-section.
See Problem 18.2.) We use X, Y as the global coordinates for the plane truss
and the nodal values of the displacements u, v in the X-, Y-directions as
the generalized coordinates. Let x, y be the local coordinates of a truss
member with x being along the axis of the member. Then, the q's in the
one-dimensional element are nodal displacements along the x-direction. In order
to assemble a one-dimensional element in the plane truss application, we
express the q's of a node in terms of the nodal values of u, v, which are
the common generalized coordinates for all elements at the same node. We
then derive the transformed element matrices that are associated with the
common generalized coordinates. By so doing we can then simply add the
transformed element matrices to form the global matrices.

Fig. 18.6:1. Two-member plane truss assembly.

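Assuming uniform cross-section for the member, the element stiffness
and force matrices are in the form given by Eqs. (18.2:24) and (18.2:25)
with q's being the axial displacements at the nodes. From Fig. 18.6:1, q's
are related to u, v by rotation. Let $\bar{q}_{2i-1} = u_i$ and $\bar{q}_{2i} = v_i$ for i = 1, 2.
Then the relation between q's and $\bar{q}$'s is

$$q_i = \bar{q}_{2i-1}\,c + \bar{q}_{2i}\,s\,,$$

where c = cos θ, s = sin θ with θ denoting the orientation of the element
with respect to the X-axis and i is the local nodal label. In matrix form

$$\mathbf{q} = \mathbf{R}\bar{\mathbf{q}}\,,$$

where R is a rotational matrix defined as

$$\mathbf{q} = \begin{bmatrix} q_1 \\ q_2 \end{bmatrix}, \quad \mathbf{R} = \begin{bmatrix} c & s & 0 & 0 \\ 0 & 0 & c & s \end{bmatrix}, \quad \bar{\mathbf{q}}^T = [\bar{q}_1\ \ \bar{q}_2\ \ \bar{q}_3\ \ \bar{q}_4] \quad \text{for 2-node element,}$$

$$\mathbf{q} = \begin{bmatrix} q_1 \\ q_2 \\ q_3 \end{bmatrix}, \quad \mathbf{R} = \begin{bmatrix} c & s & 0 & 0 & 0 & 0 \\ 0 & 0 & c & s & 0 & 0 \\ 0 & 0 & 0 & 0 & c & s \end{bmatrix}, \quad \bar{\mathbf{q}}^T = [\bar{q}_1, \ldots, \bar{q}_6] \quad \text{for 3-node element.}$$

The element matrices associated with $\bar{\mathbf{q}}$ can be derived based on the first
variation of the functional for an element

$$\delta\mathbf{q}^T(\mathbf{k}\mathbf{q} - \mathbf{f}) = \delta\bar{\mathbf{q}}^T(\bar{\mathbf{k}}\bar{\mathbf{q}} - \bar{\mathbf{f}})\,.$$

The element matrices associated with $\bar{\mathbf{q}}$ are

$$\bar{\mathbf{k}} = \mathbf{R}^T\mathbf{k}\mathbf{R}\,, \qquad \bar{\mathbf{f}} = \mathbf{R}^T\mathbf{f}\,.$$

This process is called rotation.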
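
The rotation can be sketched for an element with any number of nodes. This is an illustration under assumptions: the axial stiffness is taken as the standard uniform two-node form (EA/ε)[[1, -1], [-1, 1]] with EA/ε = 1, the axial force matrix is a made-up example, and the function name `rotate_element` is hypothetical.

```python
import numpy as np

def rotate_element(k, f, theta):
    """Return (R^T k R, R^T f, R) for a plane truss element at angle theta."""
    c, s = np.cos(theta), np.sin(theta)
    n = k.shape[0]                         # number of nodes (axial unknowns)
    R = np.zeros((n, 2 * n))
    for i in range(n):
        R[i, 2 * i] = c                    # coefficient of u at node i
        R[i, 2 * i + 1] = s                # coefficient of v at node i
    return R.T @ k @ R, R.T @ f, R

# Assumed axial matrices: EA / eps = 1 and an arbitrary axial force matrix.
k = np.array([[1.0, -1.0], [-1.0, 1.0]])
f = np.array([0.5, 0.5])

k_bar, f_bar, R = rotate_element(k, f, np.pi / 6.0)   # member at 30 degrees
k_vert, _, _ = rotate_element(k, f, np.pi / 2.0)      # vertical member
print(k_bar)
print(f_bar)
```

The rotated 4 × 4 matrix still has rank one: a single truss member resists only stretching along its own axis, so stiffness in other directions must come from the other members at the node.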

In terms of $\bar{q}$'s the 2- and 3-node elements have 4 and 6
degrees-of-freedom, respectively. We can determine the assembly list, which specifies
the global degree-of-freedom label $I_r$, r = 1, 2, ..., p, of the r-th
degree-of-freedom of each element, where p is the number of degrees-of-freedom of
the element. We can then assemble the element matrices as described in
Eqs. (18.4:1) and (18.4:2).

Similarly one can generate 6- and 9-degree-of-freedom 3-dimensional
elements from the 2- and 3-node one-dimensional elements. This is accomplished
by expressing the axial displacement of the one-dimensional element
in terms of the 3 displacements in the 3-dimensional space.
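
The three-dimensional case can be sketched the same way, with the pair (c, s) replaced by the three direction cosines of the member axis. This is an illustration under assumptions: a uniform two-node member with the standard axial stiffness (EA/length)[[1, -1], [-1, 1]], hypothetical node coordinates, and a hypothetical function name `truss_element_3d`.

```python
import numpy as np

def truss_element_3d(xa, xb, EA):
    """6 x 6 stiffness of a uniform two-node member between points xa and xb."""
    xa = np.asarray(xa, dtype=float)
    xb = np.asarray(xb, dtype=float)
    length = np.linalg.norm(xb - xa)
    cosines = (xb - xa) / length           # direction cosines of the member axis
    k = (EA / length) * np.array([[1.0, -1.0], [-1.0, 1.0]])
    R = np.zeros((2, 6))
    R[0, 0:3] = cosines                    # axial displacement at node 1
    R[1, 3:6] = cosines                    # axial displacement at node 2
    return R.T @ k @ R, R

# Hypothetical member from (0, 0, 0) to (1, 2, 2) with EA = 3, so EA/length = 1.
k_bar, R = truss_element_3d([0.0, 0.0, 0.0], [1.0, 2.0, 2.0], EA=3.0)
print(k_bar)
```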

18.7. GENERAL FINITE ELEMENT FORMULATION 

We have demonstrated the construction of finite element equations for
rods (governed by a second order ordinary differential equation) and beams
(governed by a fourth order ordinary differential equation). The process
involves steps including: dividing the domain into a finite number of discrete
elements, evaluating the element matrices based on the functional of a
variational principle, assembling the element matrices based on the condition that
the functional of the whole system equals the sum of the functionals of all
elements, imposing constraints to enforce the rigid boundary conditions, and
finally solving the constrained algebraic equations numerically by high-speed
computers.

The determination of element stiffness matrices requires the establishment
of the relations between the generalized element nodal forces and
nodal displacements. In the process, the local approximate solution is
expressed in terms of nodal parameters or generalized coordinates interpolated
by interpolation functions. The nodal parameters and interpolation
functions are chosen to satisfy the admissibility requirements of the functional.
Such a concept can be generalized to problems involving several unknown
functions in a two- or three-dimensional domain.

Consider a general problem of seeking the solution u(X) of the differential
equations

(1)    $\mathbf{A}(\mathbf{u}) = \begin{bmatrix} A_1(\mathbf{u}) \\ A_2(\mathbf{u}) \\ \vdots \end{bmatrix} = \mathbf{0}$

in a domain V, subject to the boundary conditions

(2)    $\mathbf{B}(\mathbf{u}) = \begin{bmatrix} B_1(\mathbf{u}) \\ B_2(\mathbf{u}) \\ \vdots \end{bmatrix} = \mathbf{0}$

on the boundaries ∂V of the domain, where X represents the spatial
coordinates or the independent variables, u is a vector of several variables and
is a function of X, A's are differential operators defined in V and B's are
differential operators defined on ∂V. Thus, the differential equation may
be one or a set of simultaneous equations.

We seek the solution in the form 

(3) SIl= [ W(u,^u)dC+ [ w{u, <5u) dS = 0, 

J V JdV 

for all admissible u and 5u, where W(u, <fu), ie(u, <5u) are the variation of 
the integrands of a functional, or simply A(u)<5u and B(u)<5u, respectively. 
The admissible requirements are: the functions u and <5u must be sufficient 
smooth to allow (in be defined mathematically, and in addition, u must 
satisfy the rigid boundary conditions (see Sec. 10.6) and <Su is zero over the 
boundaries where rigid boundary conditions are prescribed. 

The specific form of Eq. (3) depends obviously on the nature of the 
problem. For instance, for the rod problem discussed in Sec. 18.2.2, $\mathbf{A}$, $\mathbf{u}$, $\mathbf{X}$ 
are scalars. We have 

$\mathbf{X} = X$ 

as the spatial coordinate, 

$A(u) = \frac{d}{dX}\left[ EA(X)\,\frac{du}{dX} \right] - p(X) = 0$ 

as the differential equation over $0 < X < L$, and 

$B_1(u) = u - \bar{u} = 0$  at $X = 0$, 

$B_2(u) = EA(L)\,\frac{du}{dX} - \bar{T} = 0$  at $X = L$, 

as the boundary conditions. The functional for the principle of minimum 
potential energy is given in Eq. (18.2:16) with its first variation being 

$\delta\Pi = \int_0^L \left[ EA(X)\,\frac{du}{dX}\frac{d\,\delta u}{dX} + p(X)\,\delta u \right] dX - \bar{T}\,\delta u(L) = 0,$ 

shown in Eq. (18.2:17). The admissibility requirements are: $u$ and $\delta u$ are 
$C^0$ continuous over $0 < X < L$, and $u = \bar{u}$ and $\delta u = 0$ at $X = 0$. 
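
As a consistency check on the signs in the rod example, the first variation can be integrated by parts to recover the differential equation and the natural boundary condition. The short derivation below is a sketch that assumes only that $EA(X)\,du/dX$ is differentiable and that $\delta u = 0$ at $X = 0$, so that the boundary term at $X = 0$ drops out.

```latex
\begin{aligned}
\delta\Pi
&= \int_0^L \left[ EA\,\frac{du}{dX}\,\frac{d\,\delta u}{dX} + p\,\delta u \right] dX
   - \bar{T}\,\delta u(L) \\
&= -\int_0^L \left\{ \frac{d}{dX}\left[ EA\,\frac{du}{dX} \right] - p \right\} \delta u \, dX
   + \left[ EA(L)\,\frac{du}{dX}\bigg|_{X=L} - \bar{T} \right] \delta u(L) \\
&= -\int_0^L A(u)\,\delta u \, dX + B_2(u)\,\delta u(L) = 0 .
\end{aligned}
```

Since $\delta u$ is arbitrary in $0 < X < L$ and at $X = L$, the vanishing of $\delta\Pi$ implies $A(u) = 0$ in the domain and $B_2(u) = 0$ at $X = L$, while $B_1(u) = 0$ is imposed directly as the rigid boundary condition.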

In the finite element method, the domain is divided into elements and 
Eq. (3) is written in the form 

(4)    $\delta\Pi = \sum_{\text{all elements}} \delta\Pi_e = 0,$ 

which is similar to Eq. (18.2:17), where 

(5)    $\delta\Pi_e = \int_{V_e} W(\mathbf{u}, \delta\mathbf{u})\,dV + \int_{\partial V_e} w(\mathbf{u}, \delta\mathbf{u})\,dS,$ 

in which $V_e$ is the domain of an element and the surface integral extends 
over the portion of $\partial V_e$ (the boundary of $V_e$) where natural boundary 
conditions are prescribed. The admissibility requirements are that $\mathbf{u}$ and 
$\delta\mathbf{u}$ are sufficiently smooth over the whole domain that the sum of $\delta\Pi_e$ 
of all elements equals $\delta\Pi$, and that $\mathbf{u}$ and $\delta\mathbf{u}$ also satisfy the rigid 
boundary conditions. Functions that satisfy the admissibility requirements 
are called admissible functions. The continuity requirements at inter-element 
boundaries are also called the compatibility conditions. Generally, 
admissibility requires $C^0$ continuity for $\mathbf{u}$ and $\delta\mathbf{u}$ if the differential 
operators in Eq. (1) are second order, and $C^1$ continuity for fourth order 
differential operators. 

For the case of rods as described in Sec. 18.2, we have 

$\int_{V_e} W(u, \delta u)\,dV = \frac{l}{2} \int_{-1}^{1} \left[ \frac{4EA(\xi)}{l^2}\,\frac{du}{d\xi}\frac{d\,\delta u}{d\xi} + p(\xi)\,\delta u \right] d\xi,$ 

$\int_{\partial V_e} w(u, \delta u)\,dS = -a\,\delta q_2\,\bar{T},$ 

as given in Eq. (18.2:18) with the requirement of $C^0$ continuity for $u$ and 
$\delta u$. We shall derive the expressions of $\Pi_e$ or $\delta\Pi_e$ for different finite element 
models in the subsequent sections. 

Establishing the solution based on Eq. (3) or (4) is termed the integral 
formulation. It is also identified as the "weak" formulation, meaning that the 
requirements of satisfying the governing differential equations everywhere 
in the domain and the natural boundary conditions on the force boundaries 
have been relaxed or weakened. Equations (3) and (4) are the "weak" or 
integral forms of the whole domain and Eq. (5) is the "weak" or integral 
form of an element. The components of $\mathbf{u}$ in Eqs. (3) and (4) are often 
called fields or field variables of the functional or integral form. 

The solution within an element is approximated in the form 

(6) u = h(X)q, 

where h(X), the shape or interpolation function matrix, is a pre-determined 
matrix function of independent variables X, and q is an unknown param¬ 
eter matrix representing the generalized coordinates or parameters associ¬ 
ated with nodal values of u which may also include the spatial derivatives 
of u. The approximation must meet the requirement of completeness, to 
be discussed in the next section, to assure that u converges to the exact 
solution, and the requirement of admissibility, to assure the existence of 
$\delta\Pi$. The admissibility and completeness conditions, due to the continuity 
requirement, restrict the selection of h(X) in different applications. 

Continuity normally does not cause any problems within the element, 
where the interpolation functions are generally smooth. However, the 
combination of completeness and smoothness often poses difficulty along 
inter-element boundaries for 2- and 3-dimensional problems, especially for 
problems involving fourth order differential equations. In practical 
applications, the admissibility condition is sometimes violated along a portion 
of inter-element boundaries, especially for problems governed by fourth order 
differential equations. Elements with all interpolation functions meeting 
the admissibility requirements are named compatible elements and those that 
do not are identified as incompatible elements. 

The variation $\delta\mathbf{u}$ is usually, but not necessarily, expressed in the form 

(7)    $\delta\mathbf{u} = \mathbf{h}(\mathbf{X})\,\delta\mathbf{q},$ 

i.e., $\delta\mathbf{u}$ has the same spatial variation as $\mathbf{u}$, and its generalized 
coordinates $\delta\mathbf{q}$ are arbitrary subject to the constraint of rigid boundary 
conditions. This usually requires that certain components of $\mathbf{q}$ at nodes 
associated with rigid boundary conditions equal the prescribed values and that 
the corresponding components of $\delta\mathbf{q}$ equal zero. In the subsequent discussion, 
we will assume that $\mathbf{u}$ and $\delta\mathbf{u}$ are similar in form as given in Eqs. (6) and (7); 
therefore there will be no need to discuss them separately. 

Again, for the rod problem considered in Sec. 18.2.2, we have 

$\mathbf{u} = u = \mathbf{h}(\mathbf{X})\,\mathbf{q} = [\,h_1(\xi) \;\; h_2(\xi)\,] \begin{bmatrix} q_1 \\ q_2 \end{bmatrix}$ 

for linear interpolation, where the $h$'s are the interpolation or shape functions in 
terms of the natural coordinate $\xi$, as defined in Eq. (18.2:6), or 

$\mathbf{u} = u = [\,h_1(\xi) \;\; h_2(\xi) \;\; h_3(\xi)\,] \begin{bmatrix} q_1 \\ q_2 \\ q_3 \end{bmatrix}$ 

for quadratic interpolation as defined in Eq. (18.2:11). The rigid constraints 
are equivalent to setting $q_1 = \bar{u}$ and $\delta q_1 = 0$ in the first element. For elastic 
analysis the weak form in Eq. (3) is the first variation of the potential 
energy or the principle of virtual work. 
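
The whole chain for the rod (element matrices, assembly, rigid constraint, solution) can be sketched in Python. This is a minimal illustration under stated assumptions, not the book's code: EA and p are taken as constants, the linear shape functions are assumed to be $h_1 = (1-\xi)/2$ and $h_2 = (1+\xi)/2$, and the name `solve_rod` and its parameters are introduced here for the example. The sign of the load term follows the first variation $\int [EA\,u'\,\delta u' + p\,\delta u]\,dX - \bar{T}\,\delta u(L) = 0$.

```python
import numpy as np

def solve_rod(n_el, L=1.0, EA=1.0, p=0.0, T_bar=1.0, u_bar=0.0):
    """Two-node linear elements for a rod with u(0) = u_bar and
    EA du/dX = T_bar at X = L (constant EA and p assumed)."""
    n = n_el + 1
    l = L / n_el
    K = np.zeros((n, n))
    F = np.zeros(n)
    k_e = (EA / l) * np.array([[1.0, -1.0], [-1.0, 1.0]])
    f_e = -p * l / 2.0 * np.ones(2)   # the "+ p du" term moved to the right side
    for e in range(n_el):
        dofs = [e, e + 1]             # assembly list of element e
        K[np.ix_(dofs, dofs)] += k_e
        F[dofs] += f_e
    F[-1] += T_bar                    # natural boundary condition at X = L
    # Rigid boundary condition: q_1 = u_bar, delta q_1 = 0 (drop first equation).
    q = np.empty(n)
    q[0] = u_bar
    q[1:] = np.linalg.solve(K[1:, 1:], F[1:] - K[1:, 0] * u_bar)
    return np.linspace(0.0, L, n), q

X, q = solve_rod(4, EA=4.0, T_bar=2.0)
print(q)   # nodal displacements; the exact solution here is u = T_bar X / EA
```

For constant EA the nodal values of this linear-element solution coincide with the exact solution $u = \bar{u} + [\bar{T}X + p(X^2/2 - LX)]/EA$, which gives a convenient check.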

Equation (3) or (4) is used to derive the algebraic equations for the 
approximate solution. We write the algebraic equations in the matrix no¬ 
tation 

(8) Kr = F, 

where K and/or F depend on q if the problem is nonlinear; q is the 
generalized coordinate, or parameter, matrix of an element, whereas r is that of 
the whole system. The well-known Rayleigh-Ritz process uses a similar 
approach, in which the shape function matrix h(X) in Eq. (6) is expressed 
in terms of functions that are valid throughout the whole region and q is 
just a parameter matrix, which may or may not be associated with nodal values 
of u. In this case there is usually no distinction between q and r. 

The basic premise of the finite element method is to use elements that 
are small and simple in shape. Over a small region, we can approximate, 
with good accuracy, any field variable by polynomials (shape or 
interpolation functions), e.g., Eqs. (18.2:1) and (18.2:9), or special functions that 
interpolate the values of the field variable at selected points within the 
small region. From this point of view, q does not have to be the nodal 
values of the field variables and/or its spatial derivatives. For instance, 
in the Galerkin method, the components of q are simply parameters asso¬ 
ciated with some base functions for the field variables. However, having 
the type of q as chosen in Sec. 18.2 makes it easier to meet the continuity 
requirements along inter-element boundaries, which we shall see soon. 

The continuity requirement generally does not cause any difficulty for 
one-dimensional problems. Elements for 2-dimensional problems are planes or 
surfaces and those for 3-dimensional problems are solids. The common boundaries 
of elements are lines or curves for 2-dimensional and planes or curved surfaces 
for 3-dimensional problems. The element boundaries are defined by a finite 
number of discrete points, called nodes. The elements are connected to 
each other at a finite number of discrete nodes. Figure 18.1:1 shows plane 
elements with inter-element boundaries connected by two nodes. The finite 
element method employs the following strategy to meet the continuity 
requirements for problems of higher dimensions. First, nodal values of u, and 
its spatial derivatives if needed, are chosen as parameters. This approach 
automatically assures the continuity of the field variable at the 
interconnecting nodes, as the nodal parameters of all elements have the same 
values at their common nodes. This also provides simple rules for assembling 
element matrices to form the global matrices, i.e., to establish the algebraic 
equations for the whole domain. Second, the interpolation functions within 
elements are chosen to 

1. allow q of Eq. (6) to be expressed easily in terms of the nodal 
parameters of u; 

2. assure that the compatibility of u at the nodes automatically guar¬ 
antees compatibility along the entire inter-element boundaries (an 
important consideration for 2- and 3-dimensional problems), and 

3. uniquely determine the value of the integral form or the functional of 
the element defined in Eq. (5) as a function of q except for possible 
rigid body motions. This is to avoid the zero energy modes to be 
discussed later. 

The element matrices such as the element stiffness matrix and the force 
matrix (to be defined) can then be evaluated from Eq. (5). The unknown 
parameter or generalized coordinate matrix may include values of the spa¬ 
tial derivatives of the field variables, e.g., Eq. (18.3:5). This is often the case 
for higher order differential equations such as bending of beams (Sec. 18.3), 
plates and shells, which involve a fourth order differential equation for the 
out-of-plane displacement. Generally, higher order of continuity is required 
for the field variables involving differentials of higher order. 

The key difference between the conventional Ritz method and the 
finite element method is the manner in which the field variables are 
approximated. The Ritz process uses “assumed functions” that are valid for 
the entire domain, leading generally to a fully populated matrix K in Eq. (8). 
In the finite element process, interpolation functions are assumed for each 
element and each nodal parameter is associated only with the elements connected 
to the node. This characteristic leads to a sparse and usually banded 
matrix K, e.g., Eq. (18.4:3) has a semi-bandwidth of 2 or 3. One can also use 
this property to determine the semi-bandwidth of a particular 
degree-of-freedom. By its nature the conventional Ritz method is limited to problems 
with regions of relatively simple shape, since the assumed functions must be 
valid for the entire domain and satisfy all rigid boundary conditions. For 
the finite element method, interpolation functions can be constructed easily 
because elements generally have simple shapes. Assemblies of elements of 
simple shapes are then used to represent complex configurations. 
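
Because a nodal parameter couples only with parameters of the elements attached to its node, the semi-bandwidth can be read off the assembly lists. The Python sketch below is illustrative: the function name and the sample numberings are assumptions made here, and the semi-bandwidth is counted as including the diagonal, a convention chosen to match the values 2 and 3 quoted above.

```python
def semi_bandwidth(assembly_lists):
    """Largest spread of global labels within any one element,
    plus one for the diagonal entry."""
    return max(max(labels) - min(labels) for labels in assembly_lists) + 1

# Chains of four rod elements with consecutive global labels.
two_node_chain = [[e, e + 1] for e in range(4)]
three_node_chain = [[2 * e, 2 * e + 1, 2 * e + 2] for e in range(4)]

# The same four 2-node elements with a deliberately poor numbering.
badly_numbered = [[0, 4], [4, 1], [1, 3], [3, 2]]

print(semi_bandwidth(two_node_chain),
      semi_bandwidth(three_node_chain),
      semi_bandwidth(badly_numbered))
```

The last case shows that the bandwidth depends on how the nodes are labeled, not only on the mesh, which is why node numbering matters for banded solvers.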

Equation (4), or Eqs. (18.2:16) and (18.2:17), which represents the 
functional of the whole domain as the sum of the functionals of the individual 
elements or sub-domains, is a very powerful and versatile concept. It permits 
the use of different approximations for each element. We must, of course, make 
sure that the sum of the functionals of all elements is indeed equal to the 
functional of the whole system as demanded by the admissibility 
requirement, e.g., u must be continuous over the entire domain in Eqs. (18.2:16) 
and (18.2:17). Allowing different approximations for different elements is 
particularly useful in dealing with problems with singularities or steep 
gradients. For instance, the hybrid singular element (Tong et al. 1973, Tong 
1977, 1984) is used for domains involving a singularity. The flexibility also 
allows easy handling of problems with many components, such as solid-fluid 
interactions where part of the domain is solid and part is liquid, and structures 
involving solids, beams and shells where part of the domain is governed by 
second order differential equations and part by fourth order equations. 

Therefore, the finite element method is inherently an element-based 
method in which local approximate solutions in terms of unknown 
eters are constructed based on the integral form for each element or sub- 
domain. The local solutions would have to satisfy certain compatibility and 
completeness requirements to assure convergence. The approximate solu¬ 
tion for the whole domain is determined from the integral form as given 
by Eq. (4). 

The finite element method permits the systematic selection of 
interpolation functions, the construction of element matrices, and the assembly of the 
element matrices into a system of simultaneous algebraic equations for the 
final solution. The interpolation functions, the assembly process, and the 
solution methods for a large system of equations are generic, i.e., they can 
be applied to almost any type of problem. The element stiffness matrix 
depends on the problem type, such as beams, plane problems, 3-dimensional solids, 
etc., rather than on the specifics of the problem such as the dimensions, the 
boundary conditions, and the load distributions. The element applied force 
matrix depends on the distributed loads, but the process of determining it 
is again generic. We can use the same type of element stiffness matrix to 
solve problems of a plane with a square hole or with a circular hole. The 
nature of the problem is reflected in the rigid and natural constraints on 
the boundaries, the sizes, shapes, and number of elements, and the values of the 
distributed load. These inherent generic characteristics make it possible to 
apply the finite element method to almost any problem with ease. 

18.8. CONVERGENCE 

We can examine the convergence of the finite-element solution to the exact 
solution from two points of view. One is to increase the number of 
degrees-of-freedom per element to infinity with the element size fixed. The other is, 
by subdivision of the domain into smaller and smaller elements, to reduce 
the size of each element to zero with the number of degrees-of-freedom per 
element (and hence the order of accuracy of the local approximate solution) 
fixed. If one uses polynomials as the approximation within the element, the 
former procedure leads to the so-called p-convergence, where p denotes the 
degree of complete polynomial used as shape functions. This procedure is 
not commonly used in practice because increasing the number of degrees- 
of-freedom of an element means reformulating the local approximation with 
higher order interpolation functions, which can be tedious. The latter 
procedure leads to the so-called h-convergence, where h denotes the size of the 
element. In the latter procedure, we only have to continuously sub-divide 
the domain being considered, which is relatively straightforward and can 
be used to test for the rate of convergence. 

The p-convergence, i.e., increasing the order of the approximating 
polynomials to infinity, can be proved from the convergence of the Taylor 
series expansion. The h-convergence, i.e., reducing the element size to zero, 
can be established as follows. The finite element method constructs the 
solution of the entire domain from the local approximate solutions based 
on an integral formulation. If the solution is to converge to the exact 
solution, the functional or the integral form itself must converge as the size 
of the element diminishes. In proving convergence, then one only has to 
show that the convergence of the functional or the integral form implies the 
convergence of the function itself. 
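
The rate of h-convergence can be observed numerically by solving the same problem on successively halved meshes. The Python sketch below is an illustration under assumptions introduced here: a rod with constant EA and constant distributed load p, fixed at X = 0 and traction-free at X = L, two-node linear elements, and the error measured at element midpoints. The function name `midpoint_error` is hypothetical.

```python
import numpy as np

def midpoint_error(n_el, L=1.0, EA=1.0, p=1.0):
    """Maximum midpoint error of a linear-element solution of
    d/dX(EA du/dX) - p = 0, u(0) = 0, EA du/dX = 0 at X = L."""
    l = L / n_el
    n = n_el + 1
    K = np.zeros((n, n))
    F = np.zeros(n)
    for e in range(n_el):
        K[e:e + 2, e:e + 2] += (EA / l) * np.array([[1.0, -1.0], [-1.0, 1.0]])
        F[e:e + 2] += -p * l / 2.0
    q = np.zeros(n)                        # q[0] = 0 is the rigid condition
    q[1:] = np.linalg.solve(K[1:, 1:], F[1:])
    X = np.linspace(0.0, L, n)
    X_mid = 0.5 * (X[:-1] + X[1:])
    u_mid = 0.5 * (q[:-1] + q[1:])         # linear interpolation at midpoints
    exact = p * (X_mid**2 / 2.0 - L * X_mid) / EA
    return np.max(np.abs(u_mid - exact))

sizes = [4, 8, 16, 32]
errors = [midpoint_error(n) for n in sizes]
rates = [np.log2(errors[i] / errors[i + 1]) for i in range(len(errors) - 1)]
print(errors, rates)
```

Each halving of the element size should reduce this error by a factor of about four, i.e., an observed rate close to 2 for linear elements in this particular problem.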

In practice, there are two convergence criteria: 

1. The functional itself must be mathematically defined over the 
entire domain. In other words, the assumed functions for the 
independent fields must meet the admissibility requirements of the 
functional. The admissibility requirements are: the enforcement of rigid 
boundary conditions and $C^0$ continuity for independent fields of the 
functional involving first order differentials, $C^1$ continuity for 
independent fields involving second order differentials, and so on. 

2. The highest differentials of the dependent variables (in terms of the 
generalized coordinates) in the integrand of the functional must be 
able to represent any constant within an element as the element size 
approaches zero. 

The second criterion is called the completeness requirement. Interested 
readers are referred to the literature for the details of convergence proof 
(Tong and Pian 1967, Strang and Fix 1973, Babuska 1971, Oden 1969, 
Szabo et al. 1988). 
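
As a simple illustration of the completeness requirement, consider the two-node rod element, for which the highest differential in the functional is $du/dX$. The check below assumes the linear shape functions $h_1 = (1-\xi)/2$ and $h_2 = (1+\xi)/2$ on $-1 \le \xi \le 1$ and an element of length $l$; these specifics are assumptions made here for the example.

```latex
u = \tfrac{1}{2}(1-\xi)\,q_1 + \tfrac{1}{2}(1+\xi)\,q_2 ,
\qquad
\frac{du}{dX} = \frac{2}{l}\,\frac{du}{d\xi} = \frac{q_2 - q_1}{l} .
```

Any constant value $c$ of $du/dX$ is obtained by choosing $q_2 = q_1 + c\,l$, so the element can represent an arbitrary constant strain however small $l$ becomes, which is what the second criterion demands.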

In the subsequent sections, we shall discuss the shape functions that 
meet the criteria set forth above, the element matrices, and the assembly 
of element matrices. 

18.9. TWO-DIMENSIONAL SHAPE FUNCTIONS 

Consider a two-dimensional domain A shown in Fig. 18.1:1, which is 
divided into a finite number of triangular and quadrilateral elements. The 
common boundaries of elements (the inter-element boundaries) are lines or 
curves defined by two or more nodes. The nodes are usually the vertices of 
the element, the midpoints, the one-third and two-third points, etc. along 
the element boundaries. Elements are connected to each other at these 
nodes. Additional points within the element may also be selected as nodes, 
which are called interior nodes. We will illustrate the use of the nodal 
values of the field variables and their spatial derivatives as parameters to 
characterize the element behavior. The latter are especially needed for higher 
order differential equations. These parameters are called degrees-of-freedom 
or generalized coordinates. For simplicity, in the subsequent discussion, we 
will use only the local spatial coordinates to describe an element. 

Consider a family of triangular elements with progressively increasing 
number of nodes on the element boundaries (Fig. 18.9:1). The nodes are 
indicated by dots. We seek to approximate a 2-dimensional smooth function 
in an element by interpolating the values of the function at the nodes. 
Each nodal value is a parameter called a degree-of-freedom. In this case the 
number of degrees-of-freedom to approximate a function equals the number 
of nodes in the element. For the time being, only polynomials are used as 
the interpolation or shape function. Therefore, the number of element nodes 
equals the number of terms needed in the polynomial representation. 3 

3 If the derivatives of the function are also used as parameters, more polynomial 
terms will be needed. 

Fig. 18.9:1. (a) Linear, (b) quadratic, (c) cubic triangular elements. Node 10 is an 
interior node at the centroid. 

The polynomial representation is a truncated Taylor series expansion. 
The order of approximation equals one plus the highest order of complete 
polynomial 4 in the expansion. Therefore, it is desirable to have an expan¬ 
sion which possesses the highest order of complete polynomial possible for 
a given number of degrees-of-freedom. This will give the highest order of 
approximation for a given number of parameters. The more degrees-of-freedom 
there are, the higher the order of the complete polynomial that can be obtained. 
Thus elements with more nodes generally give higher order approximation 
and are called higher order elements. 5 

Shape Function Variation on Element Boundaries. Other than 
choosing polynomial terms to give the highest order of complete polynomial 
for a shape function, we also need to select the type of polynomial terms to 
ensure full inter-element compatibility, i.e., to have the approximate 
function meet the admissibility requirement of the integral formulation. This 
may limit the acceptability of certain higher order polynomial terms and 
thus the acceptability of a higher order complete polynomial. Let us 
consider the case of $C^0$ continuity. An interpolated function is smooth and 
generally satisfies the continuity requirement within each element. Since 
the interpolated function has the same value for all elements at their 
common nodes, the interpolated function will also meet the compatibility 
condition at all nodes. However, we must assure that compatibility at the 
common nodes of two adjacent elements implies compatibility along their 
entire inter-element boundary. This requirement limits the variation of the 
shape function at element boundaries. For plane elements, element 
boundaries are straight lines or curves. If the shape function is a polynomial 
of order $j$ along a boundary with $j + 1$ nodes (the number of terms in a 
$j$th order complete polynomial), then full compatibility between the two 
elements sharing this boundary is assured. Thus, shape functions can be 
at most linear on an edge with two nodes [Fig. 18.9:1(a)] to ensure full 
compatibility over the entire edge. A six-node triangle [Fig. 18.9:1(b)] has 
three nodes on each side. Full compatibility is assured if the shape 
function does not have any terms higher than quadratic on the side. For a 
10-node triangle [Fig. 18.9:1(c)], the shape functions can be cubic at most, 
and so on. 

4 A complete polynomial includes all polynomial terms up to a certain order, e.g., a 
complete linear polynomial includes the constant and all linear terms, a complete 
quadratic polynomial includes the constant and all linear and quadratic terms, etc. 

5 Elements with more degrees-of-freedom do not give a higher order approximation if 
lower order polynomial terms are left out of the shape functions. 

In summary, the number of degrees-of-freedom for a field variable is the 
same as the number of nodes in the element if the nodal values at these nodes 
are used as the generalized coordinates. It is desirable that the shape 
function has the highest order of complete polynomial for a given number 
of degrees-of-freedom in order to get the highest order of approximation. 
However, the type of polynomial terms is restricted by the compatibility 
requirement, which can limit the allowable order of complete polynomials. 
We shall further examine these points below. 

For one-dimensional problems, an inter-element boundary is simply a 
node. If an interpolated function is compatible at the inter-element nodes, it 
always satisfies the compatibility conditions. For two and three-dimensional 
problems, inter-element boundaries are lines and planes, respectively. The 
finite element method enforces the compatibility condition only at the inter¬ 
connecting nodes. Therefore, the interpolation function must be chosen in 
such a way that compatibility at the common nodes of an inter-element 
boundary automatically guarantees compatibility over the entire boundary. 

Shape Functions for 3-Node Triangle. Over the area of a three-node 
triangle [Fig. 18.9:1(a)] with the nodal coordinates $(x_i, y_i)$, $i = 1, 2, 3$, 
we can approximate a field variable $\phi$ by the first 3 terms of the Taylor 
expansion as 

(1)    $\phi = \alpha_1 + \alpha_2 x + \alpha_3 y.$ 

The error in the approximation within the triangle is of the order $\epsilon^2$, where 
$\epsilon$ is the maximum linear dimension of the triangle. The approximated $\phi$ 
can be written in the form 

(2)    $\phi(x, y) = \sum_{j=1}^{3} q_j h_j(x, y) = \mathbf{h}\mathbf{q},$ 

where 

(3)    $\mathbf{h} = [\,h_1(x, y) \;\; h_2(x, y) \;\; h_3(x, y)\,], \qquad \mathbf{q} = \begin{bmatrix} q_1 \\ q_2 \\ q_3 \end{bmatrix} = \begin{bmatrix} \phi(x_1, y_1) \\ \phi(x_2, y_2) \\ \phi(x_3, y_3) \end{bmatrix},$ 


with h being the shape function matrix and q being the element generalized 
coordinate matrix whose components $q_j$ are the nodal values of $\phi$ at the 
vertices. One can determine the shape functions by evaluating Eq. (1) at 
the three vertices (nodes) and solving for the $\alpha$'s in terms of the $q$'s. The 
solution is 

(4)    $h_j(x, y) = \zeta_j, \quad j = 1, 2, 3,$ 

(5)    $\begin{bmatrix} \zeta_1 \\ \zeta_2 \\ \zeta_3 \end{bmatrix} = \frac{1}{2A} \begin{bmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{bmatrix} \begin{bmatrix} 1 \\ x \\ y \end{bmatrix},$ 

(6)    $A = (x_2 y_3 - x_3 y_2 + x_3 y_1 - x_1 y_3 + x_1 y_2 - x_2 y_1)/2,$ 

(7)    $a_1 = x_2 y_3 - x_3 y_2, \quad b_1 = y_2 - y_3, \quad c_1 = x_3 - x_2,$ 

whereas $a_2, b_2, c_2$ and $a_3, b_3, c_3$ are obtainable from Eq. (7) by cyclic 
permutation of the subscripts 1, 2, 3, and $A$ is the area of the triangle. Note 
that the $\zeta$'s satisfy 

(8)    $\zeta_i(x_j, y_j) = \delta_{ij},$ 

i.e., $\zeta_i = h_i$ equals 1 at node $i$ and zero at the other two nodes. This is a 
Lagrange interpolation. The $\zeta$'s are commonly referred to as the triangular 
coordinates. One can show that the $\zeta$'s are the normalized areas of certain 
triangles, 

$\zeta_1 = \frac{A_{P23}}{A}, \quad \zeta_2 = \frac{A_{P31}}{A}, \quad \zeta_3 = \frac{A_{P12}}{A},$ 

where $A_{Pjk}$ denotes the area of the triangle with nodes $P$, $j$, $k$ as shown in 
Fig. 18.9:2, with $P$ being a point within the triangle. The $\zeta$'s are not 
independent; they satisfy the equation 

$\zeta_1 + \zeta_2 + \zeta_3 = 1.$ 

Fig. 18.9:2. Triangular coordinates. 

One can also show that $\zeta_j$ is a linear function of $x$, $y$ and that 
$\zeta_j$ = constant is a straight line parallel to the side of the triangle opposite 
to node $j$. The triangular coordinates are useful for constructing higher order 
shape functions for triangular elements. 
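
The formulas for the triangular coordinates translate directly into a few lines of Python. The sketch below is illustrative only: the function name, the use of NumPy, and the sample triangle are assumptions introduced here, and the vertices are assumed to be numbered counterclockwise so that the area A is positive.

```python
import numpy as np

def triangular_coordinates(xy, x, y):
    """Return (zeta_1, zeta_2, zeta_3) at the point (x, y) for the triangle
    whose vertices are listed in xy, using a_i, b_i, c_i and 2A."""
    (x1, y1), (x2, y2), (x3, y3) = xy
    two_A = x2 * y3 - x3 * y2 + x3 * y1 - x1 * y3 + x1 * y2 - x2 * y1
    # a_1, b_1, c_1 and their cyclic permutations of the subscripts 1, 2, 3
    a = np.array([x2 * y3 - x3 * y2, x3 * y1 - x1 * y3, x1 * y2 - x2 * y1])
    b = np.array([y2 - y3, y3 - y1, y1 - y2])
    c = np.array([x3 - x2, x1 - x3, x2 - x1])
    return (a + b * x + c * y) / two_A

tri = [(0.0, 0.0), (2.0, 0.0), (0.0, 1.0)]

# Kronecker-delta property at the vertices.
Z = np.array([triangular_coordinates(tri, x, y) for x, y in tri])

# Linear interpolation of a linear field phi = 3 + 2x - y from its nodal values.
q = np.array([3.0 + 2.0 * x - y for x, y in tri])
z = triangular_coordinates(tri, 0.5, 0.25)
phi_interp = z @ q
print(Z, z, phi_interp)
```

Because the three shape functions span the complete linear polynomial, the interpolated value of a linear field agrees with the field itself at any point of the triangle.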

If $\phi$ is defined as in Eq. (2) for all triangles that cover a domain and if 
$\phi$ is continuous at all inter-connecting nodes, then $\phi$ is piecewise linear 
and continuous over the whole domain. Thus $\phi$ meets the $C^0$ continuity 
requirement over the whole domain. From Eq. (1), the $h_i$ are the shape functions 
of the complete linear polynomial. In Eq. (2), the subscript of $q_j$ refers 
to the local label, which means that $q_j$ is the $j$th generalized coordinate of 
the element. For assembly of element matrices to form the global matrices, 
we need the information on the global nodal labels $I_j$ of the vertices $j$, 
$j = 1, 2, 3$, of each element in order to establish the assembly list for all 
elements. 

Shape Functions for 6-Node Triangle. Consider a six-node triangle 
[Fig. 18.9:1(b)] with nodes 1, 2, 3 at the vertices, and nodes 4, 5, 6 at the 
midsides of the edges. The nodes are located at $(x_j, y_j)$, $j = 1, 2, \ldots, 6$. The 
representation 

$\phi = \alpha_1 + \alpha_2 x + \alpha_3 y + \alpha_4 x^2 + \alpha_5 xy + \alpha_6 y^2$ 

gives an error of the order of $\epsilon^3$ in approximation over the triangle, where 
$\epsilon$ is the maximum linear dimension of the triangle. Expressing $\phi$ in terms 
of the nodal values $q_j$, one obtains 

(9)    $\phi(x, y) = \sum_{j=1}^{6} q_j h_j(x, y) = \mathbf{h}\mathbf{q},$ 

which is in the same form as Eq. (2) except that now both h and q are 
matrices of six components. The components of h are 

(10)    $h_i(x, y) = \zeta_i(2\zeta_i - 1), \qquad h_{i+3}(x, y) = 4\zeta_j\zeta_k,$ 

where $i = 1, 2, 3$, and $i$, $j$, $k$ are the cyclic permutations of 1, 2, 3. The 
shape functions satisfy 

(11)    $h_i(x_j, y_j) = \delta_{ij}, \qquad i, j = 1, \ldots, 6.$ 

In other words, $h_i$ is associated with node $i$ in that it equals 1 at node $i$ and 
zero at all other nodes. Thus, the $h$'s are Lagrange interpolation functions. 

The shape functions can be obtained in general by solving for the $\alpha$'s in 
terms of the $q$'s. They can also be derived systematically by the use of the 
triangular coordinates (Zienkiewicz 1971, Gallagher 1974, 1975, Tong and 
Rossettos 1977, Zienkiewicz and Taylor 1989). The shape function $h_j$ is a 
normalized product of a minimum number of line functions determined by 
the rules that all nodes except node $j$ must be passed by at least one line 
and that each line must pass through at least one node other than node $j$. 
The product is normalized to unity at node $j$. As an example, we see that 
$\zeta_j$ = constant is linear in $x$, $y$ and represents a line in the $xy$ plane. The 
shape function $h_1(x, y) = \zeta_1(2\zeta_1 - 1)$, with node 1 being a corner node, 
is the product of two line functions. The line $\zeta_1 = 0$ passes through 
nodes 2, 4, 3 on the side of the element opposite to node 1 [Fig. 18.9:1(b)]. 
The other line, $2\zeta_1 - 1 = 0$, passing through nodes 5 and 6, is parallel to 
the same side. The function $h_1$ equals unity at $\zeta_1 = 1$. At least two lines 
are required to cover the other five of the six nodes. Therefore, the minimum 
number of lines required is two and the resulting shape function is quadratic 
in $x$, $y$. The shape function $h_4(x, y) = 4\zeta_3\zeta_2$ (with node 4 being a midside node) 
is a product of two line functions, with one passing through nodes 1, 
5, 3 and the other passing through nodes 1, 6, 2. These two lines are the 
sides of the triangle not containing node 4. The factor 4 is to make $h_4$ equal 
to unity at $\zeta_3 = \zeta_2 = 1/2$. Similarly, one can derive the other shape functions. 
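
The properties of the six quadratic shape functions can be checked numerically in triangular coordinates. In the Python sketch below, the node ordering (corner nodes 1, 2, 3 followed by midside nodes 4, 5, 6 opposite to nodes 1, 2, 3, respectively) follows the description in the text, while the function name and the test point on the edge are assumptions made here for illustration.

```python
import numpy as np

def quadratic_shape_functions(zeta):
    """h_1 ... h_6 of the 6-node triangle at triangular coordinates
    zeta = (z1, z2, z3)."""
    z1, z2, z3 = zeta
    return np.array([
        z1 * (2 * z1 - 1), z2 * (2 * z2 - 1), z3 * (2 * z3 - 1),  # corner nodes 1, 2, 3
        4 * z2 * z3, 4 * z3 * z1, 4 * z1 * z2,                    # midside nodes 4, 5, 6
    ])

# Triangular coordinates of the six nodes.
nodes = [(1, 0, 0), (0, 1, 0), (0, 0, 1),
         (0, 0.5, 0.5), (0.5, 0, 0.5), (0.5, 0.5, 0)]

# Row j holds all six shape functions evaluated at node j.
H = np.array([quadratic_shape_functions(z) for z in nodes])

# A point on the side opposite to node 1 (zeta_1 = 0).
on_edge = quadratic_shape_functions((0.0, 0.3, 0.7))
print(H)
print(on_edge)
```

On the side $\zeta_1 = 0$ only the functions of nodes 2, 3 and 4 are nonzero, so the variation along that side is fixed by the three nodal values on it. This is the mechanism by which nodal compatibility extends to the whole side.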

The product of two line-functions is a quadratic function. The 6-node 
triangle has three nodes on each side. With the shape functions being 
quadratic, the compatibility at the three nodes on the side guarantees 
compatibility along the whole side. The six quadratic functions given in Eq. (10) 
are all linearly independent; therefore, the $h_j$ are the shape functions of a 
complete quadratic polynomial. The function $\phi$ as defined in Eq. (9) is piecewise 
quadratic over the domain. The array $I_i$, specifying the global nodal label 
of the $i$th node of the element, has six components, i.e., $i = 1, \ldots, 6$. 

Using the triangular coordinates, one can derive families of higher 
order interpolation functions for triangular elements with ease. For cubic 
elements, one can have four nodes on each side for a total of 9 
boundary nodes. A complete cubic polynomial has 10 terms, which allows 
an extra node to be placed in the interior of the triangle. The shape 
function for the tenth node is $27\zeta_1\zeta_2\zeta_3$, which is the normalized product 
of three line functions representing the sides of the triangle and is unity 
at the centroid of the triangle, where $\zeta_1 = \zeta_2 = \zeta_3 = 1/3$. This shape function 
is also called the bubble function because it vanishes on all sides. The other 
9 shape functions are products of three line functions of which one passes 
through the centroid. For example, in $h_1(x, y) = \zeta_1(3\zeta_1 - 1)(3\zeta_1 - 2)/2$ and 
$h_4(x, y) = 9\zeta_1\zeta_2(3\zeta_1 - 1)/2$, the line function that passes through the 
centroid is $3\zeta_1 - 1 = 0$. This choice of line functions assures that all the shape 
functions associated with boundary nodes are zero at the centroid. The 
complete fourth order polynomial has 15 terms, which leads to a 15-node 
triangle with 12 nodes on the sides and 3 in the interior. 

One can derive an element whose shape functions form a complete polynomial
only if the number of boundary nodes needed by the polynomial expansion to
assure compatibility is less than or equal to the number of complete
polynomial terms. It can be shown (Problem 18.10) that triangular elements
allow complete polynomials of all orders in the Cartesian coordinates. For
the cubic expansion discussed above, one needs four boundary nodes on each
side to assure full compatibility of cubic functions. This makes a total of
nine boundary nodes for a triangle. The complete cubic polynomial has 10
terms. Thus we can have a triangular element with complete cubic
polynomials as shape functions. In Fig. 18.9:3, the complete polynomial
terms needed for a triangular element are shown in the rows above and at
the same level of the Pascal triangle. The needed terms are arranged in the
shape of a triangle called the truncated Pascal triangle. Thus, terms
through the second row are needed for linear elements, through the third
row for quadratic elements, and so on.

Pascal triangle                        Polynomial degree, p    Number of terms, n
1                                      0 (constant)            1
x  y                                   1 (linear)              3
x^2  xy  y^2                           2 (quadratic)           6
x^3  x^2y  xy^2  y^3                   3 (cubic)               10
x^4  x^3y  x^2y^2  xy^3  y^4           4 (quartic)             15

Fig. 18.9:3. Relation between type of triangular elements and polynomial
terms used for interpolation. For each triangular element, the number of
nodes equals the number of terms n.
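The node count behind the truncated Pascal triangle can be tabulated with a few lines of code. This is only an illustrative sketch; the function names are hypothetical, and the counts used are the ones quoted in the hint of Problem 18.10.

```python
def complete_terms(p):
    """Number of terms in a complete polynomial of degree p in x and y."""
    return (p + 1) * (p + 2) // 2

def boundary_nodes(p):
    """Boundary nodes of a triangle with p + 1 nodes on each side."""
    return 3 * p

def interior_nodes(p):
    """Nodes left over for the interior once the sides are populated."""
    return complete_terms(p) - boundary_nodes(p)

if __name__ == "__main__":
    for p in range(1, 5):
        print(p, complete_terms(p), boundary_nodes(p), interior_nodes(p))
```

For the cubic and quartic cases this reproduces the one and three interior nodes mentioned in the text.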

The representation of a field variable in the form of Eq. (2) or (9),
derived without knowing the specifics of the variable itself, is generic.
The use of natural coordinates makes it easy to construct $C^0$ continuous
shape functions. The generic representation can be used to construct finite
element solutions for any problem that requires $C^0$ continuity.

Problem 18.9. Use the triangular coordinates to derive the shape functions
that satisfy the $C^0$ continuity requirement for a 10-node triangular element.
The 10 nodes are the three vertices, the 1/3- and 2/3-points of each edge, and
the centroid of the triangle.

Problem 18.10. Show that it is possible to construct the shape functions of
a complete polynomial of any order higher than or equal to one for triangular
elements. [Hint: Show that the minimum number of boundary nodes required for a
complete $p$th order polynomial expansion is $3p$ while the number of complete
polynomial terms is $(p + 1)(p + 2)/2$.]

18.10. ELEMENT MATRICES FOR A SECOND-ORDER 
ELLIPTICAL EQUATION 

To illustrate the derivation of element matrices, we consider a general
second-order elliptical equation

(1)  $$\frac{\partial}{\partial x_\alpha}\left(D_{\alpha\beta}\frac{\partial\phi}{\partial x_\beta}\right) - g\phi = b \quad \text{on } A,$$

where $D_{12} = D_{21}$, the range of $\alpha, \beta$ is from 1 to 2 with
$x_1 = x$, $x_2 = y$, and the $D$'s, $g$ and $b$ can all be functions of
$x, y$. For Eq. (1) to be elliptical, the quadratic form
$D_{\alpha\beta}x_\alpha x_\beta$ must be positive definite, i.e.,
$D_{\alpha\beta}x_\alpha x_\beta > 0$ for all $x_\alpha$ not all equal to zero.
Recall that repeated indices denote summation.
The boundary conditions are 


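(2)  $$\phi = \bar{\phi} \quad \text{on } \partial A_\phi,$$

(3)  $$n_\alpha D_{\alpha\beta}\frac{\partial\phi}{\partial x_\beta} = \bar{\phi}_\sigma \quad \text{on } \partial A_\sigma,$$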


where $\partial A_\phi$ and $\partial A_\sigma$ are the boundaries over which
$\phi$ and $n_\alpha D_{\alpha\beta}\,\partial\phi/\partial x_\beta$ are
specified to be $\bar{\phi}$ and $\bar{\phi}_\sigma$, respectively, and
$n_\alpha$ are the components of the unit outward normal to the boundary. The
boundary of $A$ is $\partial A$, which equals $\partial A_\phi + \partial A_\sigma$.

The integral formulation of Eq. (1) can be expressed as the first variation
of the potential energy Eq. (18.7:4). The variation of the functional for an
element with $\phi = \mathbf{h}\mathbf{q}$ and $\delta\phi = \mathbf{h}\,\delta\mathbf{q}$
can be written in the matrix form

(4)  $$\delta\Pi_e = \delta\mathbf{q}^T(\mathbf{k}\mathbf{q} - \mathbf{f}),$$

where

$$\delta\mathbf{q}^T\mathbf{k}\mathbf{q} = \int_{A_e}\left(D_{\alpha\beta}\frac{\partial\phi}{\partial x_\alpha}\frac{\partial\,\delta\phi}{\partial x_\beta} + g\phi\,\delta\phi\right)dA,$$

$$\delta\mathbf{q}^T\mathbf{f} = -\int_{A_e} b\,\delta\phi\,dA + \int_{\partial A_{\sigma e}}\bar{\phi}_\sigma\,\delta\phi\,ds,$$

in which $A_e$ is the area of the element and $\partial A_{\sigma e}$ is the
portion of the boundary $\partial A_\sigma$ occupied by the element. The
admissibility requirements are that $\phi$ be $C^0$ continuous and satisfy the
rigid boundary condition. In the integral formulation, Eq. (2) is a rigid
boundary condition, and Eq. (3) is a natural one. The term associated with
integration over $\partial A_{\sigma e}$ accounts for the natural boundary
condition automatically, while it is necessary to enforce the rigid
condition. The components of $\mathbf{k}$ and $\mathbf{f}$ are

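(5)  $$k_{ij} = \int_{A_e}\left(D_{\alpha\beta}\frac{\partial h_i}{\partial x_\alpha}\frac{\partial h_j}{\partial x_\beta} + g h_i h_j\right)dA,$$

(6)  $$f_i = -\int_{A_e} b h_i\,dA + \int_{\partial A_{\sigma e}}\bar{\phi}_\sigma h_i\,ds.$$

The range of $i, j$ is from 1 to 3 for a three-node triangle and from 1 to 6
for a six-node triangle. One can also express $\mathbf{k}$ and $\mathbf{f}$ in
the matrix form

(7)  $$\mathbf{k} = \int_{A_e}\left(\mathbf{B}^T\mathbf{D}_e\mathbf{B} + g\,\mathbf{h}^T\mathbf{h}\right)dA,$$

(8)  $$\mathbf{f} = -\int_{A_e} b\,\mathbf{h}^T dA + \int_{\partial A_{\sigma e}}\bar{\phi}_\sigma\mathbf{h}^T ds,$$

where

(9)  $$\mathbf{h} = [h_1\ h_2\ \cdots\ h_p], \qquad \mathbf{B} = \partial\mathbf{h} = \begin{bmatrix}\partial\mathbf{h}/\partial x\\ \partial\mathbf{h}/\partial y\end{bmatrix}, \qquad \mathbf{D}_e = \begin{bmatrix} D_{11} & D_{12}\\ D_{12} & D_{22}\end{bmatrix},$$

in which $\mathbf{D}_e$ is the coefficient matrix of the elliptical equation or
the elastic modulus matrix in elasticity. If $\mathbf{h}$ is an interpolation
(row) matrix in terms of the triangular coordinates, the derivatives with
respect to $x, y$ can be written as

$$\frac{\partial\mathbf{h}}{\partial x} = \frac{b_i}{2A}\frac{\partial\mathbf{h}}{\partial\zeta_i}, \qquad \frac{\partial\mathbf{h}}{\partial y} = \frac{c_i}{2A}\frac{\partial\mathbf{h}}{\partial\zeta_i},$$

where $b$'s, $c$'s and $A$ are defined in Eqs. (18.9:6) and (18.9:7).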

One can in principle integrate Eqs. (5) and (6) analytically by
approximating the $D$'s, $g$, $b$, $\bar{\phi}_\sigma$ as polynomials. In
practice, numerical integration is frequently used, which will be discussed
later in the chapter.

Clearly $\mathbf{k}$ is a symmetric $p \times p$ matrix. The rank of the
matrix is $p$ if $g > 0$ in the element. This is because the first term of the
integrand of Eq. (5) is non-negative for an elliptical equation and the second
term of the integrand is always positive. If $g = 0$, the rank of the matrix
is at most $p - 1$ because $\mathbf{k}$ has at least one zero-eigenvector,
which corresponds to $\phi$ = constant, i.e., all components of $\mathbf{q}$
are the same, resulting in $\partial\phi/\partial x_\alpha = 0$. As pointed
out before, the rank of $\mathbf{k}$ can be lower if one uses approximate
integration for Eq. (5).


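Three-node Triangular Element. For a three-node triangular element,
the shape functions are given in Eq. (18.9:4). For the special case that
$D_{11}$, $D_{22}$, $D_{12}\,(= D_{21})$ and $g$ are constant, we find

$$\mathbf{k} = \frac{D_{11}}{4A}\begin{bmatrix} b_1^2 & b_1b_2 & b_1b_3\\ b_2b_1 & b_2^2 & b_2b_3\\ b_3b_1 & b_3b_2 & b_3^2\end{bmatrix} + \frac{D_{22}}{4A}\begin{bmatrix} c_1^2 & c_1c_2 & c_1c_3\\ c_2c_1 & c_2^2 & c_2c_3\\ c_3c_1 & c_3c_2 & c_3^2\end{bmatrix}$$

$$\qquad + \frac{D_{12}}{4A}\begin{bmatrix} 2b_1c_1 & b_2c_1 + b_1c_2 & b_3c_1 + b_1c_3\\ b_2c_1 + b_1c_2 & 2b_2c_2 & b_3c_2 + b_2c_3\\ b_3c_1 + b_1c_3 & b_3c_2 + b_2c_3 & 2b_3c_3\end{bmatrix} + \frac{gA}{12}\begin{bmatrix} 2 & 1 & 1\\ 1 & 2 & 1\\ 1 & 1 & 2\end{bmatrix}.$$

If $b$ is constant and if no element boundary is on $\partial A_\sigma$, then

$$\mathbf{f} = -\frac{bA}{3}\begin{bmatrix} 1 & 1 & 1\end{bmatrix}^T.$$

The finite element process lumps 1/3 of the applied load over the element
to each of the vertices.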
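As a concrete illustration, the element matrices of a three-node triangle with constant coefficients can be evaluated directly from the vertex coordinates. The sketch below is an assumption-laden example rather than the book's code: the function name is hypothetical, and it assumes the usual definitions $b_1 = y_2 - y_3$, $c_1 = x_3 - x_2$ (with cyclic permutation) and $2A = x_ib_i$ for Eqs. (18.9:6) and (18.9:7), which are not reproduced in this section.

```python
def triangle_matrices(xy, D11, D12, D22, g, b):
    """Return (k, f, A) for a 3-node triangle with constant D's, g and b."""
    (x1, y1), (x2, y2), (x3, y3) = xy
    bs = (y2 - y3, y3 - y1, y1 - y2)          # assumed b_i
    cs = (x3 - x2, x1 - x3, x2 - x1)          # assumed c_i
    # Signed area; positive for counter-clockwise vertex numbering.
    A = 0.5 * (x1 * bs[0] + x2 * bs[1] + x3 * bs[2])
    k = [[0.0] * 3 for _ in range(3)]
    for i in range(3):
        for j in range(3):
            gradient = (D11 * bs[i] * bs[j]
                        + D12 * (bs[i] * cs[j] + cs[i] * bs[j])
                        + D22 * cs[i] * cs[j]) / (4 * A)
            mass = g * A / 12 * (2 if i == j else 1)
            k[i][j] = gradient + mass
    f = [-b * A / 3] * 3                       # one third of the load per vertex
    return k, f, A

# Unit right triangle with D11 = D22 = 1, D12 = 0, g = 0 and b = 6.
k, f, A = triangle_matrices([(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)],
                            1.0, 0.0, 1.0, 0.0, 6.0)
```

With $g = 0$ every row of `k` sums to zero, which reflects the zero-eigenvector $\phi$ = constant noted above.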


Six-node Triangular Element. For a six-node triangle, the interpolation
functions are given in Eq. (18.9:10). In this case, $\mathbf{k}$ is a
$6 \times 6$ matrix and $\mathbf{f}$ is a $6 \times 1$ matrix. The components
of $\mathbf{k}$ and $\mathbf{f}$ can be evaluated from Eqs. (5) and (6) or (7)
and (8). If $D_{11}$, $D_{22}$, $D_{12}\,(= D_{21})$, $g$ and $b$ are constant,
we find

$$\mathbf{k} = [D_{11}\mathbf{k}(b_i, b_i) + 2D_{12}\mathbf{k}(b_i, c_i) + D_{22}\mathbf{k}(c_i, c_i)]/A + g\,\mathbf{k}_2 A.$$

Both $\mathbf{k}(b_i, c_i)$ and $\mathbf{k}_2$ are symmetric. The components of
the lower triangle of $\mathbf{k}(b_i, c_i)$ are

$$k_{ii} = b_ic_i/4, \quad i = 1, 2, 3 \ (i \text{ not summed}),$$
$$k_{ij} = -(b_ic_j + b_jc_i)/24, \quad i, j = 1, 2, 3 \ (i \ne j),$$
$$k_{61} = k_{62} = (b_2c_1 + b_1c_2)/6,$$
$$k_{42} = k_{43} = (b_3c_2 + b_2c_3)/6,$$
$$k_{53} = k_{51} = (b_3c_1 + b_1c_3)/6,$$
$$k_{63} = k_{41} = k_{52} = 0,$$
$$k_{66} = [(b_1 + 2b_2)c_2 + (2b_1 + b_2)c_1]/3,$$
$$k_{44} = [(b_2 + 2b_3)c_3 + (2b_2 + b_3)c_2]/3,$$
$$k_{55} = [(b_3 + 2b_1)c_1 + (2b_3 + b_1)c_3]/3,$$
$$k_{64} = [(b_2 + 2b_3)c_1 + (b_1 + 2b_2 + b_3)c_2 + (2b_1 + b_2)c_3]/6,$$
$$k_{54} = [(b_3 + 2b_1)c_2 + (b_2 + 2b_3 + b_1)c_3 + (2b_2 + b_3)c_1]/6,$$
$$k_{65} = [(b_1 + 2b_2)c_3 + (b_3 + 2b_1 + b_2)c_1 + (2b_3 + b_1)c_2]/6,$$

and

$$\mathbf{k}_2 = \frac{1}{180}\begin{bmatrix} 6 & -1 & -1 & -4 & 0 & 0\\ -1 & 6 & -1 & 0 & -4 & 0\\ -1 & -1 & 6 & 0 & 0 & -4\\ -4 & 0 & 0 & 32 & 16 & 16\\ 0 & -4 & 0 & 16 & 32 & 16\\ 0 & 0 & -4 & 16 & 16 & 32\end{bmatrix}, \qquad \mathbf{f} = -\frac{bA}{3}\begin{bmatrix} 0 & 0 & 0 & 1 & 1 & 1\end{bmatrix}^T.$$

We obtain $\mathbf{k}(b_i, b_i)$ from $\mathbf{k}(b_i, c_i)$ by replacing $c_i$
with $b_i$, and similarly $\mathbf{k}(c_i, c_i)$ by replacing $b_i$ with $c_i$.
The expression for $\mathbf{f}$ above is only for elements with
$\partial A_{\sigma e} = 0$ or $\bar{\phi}_\sigma = 0$ on $\partial A_{\sigma e}$.
Otherwise we must include the contributions from the line integral along
$\partial A_{\sigma e}$. We see from $\mathbf{f}$ that the finite element
process lumps 1/3 of the applied load over the element to each of the three
midside boundary nodes and nothing to the vertices. This is different from
the linear triangle, which lumps 1/3 of the applied load to each of the
vertices.

For higher order elements, the process of deriving element matrices is
identical to that for the lower order elements. One simply uses higher order
interpolation functions in Eqs. (5) and (6) or (7) and (8). The evaluation
of higher order elements involves the integration of quantities defined in
terms of triangular coordinates over a triangular region. The following
exact integration formula has proven useful:

(10)  $$\int_A \zeta_1^\alpha\zeta_2^\beta\zeta_3^\gamma\,dA = 2A\,\frac{\alpha!\,\beta!\,\gamma!}{(\alpha + \beta + \gamma + 2)!}.$$
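The integration formula lends itself to a short exact computation. The sketch below is illustrative only: polynomials in the triangular coordinates are stored as dictionaries from exponent triples to coefficients (a representation chosen here, not taken from the book), and the six quadratic shape functions are assumed to be $\zeta_i(2\zeta_i - 1)$ for the vertices and $4\zeta_2\zeta_3$, $4\zeta_3\zeta_1$, $4\zeta_1\zeta_2$ for midside nodes 4, 5, 6.

```python
from fractions import Fraction
from math import factorial

def tri_integral(a, b, c):
    """Integral of z1**a * z2**b * z3**c over a triangle, divided by its area."""
    return Fraction(2 * factorial(a) * factorial(b) * factorial(c),
                    factorial(a + b + c + 2))

def poly_mul(p, q):
    """Multiply two polynomials stored as {(a, b, c): coefficient}."""
    out = {}
    for (a1, b1, c1), u in p.items():
        for (a2, b2, c2), v in q.items():
            key = (a1 + a2, b1 + b2, c1 + c2)
            out[key] = out.get(key, 0) + u * v
    return out

def integrate(p):
    """Integrate a polynomial term by term, per unit element area."""
    return sum(coef * tri_integral(*exps) for exps, coef in p.items())

def quadratic_shape_functions():
    e = [(1, 0, 0), (0, 1, 0), (0, 0, 1)]
    hs = []
    for i in range(3):                       # vertices: 2 z_i^2 - z_i
        squared = tuple(2 * x for x in e[i])
        hs.append({squared: 2, e[i]: -1})
    for j, k in ((1, 2), (2, 0), (0, 1)):    # node 4 lies opposite vertex 1, etc.
        hs.append({tuple(x + y for x, y in zip(e[j], e[k])): 4})
    return hs

H = quadratic_shape_functions()
# Integral of h_i h_j over the element, divided by A and scaled by 180.
K2_TIMES_180 = [[integrate(poly_mul(hi, hj)) * 180 for hj in H] for hi in H]
# Integral of h_i over the element, divided by A (load lumping factors).
LOAD = [integrate(h) for h in H]
```

Under these assumptions the load factors come out as zero for the vertices and 1/3 for the midside nodes, in line with the lumping described for the six-node triangle.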


18.11. COORDINATE TRANSFORMATION 

In principle, triangular elements can approximate a domain of any
shape. For a domain with curved boundaries, it may require many small
elements to get a good geometrical approximation. We can get a better
geometrical representation by using elements with curved sides, but then the
difficulty is to find interpolation functions in terms of $x, y$ that can
fulfill the compatibility requirements. The strategy is to map curved
triangles into an isosceles right triangle and curved quadrilaterals into a
square (to be discussed later), so that all the shape functions we have
developed become usable. The key requirements for the transformation are that
there is no gap between mapped elements and that compatibility at the
connecting nodes of two adjacent elements implies compatibility along the
whole curved inter-element boundaries of the two elements.

The mapping of an element with curved boundaries into one with straight
boundary edges involves a proper transformation of geometry. Consider a
transformation of variables from $\xi, \eta$ to $x, y$

(1)  $$x = x(\xi, \eta), \qquad y = y(\xi, \eta).$$

The specific forms of $x(\xi, \eta)$, $y(\xi, \eta)$ depend on the shape of the
element and will be discussed case by case. We shall first identify the
relationships between $\xi, \eta$ and $x, y$. To shorten the derivation and
simplify the final results, we define

(2)  $$\mathbf{x} = \begin{bmatrix} x\\ y\end{bmatrix}, \qquad \boldsymbol{\xi} = \begin{bmatrix}\xi\\ \eta\end{bmatrix}.$$

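For later convenience, we take a digression to define the following
transformation matrices:

(3)  $$\mathbf{J} = \left(\frac{\partial\boldsymbol{\xi}}{\partial\mathbf{x}}\right)^T = \begin{bmatrix}\partial\xi/\partial x & \partial\eta/\partial x\\ \partial\xi/\partial y & \partial\eta/\partial y\end{bmatrix},$$

(4)  $$\bar{\mathbf{J}} = \left(\frac{\partial\mathbf{x}}{\partial\boldsymbol{\xi}}\right)^T = \begin{bmatrix}\partial x/\partial\xi & \partial y/\partial\xi\\ \partial x/\partial\eta & \partial y/\partial\eta\end{bmatrix},$$

the first order differential operators

(5)  $$\partial = \begin{bmatrix}\partial/\partial x\\ \partial/\partial y\end{bmatrix} = \mathbf{J}\begin{bmatrix}\partial/\partial\xi\\ \partial/\partial\eta\end{bmatrix} = \mathbf{J}\,\partial_\xi,$$

(6)  $$\mathbf{d}_e = \begin{bmatrix}\partial/\partial x & 0\\ 0 & \partial/\partial y\\ \partial/\partial y & \partial/\partial x\end{bmatrix}.$$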

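The inverse transformation $\xi = \xi(x, y)$ and $\eta = \eta(x, y)$ of Eq. (1)
is in general not available. We cannot calculate $\mathbf{J}$ or any partial
derivatives of $\xi, \eta$ with respect to $x, y$ directly. We can use the
following relations derived from Eqs. (3)-(5) for the purpose:

(7)  $$J = \det(\bar{\mathbf{J}}) = \frac{\partial x}{\partial\xi}\frac{\partial y}{\partial\eta} - \frac{\partial y}{\partial\xi}\frac{\partial x}{\partial\eta},$$

(8)  $$\mathbf{J} = \bar{\mathbf{J}}^{-1} = \frac{1}{J}\begin{bmatrix}\partial y/\partial\eta & -\partial y/\partial\xi\\ -\partial x/\partial\eta & \partial x/\partial\xi\end{bmatrix},$$

(9)  $$\frac{\partial\xi}{\partial\mathbf{x}} = \frac{1}{J}\begin{bmatrix}\dfrac{\partial y}{\partial\eta} & -\dfrac{\partial x}{\partial\eta}\end{bmatrix}^T, \qquad \frac{\partial\eta}{\partial\mathbf{x}} = \frac{1}{J}\begin{bmatrix}-\dfrac{\partial y}{\partial\xi} & \dfrac{\partial x}{\partial\xi}\end{bmatrix}^T,$$

where $J$ is the Jacobian of the transformation. For physical reasons, we
must limit the transformation to $J > 0$.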

The integration of a function with respect to $\xi, \eta$ and to $x, y$ are
related by

(10)  $$\int_{A_e} u(x, y)\,dx\,dy = \int_{A} u[x(\xi, \eta), y(\xi, \eta)]\,J\,d\xi\,d\eta.$$


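For triangular elements, we generally use the triangular coordinates for
the transformation. Equation (1) is written in the form

$$x = x(\zeta_1, \zeta_2, \zeta_3), \qquad y = y(\zeta_1, \zeta_2, \zeta_3),$$

with

$$\xi = \zeta_1, \qquad \eta = \zeta_2, \qquad \zeta_3 = 1 - \zeta_1 - \zeta_2.$$

The derivatives with respect to $\xi, \eta$ can be written as

$$\frac{\partial}{\partial\xi} = \frac{\partial}{\partial\zeta_1} - \frac{\partial}{\partial\zeta_3}, \qquad \frac{\partial}{\partial\eta} = \frac{\partial}{\partial\zeta_2} - \frac{\partial}{\partial\zeta_3}.$$

The differential operator $\partial_\xi$ and the transformation matrices
$\bar{\mathbf{J}}$ and $\mathbf{J}$ become

$$\partial_\xi = \begin{bmatrix}\dfrac{\partial}{\partial\zeta_1} - \dfrac{\partial}{\partial\zeta_3}\\[2mm] \dfrac{\partial}{\partial\zeta_2} - \dfrac{\partial}{\partial\zeta_3}\end{bmatrix}, \qquad \bar{\mathbf{J}} = \begin{bmatrix}\dfrac{\partial x}{\partial\zeta_1} - \dfrac{\partial x}{\partial\zeta_3} & \dfrac{\partial y}{\partial\zeta_1} - \dfrac{\partial y}{\partial\zeta_3}\\[2mm] \dfrac{\partial x}{\partial\zeta_2} - \dfrac{\partial x}{\partial\zeta_3} & \dfrac{\partial y}{\partial\zeta_2} - \dfrac{\partial y}{\partial\zeta_3}\end{bmatrix},$$

(11)  $$\mathbf{J} = \frac{1}{J}\begin{bmatrix}\dfrac{\partial y}{\partial\zeta_2} - \dfrac{\partial y}{\partial\zeta_3} & \dfrac{\partial y}{\partial\zeta_3} - \dfrac{\partial y}{\partial\zeta_1}\\[2mm] \dfrac{\partial x}{\partial\zeta_3} - \dfrac{\partial x}{\partial\zeta_2} & \dfrac{\partial x}{\partial\zeta_1} - \dfrac{\partial x}{\partial\zeta_3}\end{bmatrix}.$$

For triangular elements with all sides straight, the geometry
transformation is

$$x = x_i\zeta_i, \qquad y = y_i\zeta_i.$$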

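We can calculate the transformation matrices and differential operators
as defined above. With $\boldsymbol{\rho}^T = [\zeta_1\ \zeta_2\ \zeta_3]$, the
differential operators can also be expressed directly in terms of the
triangular coordinates, as shown below:

(12)  $$\partial = \begin{bmatrix}\partial/\partial x\\ \partial/\partial y\end{bmatrix} = \begin{bmatrix}\partial\boldsymbol{\rho}^T/\partial x\\ \partial\boldsymbol{\rho}^T/\partial y\end{bmatrix}\frac{\partial}{\partial\boldsymbol{\rho}} = \underset{(2\times 3)}{\mathbf{J}}\ \underset{(3\times 1)}{\partial_\rho}, \qquad \partial_\rho = \begin{bmatrix}\dfrac{\partial}{\partial\zeta_1} & \dfrac{\partial}{\partial\zeta_2} & \dfrac{\partial}{\partial\zeta_3}\end{bmatrix}^T,$$

(13)  $$\frac{\partial\boldsymbol{\rho}^T}{\partial x} = \frac{1}{2A}[b_1\ b_2\ b_3], \qquad \frac{\partial\boldsymbol{\rho}^T}{\partial y} = \frac{1}{2A}[c_1\ c_2\ c_3],$$

where

(14)  $$\mathbf{J} = \frac{1}{2A}\begin{bmatrix} b_1 & b_2 & b_3\\ c_1 & c_2 & c_3\end{bmatrix},$$

in which $b$'s, $c$'s and $A$ are defined in Eqs. (18.9:6) and (18.9:7). One can
then calculate $\mathbf{d}_e$ according to Eq. (6).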

Recall that, in Chapters 2-4, the components of coordinates are defined
as the contravariant components of a tensor, and the components of
deformation gradient or strains are the covariant components of a tensor. The
transformation of components between different coordinate systems follows
the tensor transformation laws. If the coordinate systems are orthogonal
and if the physical components for those quantities are used, there is no
need to distinguish the covariant and contravariant transformations. The
natural coordinates for triangles and those for general quadrilaterals used
in the finite element methods are not orthogonal. The transformation
described in this section is the covariant transformation of gradient
operators in the matrix form.


18.12. TRIANGULAR ELEMENTS WITH CURVED SIDES 

A family of such elements is shown in Fig. 18.12:1. There is no need for a
curved 3-node triangular element since the approximation over the element
is linear. Thus only higher order elements are considered. In the
$\xi, \eta$-plane, the element is an isosceles right triangle $A_e$ located at
the origin in the first quadrant with $0 \le \xi, \eta \le 1$ (Fig. 18.9:2).
The area of the triangle is 1/2. The specific form of $x(\xi, \eta)$,
$y(\xi, \eta)$ will depend on the order of the curves of the element. The
transformation can be written in the form

(1)  $$x = \sum_{\text{all nodes}} x_j h_j(\zeta_1, \zeta_2, \zeta_3), \qquad y = \sum_{\text{all nodes}} y_j h_j(\zeta_1, \zeta_2, \zeta_3),$$

where $h$'s are the shape functions discussed earlier, $x$'s and $y$'s are the
coordinates of the nodes and

$$\xi = \zeta_1, \qquad \eta = \zeta_2, \qquad \zeta_3 = 1 - \zeta_1 - \zeta_2.$$

Fig. 18.12:1. Triangular elements with curved sides: (a) linear, (b) quadratic,
and (c) cubic.

Elements derived by the transformation using shape functions of Eq. (1)
are called isoparametric elements (Taig 1961, Irons 1966, Coons 1967), as
one uses the same shape functions for the interpolation of the field variable
as for the transformation of the spatial coordinates.

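The element matrices are in exactly the same form as given in
Eqs. (18.10:7) and (18.10:8) except that the integrands are now in terms of
the $\zeta$'s and

(2)  $$\mathbf{B} = \mathbf{J}\,\partial_\xi\mathbf{h} = \mathbf{J}\begin{bmatrix}\partial\mathbf{h}/\partial\xi\\ \partial\mathbf{h}/\partial\eta\end{bmatrix} = \frac{1}{J}\begin{bmatrix}\dfrac{\partial y}{\partial\zeta_2} - \dfrac{\partial y}{\partial\zeta_3} & \dfrac{\partial y}{\partial\zeta_3} - \dfrac{\partial y}{\partial\zeta_1}\\[2mm] \dfrac{\partial x}{\partial\zeta_3} - \dfrac{\partial x}{\partial\zeta_2} & \dfrac{\partial x}{\partial\zeta_1} - \dfrac{\partial x}{\partial\zeta_3}\end{bmatrix}\begin{bmatrix}\dfrac{\partial\mathbf{h}}{\partial\zeta_1} - \dfrac{\partial\mathbf{h}}{\partial\zeta_3}\\[2mm] \dfrac{\partial\mathbf{h}}{\partial\zeta_2} - \dfrac{\partial\mathbf{h}}{\partial\zeta_3}\end{bmatrix},$$

where $J$ is the Jacobian.

The boundary integral in Eq. (18.10:8) becomes an integral along a
straight line. Suppose that we have to integrate along the side of the
triangle from corner $i$ to corner $j$, where $i, j = 1, 2,$ or 3 ($i \ne j$).
Let corner $k$ be the third vertex. Along the side, we have $\zeta_k = 0$,
$\zeta_i = 1 - \zeta_j$ and

(3)  $$ds = \left[\left(\frac{\partial x}{\partial\zeta_j} - \frac{\partial x}{\partial\zeta_i}\right)^2 + \left(\frac{\partial y}{\partial\zeta_j} - \frac{\partial y}{\partial\zeta_i}\right)^2\right]^{1/2} d\zeta_j$$

($j$ not summed). The integration with respect to $s$ is now with respect to
$\zeta_j$ ranging from 0 to 1.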

One can in principle express the area integrals in Eqs. (18.10:7) and
(18.10:8) in the form

$$\int_{A_e}(\cdots)\,dx\,dy = \int_0^1\!\!\int_0^{1 - \zeta_2}(\cdots)\,J\,d\zeta_1\,d\zeta_2$$

with $\zeta_3 = 1 - \zeta_1 - \zeta_2$. In practice, we carry out the
integration either numerically or according to the formula of
Eq. (18.10:10). It is not essential to use the expression above.

It can be seen that under the transformation the interpolated function
is still continuous over the inter-element boundaries. Note that the
transformation of the common boundaries of two adjacent elements is the same,
even though the transformation of the two elements involves different
transformation functions. This is because, from Eq. (1), the transformation
of a side depends only on the coordinates of nodes on that side, as shape
functions associated with nodes not on the side vanish. Thus, the
transformation is independent of the geometry of the element other than the
side of interest. Now, we have the same coordinates along the common boundary
of the two adjacent elements before and after transformation, which implies
that there is no gap between adjacent elements after transformation and that
the compatibility conditions for the interpolated functions are no different
from those without transformation. Therefore, compatibility at inter-element
nodes guarantees compatibility along the entire curved inter-element
boundary.

The element matrices are in the same form as given by Eqs. (18.10:7) and 
(18.10:8). The assembling, constraining and equation solving procedures 
are the same as described in Secs. 18.4 and 18.5. No further elaboration 
is needed. 


Six-node Curved Triangle. The triangle shown in Fig. 18.12:1(b) has
six nodes. Each side of the triangle is described by a quadratic curve. The
coordinate transformation is

(4)  $$x = \sum_{j=1}^{6} x_j h_j(\zeta_1, \zeta_2, \zeta_3), \qquad y = \sum_{j=1}^{6} y_j h_j(\zeta_1, \zeta_2, \zeta_3),$$

where $x_j, y_j$ are the coordinates of the nodes in the $x, y$-plane and $h_j$
are the shape functions for the quadratic element given in Eq. (18.9:10).
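A short numerical sketch may help make the six-node mapping concrete. It is an illustration under stated assumptions: the quadratic shape functions are taken as $\zeta_i(2\zeta_i - 1)$ for vertices 1 to 3 and $4\zeta_2\zeta_3$, $4\zeta_3\zeta_1$, $4\zeta_1\zeta_2$ for midside nodes 4 to 6 (Eq. (18.9:10) is not reproduced here), and the function names and sample coordinates are hypothetical.

```python
def shape6(z1, z2):
    """Quadratic shape functions and their xi-, eta-derivatives (xi = z1, eta = z2)."""
    z3 = 1.0 - z1 - z2
    h = [z1 * (2 * z1 - 1), z2 * (2 * z2 - 1), z3 * (2 * z3 - 1),
         4 * z2 * z3, 4 * z3 * z1, 4 * z1 * z2]
    # d/dxi = d/dz1 - d/dz3 and d/deta = d/dz2 - d/dz3
    d_xi = [4 * z1 - 1, 0.0, 1 - 4 * z3, -4 * z2, 4 * (z3 - z1), 4 * z2]
    d_eta = [0.0, 4 * z2 - 1, 1 - 4 * z3, 4 * (z3 - z2), -4 * z1, 4 * z1]
    return h, d_xi, d_eta

def map_point(nodes, z1, z2):
    """Return (x, y, J) at the point (z1, z2) for six nodal coordinate pairs."""
    h, d_xi, d_eta = shape6(z1, z2)
    x = sum(a * n[0] for a, n in zip(h, nodes))
    y = sum(a * n[1] for a, n in zip(h, nodes))
    x_xi = sum(a * n[0] for a, n in zip(d_xi, nodes))
    y_xi = sum(a * n[1] for a, n in zip(d_xi, nodes))
    x_eta = sum(a * n[0] for a, n in zip(d_eta, nodes))
    y_eta = sum(a * n[1] for a, n in zip(d_eta, nodes))
    return x, y, x_xi * y_eta - y_xi * x_eta

# Straight-sided triangle of area 1 with midside nodes at the edge midpoints.
straight = [(0.0, 0.0), (2.0, 0.0), (0.0, 1.0),
            (1.0, 0.5), (0.0, 0.5), (1.0, 0.0)]
# Same triangle with node 6 pulled outward, so side 1-2 becomes a parabola.
curved = straight[:5] + [(1.0, -0.2)]
```

For the straight-sided triangle the Jacobian is constant and equal to twice the area; once a midside node is displaced, the Jacobian varies over the element and should be checked to stay positive.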


18.13. QUADRILATERAL ELEMENTS 

Although triangles have the advantage of approximating domains of any
boundary shape, the linear triangular elements are not very accurate, so
a fine finite element mesh is often needed to achieve acceptable results in
applications. To gain better accuracy, one has the option of using higher
order triangular elements or elements of other shapes. Rectangles or
quadrilaterals are commonly used, as rectangular and quadrilateral elements
are generally more accurate than triangular elements. We shall derive shape
functions and construct the matrices for these elements.









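Rectangular Element. Shape functions for the rectangle shown in
Fig. 18.13:1 can be easily derived. The natural or intrinsic coordinates
are

(1)  $$\xi = 2\,\frac{x - x_1}{e_x} - 1, \qquad \eta = 2\,\frac{y - y_1}{e_y} - 1,$$

where $x_1, y_1$ are the coordinates of node 1, $e_x, e_y$ are the element sizes
in the $x, y$ plane and $\xi, \eta$ are defined over the domain
$-1 \le \xi, \eta \le 1$ regardless of the size of the element.

Fig. 18.13:1. A four-node square element.

A field variable can be expressed in terms of shape functions

$$\phi(x, y) = \sum_{j=1}^{4} q_j h_j(\xi, \eta),$$

where

$$h_1(\xi, \eta) = (1 - \xi)(1 - \eta)/4, \qquad h_2(\xi, \eta) = (1 + \xi)(1 - \eta)/4,$$
$$h_3(\xi, \eta) = (1 + \xi)(1 + \eta)/4, \qquad h_4(\xi, \eta) = (1 - \xi)(1 + \eta)/4.$$

These functions are called bilinear functions since they are linear in $\xi$
and in $\eta$. The shape functions can be written in the more compact form:

(2)  $$h_j(\xi, \eta) = (1 + \xi_j\xi)(1 + \eta_j\eta)/4, \qquad j = 1, \cdots, 4,$$

where $\xi_1 = \xi_4 = \eta_1 = \eta_2 = -1$ and $\xi_2 = \xi_3 = \eta_3 = \eta_4 = 1$.
Obviously $h_j$ equals 1 at the $j$th node and 0 at the other three nodes.
Therefore, like the triangular element,

$$q_j = \phi(x_j, y_j) = \phi_j, \qquad j = 1, \cdots, 4.$$

Since $\phi$ varies linearly along each side and is compatible at the two
corners, it is compatible with its adjacent elements along the entire
inter-element boundary between the two corners. In other words, $\phi$ is $C^0$
continuous over the whole domain. If $D_{11}$, $D_{22}$, $D_{12}\,(= D_{21})$,
$g$ and $b$ are all constant, the element matrices for the problem defined by
Eqs. (18.10:1)-(18.10:3) are

$$\mathbf{k} = \frac{D_{11}e_y}{6e_x}\begin{bmatrix} 2 & -2 & -1 & 1\\ -2 & 2 & 1 & -1\\ -1 & 1 & 2 & -2\\ 1 & -1 & -2 & 2\end{bmatrix} + \frac{D_{22}e_x}{6e_y}\begin{bmatrix} 2 & 1 & -1 & -2\\ 1 & 2 & -2 & -1\\ -1 & -2 & 2 & 1\\ -2 & -1 & 1 & 2\end{bmatrix} + \frac{D_{12}}{2}\begin{bmatrix} 1 & 0 & -1 & 0\\ 0 & -1 & 0 & 1\\ -1 & 0 & 1 & 0\\ 0 & 1 & 0 & -1\end{bmatrix} + g\,\mathbf{k}_2,$$

$$\mathbf{k}_2 = \frac{e_xe_y}{36}\begin{bmatrix} 4 & 2 & 1 & 2\\ 2 & 4 & 2 & 1\\ 1 & 2 & 4 & 2\\ 2 & 1 & 2 & 4\end{bmatrix}, \qquad \mathbf{f} = -\frac{b\,e_xe_y}{4}\begin{bmatrix} 1 & 1 & 1 & 1\end{bmatrix}^T.$$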
Just as in the case of the triangular element, the applied force matrix is
for elements with $\partial A_{\sigma e} = 0$ or $\bar{\phi}_\sigma = 0$ on
$\partial A_{\sigma e}$ only. Otherwise one must include the contributions from
the line integral along $\partial A_{\sigma e}$. The finite element process
accounts for the distributed load over the element by lumping it equally to
the 4 corner nodes.

Figures 18.13:2 and 18.13:3 show families of rectangular serendipity
and Lagrange elements where a progressively increasing and equal number of
nodes is placed on the element boundaries. Heavy dots denote element
boundary nodes, whereas open circles indicate interior nodes. The variation
of the function on the edges to ensure continuity is linear, quadratic,
cubic, etc. for the increasingly higher order elements. The difference
between the two families is in the number of interior nodes. One can use an
approach similar to that for triangular elements to generate the shape
functions for higher order elements by multiplying line functions that pass
through nodes of the element.

Alternatively we can generate shape functions by utilizing the bubble
function. Consider a quadratic Lagrange element. The bubble function,
which is zero on the element boundaries, is simply
$h_9 = (1 - \xi^2)(1 - \eta^2)$, the shape function associated with the
interior (central) node 9. The shape function for the midside node $j$
involves the bubble function and the products of functions representing all
the sides that do not include node $j$. For example, the shape function for
node 5, the midside node between nodes 1 and 2 (Fig. 18.13:3), is

$$h_5 = (1 - \xi^2)(1 - \eta)/2 - h_9/2.$$

The first term on the right hand side is the product of the line functions
for the sides $\xi = \pm 1$ and $\eta = 1$. The factor 1/2 normalizes it to
unity at node 5 ($\xi = 0$, $\eta = -1$). The bubble function is introduced to
make $h_5 = 0$ at node 9. The shape function for a corner node can be obtained
by subtracting from the bilinear shape function associated with the node half
of the shape functions of the two adjacent midside nodes and a quarter of
the bubble function. For corner node 1, we have

Fig. 18.13:2. A family of rectangular 'Serendipity' elements: (a) linear,
(b) quadratic, (c) cubic and (d) quartic. Heavy dots are exterior nodes and
open circles are interior nodes.

6. The name was coined by Zienkiewicz after the famous Princes of Serendip,
noted for their chance discoveries.

Fig. 18.13:3. A family of rectangular Lagrange elements: (a) linear,
(b) quadratic, (c) cubic. Heavy dots are boundary nodes and open circles are
interior nodes.

$$h_1 = (1 - \xi)(1 - \eta)/4 - (h_5 + h_8)/2 - h_9/4.$$

Table 18.13:1. Shape functions of a rectangle with missing node(s).

                                      Include only if node j is present in the element
Shape function                        j = 5     j = 6     j = 7     j = 8     j = 9
$h_1 = (1 - \xi)(1 - \eta)/4$         $-h_5/2$                      $-h_8/2$  $-h_9/4$
$h_2 = (1 + \xi)(1 - \eta)/4$         $-h_5/2$  $-h_6/2$                      $-h_9/4$
$h_3 = (1 + \xi)(1 + \eta)/4$                   $-h_6/2$  $-h_7/2$            $-h_9/4$
$h_4 = (1 - \xi)(1 + \eta)/4$                             $-h_7/2$  $-h_8/2$  $-h_9/4$
$h_5 = (1 - \xi^2)(1 - \eta)/2$                                               $-h_9/2$
$h_6 = (1 + \xi)(1 - \eta^2)/2$                                               $-h_9/2$
$h_7 = (1 - \xi^2)(1 + \eta)/2$                                               $-h_9/2$
$h_8 = (1 - \xi)(1 - \eta^2)/2$                                               $-h_9/2$
$h_9 = (1 - \xi^2)(1 - \eta^2)$

The subtracted terms are to make $h_1$ equal to zero at the midsides and
the center.

Table 18.13:1 gives the shape functions for the 9-node quadrilateral
Lagrange element. If $h_9$ is set to zero in Table 18.13:1, we obtain the 8
shape functions for the 8-node quadrilateral serendipity element. One can
obtain the shape functions for elements with missing midside node(s) from
Table 18.13:1. If a midside node is missing, we exclude the shape function of
the missing node and remove from the expressions of the shape functions
for the corner nodes the terms involving the shape functions of the missing
nodes. For example, for an element without node 5 as shown in Fig. 18.13:4,
we remove $h_5$ from $h_1$ and $h_2$ to obtain

$$h_1 = (1 - \xi)(1 - \eta)/4 - h_8/2 - h_9/4,$$
$$h_2 = (1 + \xi)(1 - \eta)/4 - h_6/2 - h_9/4.$$

The other shape functions do not involve $h_5$ and remain unchanged. On the
side with the missing midside node, the field variable varies linearly. The
side can be connected to a linear element while the other sides are connected
to quadratic elements. This arrangement is useful for the transition from
linear elements to quadratic elements in the discretization of the domain.

Fig. 18.13:4. Rectangular element with no midside node between nodes 1 and 2.

Fig. 18.13:5. Polynomial terms for plane Serendipity and Lagrange elements.
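The rule of Table 18.13:1 for elements with missing nodes translates directly into code. The following sketch is one possible reading of the table, with a hypothetical function name and a `present` argument introduced here to list which of nodes 5 to 9 exist; absent nodes simply contribute zero.

```python
def shape_functions(xi, eta, present=(5, 6, 7, 8, 9)):
    """Shape functions h[1..9] of a 4- to 9-node rectangle (Table 18.13:1)."""
    h = {}
    # Bubble function of the central node 9.
    h[9] = (1 - xi**2) * (1 - eta**2) if 9 in present else 0.0
    midside = {
        5: (1 - xi**2) * (1 - eta) / 2,
        6: (1 + xi) * (1 - eta**2) / 2,
        7: (1 - xi**2) * (1 + eta) / 2,
        8: (1 - xi) * (1 - eta**2) / 2,
    }
    for j, base in midside.items():
        h[j] = base - h[9] / 2 if j in present else 0.0
    corner = {
        1: ((1 - xi) * (1 - eta) / 4, (5, 8)),
        2: ((1 + xi) * (1 - eta) / 4, (5, 6)),
        3: ((1 + xi) * (1 + eta) / 4, (6, 7)),
        4: ((1 - xi) * (1 + eta) / 4, (7, 8)),
    }
    for j, (bilinear, adjacent) in corner.items():
        h[j] = bilinear - sum(h[m] for m in adjacent) / 2 - h[9] / 4
    return h

# Node locations in the (xi, eta) plane, numbered as in the table.
NODES = {1: (-1, -1), 2: (1, -1), 3: (1, 1), 4: (-1, 1),
         5: (0, -1), 6: (1, 0), 7: (0, 1), 8: (-1, 0), 9: (0, 0)}
```

Calling `shape_functions(xi, eta, present=(6, 7, 8, 9))` gives the transition element without node 5; along the side $\eta = -1$ the functions $h_1$ and $h_2$ then reduce to linear functions of $\xi$.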

It is instructive to examine the polynomial terms that are included
in (or missing from) the shape functions. In Fig. 18.13:5, polynomial terms
above the solid lines are those included in the expansions of the shape
functions of the corresponding Serendipity elements, while those above the
dotted lines are included in the corresponding Lagrange elements. The linear
element includes one quadratic term $\xi\eta$, and the highest order of
complete polynomial is only linear. For the quadratic and cubic elements, the
highest complete polynomials are quadratic and cubic, respectively. However,
for the quartic Serendipity element, the expansion of the shape functions
associated with the boundary nodes misses the $\xi^2\eta^2$ term. The error in
approximation would be of the order $\epsilon^4$ unless an interior node is
included. Then the shape functions can form a complete fourth order
polynomial and the error in approximation is of the order $\epsilon^5$.
Therefore, it is essential to have the central node added.

The higher order Lagrange elements contain some very high order terms
while omitting some lower order ones. For example, a cubic element has
16 nodes. It contains the term $\xi^3\eta^3$ while omitting terms like $\xi^4$
and $\eta^4$. The accuracy of the higher order Lagrange elements is limited by
the omitted terms. A cubic Serendipity element has the same order of
truncation error as the cubic Lagrange element but has only 12 nodes.

Isoparametric Quadrilateral Elements. Rectangular elements are generally
more accurate than the triangular elements of the same order. However,
rectangular elements suffer the shortcoming of not being able to approximate
irregular domains. We shall introduce a family of quadrilateral
elements (Fig. 18.13:6) that can approximate domains of any shape and
at the same time provide better accuracy than the triangular elements
of the same order. Following the approach employed for curved triangular
elements, we use shape functions to map the quadrilateral elements into a
square and then construct the element matrices in the transformed
coordinates. The transformation is in the same form as Eq. (18.12:1):

(3)  $$x = \sum_{\text{all nodes}} x_j h_j(\xi, \eta), \qquad y = \sum_{\text{all nodes}} y_j h_j(\xi, \eta),$$

where $h_j$ are the shape functions associated with node $j$ of the element. If
so desired, one may use only the nodes on the element boundaries for the
transformation. For a 4-node quadrilateral element, the shape functions are
defined in Eq. (2). For 8-node serendipity or 9-node Lagrange quadrilateral
elements, the shape functions associated with boundary nodes are the first
eight given in Table 18.13:1. The mapped element is a square over
$-1 \le \xi, \eta \le 1$ in the $\xi, \eta$-plane (Fig. 18.13:1).

Fig. 18.13:6. Quadrilateral elements.

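The element matrices given in Eqs. (18.10:7) and (18.10:8) can now be
written as

(4)  $$\mathbf{k} = \int_{-1}^{1}\!\!\int_{-1}^{1}\left(\mathbf{B}^T\mathbf{D}_e\mathbf{B} + g\,\mathbf{h}^T\mathbf{h}\right)J\,d\xi\,d\eta,$$

(5)  $$\mathbf{f} = -\int_{-1}^{1}\!\!\int_{-1}^{1} b\,\mathbf{h}^T J\,d\xi\,d\eta + \int_{\partial A_{\sigma e}}\bar{\phi}_\sigma\mathbf{h}^T ds,$$

where, similar to Eq. (18.12:2),

(6)  $$\mathbf{B} = \begin{bmatrix}\partial\mathbf{h}/\partial x\\ \partial\mathbf{h}/\partial y\end{bmatrix} = \mathbf{J}\begin{bmatrix}\partial\mathbf{h}/\partial\xi\\ \partial\mathbf{h}/\partial\eta\end{bmatrix} = \frac{1}{J}\begin{bmatrix}\partial y/\partial\eta & -\partial y/\partial\xi\\ -\partial x/\partial\eta & \partial x/\partial\xi\end{bmatrix}\begin{bmatrix}\partial\mathbf{h}/\partial\xi\\ \partial\mathbf{h}/\partial\eta\end{bmatrix}.$$

The resulting element is called an isoparametric element as the same shape
functions are used for coordinate transformation and for interpolation of the
field variable(s).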

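The boundary integral in Eq. (5) involves integration along the side for
$\xi$ or $\eta = \pm 1$ if $\partial A_{\sigma e}$ is not null. If the
integration is along the side $\xi = a$, where $a = 1$ or $-1$, the boundary
integral can be written as

(7)  $$\int_{\partial A_{\sigma e}}\bar{\phi}_\sigma\mathbf{h}^T ds = \int_{-1}^{1}\bar{\phi}_\sigma\mathbf{h}^T(a, \eta)\left[\left(\frac{\partial x}{\partial\eta}\right)^2 + \left(\frac{\partial y}{\partial\eta}\right)^2\right]^{1/2} d\eta.$$

If the integration is along $\eta = a$, the boundary integral is

(8)  $$\int_{\partial A_{\sigma e}}\bar{\phi}_\sigma\mathbf{h}^T ds = \int_{-1}^{1}\bar{\phi}_\sigma\mathbf{h}^T(\xi, a)\left[\left(\frac{\partial x}{\partial\xi}\right)^2 + \left(\frac{\partial y}{\partial\xi}\right)^2\right]^{1/2} d\xi.$$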

The integration in Eqs. (4) and (5) is complicated because of the presence
of $J$ in the denominator in Eq. (6). Even for the simplest 4-node
isoparametric element with constant $\mathbf{D}_e$, analytical integration,
though possible, is not practical. Numerical integration is the only practical
means to evaluate the element matrices for isoparametric elements.
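To make the role of numerical integration concrete, the sketch below evaluates Eq. (4) for a 4-node isoparametric quadrilateral with a 2 x 2 Gauss-Legendre rule. The choice of rule, the function name and the default coefficients are assumptions made for this illustration; the chapter discusses numerical integration separately.

```python
from math import sqrt

XI = (-1, 1, 1, -1)      # xi_j of the four corner nodes
ETA = (-1, -1, 1, 1)     # eta_j of the four corner nodes
GAUSS = (-1 / sqrt(3), 1 / sqrt(3))   # 2-point rule; both weights equal 1

def quad4_stiffness(nodes, D11=1.0, D12=0.0, D22=1.0, g=0.0):
    """Element matrix k of Eq. (4) for nodal coordinates [(x1, y1), ..., (x4, y4)]."""
    k = [[0.0] * 4 for _ in range(4)]
    for xi in GAUSS:
        for eta in GAUSS:
            h = [(1 + XI[j] * xi) * (1 + ETA[j] * eta) / 4 for j in range(4)]
            h_xi = [XI[j] * (1 + ETA[j] * eta) / 4 for j in range(4)]
            h_eta = [ETA[j] * (1 + XI[j] * xi) / 4 for j in range(4)]
            x_xi = sum(d * n[0] for d, n in zip(h_xi, nodes))
            y_xi = sum(d * n[1] for d, n in zip(h_xi, nodes))
            x_eta = sum(d * n[0] for d, n in zip(h_eta, nodes))
            y_eta = sum(d * n[1] for d, n in zip(h_eta, nodes))
            jac = x_xi * y_eta - y_xi * x_eta          # Jacobian J
            # Rows of B in Eq. (6): derivatives of h with respect to x and y.
            h_x = [(y_eta * a - y_xi * c) / jac for a, c in zip(h_xi, h_eta)]
            h_y = [(-x_eta * a + x_xi * c) / jac for a, c in zip(h_xi, h_eta)]
            for i in range(4):
                for j in range(4):
                    k[i][j] += (D11 * h_x[i] * h_x[j]
                                + D12 * (h_x[i] * h_y[j] + h_y[i] * h_x[j])
                                + D22 * h_y[i] * h_y[j]
                                + g * h[i] * h[j]) * jac
    return k

# A 2 x 1 rectangle and a distorted quadrilateral, both numbered counter-clockwise.
k_rect = quad4_stiffness([(0.0, 0.0), (2.0, 0.0), (2.0, 1.0), (0.0, 1.0)])
k_quad = quad4_stiffness([(0.0, 0.0), (2.0, 0.2), (1.7, 1.3), (-0.1, 1.0)])
```

For a rectangle the Jacobian is constant and the 2 x 2 rule integrates the integrand exactly; for a distorted quadrilateral the result is an approximation because $1/J$ is no longer a polynomial.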


Static Condensation. There are boundary and interior nodes for the cubic
triangle, the quadratic Lagrange and quartic Serendipity quadrilaterals, and
elements of higher order. The shape functions associated with interior nodes
are bubble functions. The generalized coordinates associated with the
interior nodes do not appear in the domain outside the element and can be
eliminated from the rest of the equations by the stationary condition
of the functional. The process is referred to as static condensation.

We partition $\mathbf{q}$, $\mathbf{k}$, $\mathbf{f}$ of Eq. (18.10:4) in the
corresponding forms

$$\mathbf{q} = \begin{bmatrix}\mathbf{q}_b\\ \mathbf{q}_a\end{bmatrix}, \qquad \mathbf{k} = \begin{bmatrix}\mathbf{k}_{bb} & \mathbf{k}_{ab}^T\\ \mathbf{k}_{ab} & \mathbf{k}_{aa}\end{bmatrix}, \qquad \mathbf{f} = \begin{bmatrix}\mathbf{f}_b\\ \mathbf{f}_a\end{bmatrix},$$

where subscripts $a$ and $b$ denote that the concerned quantities are
associated with the generalized coordinates of interior and boundary nodes,
respectively. For the quadratic Lagrange element, $\mathbf{q}_a = (q_9)$ is the
generalized coordinate of the only interior node, node 9. For the cubic
Lagrange element, there are four interior nodes (Fig. 18.13:3). By virtue of
the fact that $\mathbf{q}_a$ does not appear in other elements, the vanishing
of the first variation of the functional with respect to $\mathbf{q}_a$
implies $\delta\Pi_e = 0$, i.e.,

$$\mathbf{k}_{ab}\mathbf{q}_b + \mathbf{k}_{aa}\mathbf{q}_a - \mathbf{f}_a = 0 \qquad \text{or} \qquad \mathbf{q}_a = -\mathbf{k}_{aa}^{-1}\mathbf{k}_{ab}\mathbf{q}_b + \mathbf{k}_{aa}^{-1}\mathbf{f}_a,$$

which can be used to eliminate $\mathbf{q}_a$ from Eq. (18.10:4). This process
leads to the following reduced element stiffness matrix and force matrix:

$$\mathbf{k}^* = \mathbf{k}_{bb} - \mathbf{k}_{ab}^T\mathbf{k}_{aa}^{-1}\mathbf{k}_{ab}, \qquad \mathbf{f}^* = \mathbf{f}_b - \mathbf{k}_{ab}^T\mathbf{k}_{aa}^{-1}\mathbf{f}_a,$$

which are associated only with the generalized coordinates $\mathbf{q}_b$ of the
boundary nodes of the element. One assembles $\mathbf{k}^*$ and $\mathbf{f}^*$
to form the global matrices.
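For an element with a single interior degree of freedom, such as node 9 of the quadratic Lagrange element, the condensation reduces to a scalar elimination. The sketch below illustrates this special case only; the function names and the small sample matrix are hypothetical and are not taken from an actual element.

```python
from fractions import Fraction as F

def condense(k, f, interior):
    """Eliminate one interior dof from the symmetric element system k q = f."""
    keep = [i for i in range(len(k)) if i != interior]
    kaa = k[interior][interior]
    k_star = [[k[i][j] - k[i][interior] * k[interior][j] / kaa for j in keep]
              for i in keep]
    f_star = [f[i] - k[i][interior] * f[interior] / kaa for i in keep]
    return k_star, f_star

def recover_interior(k, f, interior, q_b):
    """Back-substitute the boundary values q_b to recover the interior dof."""
    keep = [i for i in range(len(k)) if i != interior]
    coupling = sum(k[interior][j] * q for j, q in zip(keep, q_b))
    return (f[interior] - coupling) / k[interior][interior]

# Sample 3 x 3 system in which the last dof plays the role of the interior node.
K = [[F(4), F(1), F(2)],
     [F(1), F(3), F(1)],
     [F(2), F(1), F(5)]]
FORCE = [F(1), F(2), F(3)]
K_STAR, F_STAR = condense(K, FORCE, interior=2)
```

Solving the reduced system for the boundary values and back-substituting should give the same result as solving the full system, which is what makes the elimination exact rather than approximate.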


18.14. PLANE ELASTICITY 

We have derived shape functions for a variety of element shapes. Up to
now, we have considered primarily problems with one field variable.
Generalizing the finite element process to problems of multiple field
variables is straightforward. Let us examine problems of plane elasticity,
which involve the following field variables:


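(1)  $$\mathbf{u} = \begin{bmatrix} u\\ v\end{bmatrix}, \qquad \boldsymbol{\sigma} = \begin{bmatrix}\sigma_{xx}\\ \sigma_{yy}\\ \sigma_{xy}\end{bmatrix}, \qquad \mathbf{e} = \begin{bmatrix} e_x\\ e_y\\ \gamma_{xy}\end{bmatrix} = \begin{bmatrix}\partial u/\partial x\\ \partial v/\partial y\\ \partial u/\partial y + \partial v/\partial x\end{bmatrix} = \mathbf{d}_e\mathbf{u},$$

where $\mathbf{u}$, $\boldsymbol{\sigma}$ and $\mathbf{e}$ are column matrices,
whose components are the displacements, the stresses and the engineering
strains, respectively. For 3-dimensional problems $\mathbf{u}$ has 3
components and both $\boldsymbol{\sigma}$ and $\mathbf{e}$ have 6 components.
They are not tensors as defined in previous chapters. The components of
$\mathbf{u}$, $\boldsymbol{\sigma}$ and $\mathbf{e}$ are the components of the
corresponding quantities referred to a specific coordinate system. Thus they
are not equal in different coordinate systems. Consider two coordinate
systems $x^i$ and $\bar{x}^i$, which are related by $\bar{x}^i = R_{ij}x^j$.
Then, in the matrix form, we have

$$\bar{\mathbf{u}} = \mathbf{R}\mathbf{u},$$

with the components of $\mathbf{R}$ being $R_{ij}$. The relationship between
$\bar{\boldsymbol{\sigma}}$, $\bar{\mathbf{e}}$ and $\boldsymbol{\sigma}$,
$\mathbf{e}$ can be derived from the tensor transformation rules. For the
2-dimensional case, the tensor rule gives

$$\begin{bmatrix}\bar{\sigma}^{11} & \bar{\sigma}^{12}\\ \bar{\sigma}^{21} & \bar{\sigma}^{22}\end{bmatrix} = \begin{bmatrix} R_{11} & R_{12}\\ R_{21} & R_{22}\end{bmatrix}\begin{bmatrix}\sigma^{11} & \sigma^{12}\\ \sigma^{21} & \sigma^{22}\end{bmatrix}\begin{bmatrix} R_{11} & R_{21}\\ R_{12} & R_{22}\end{bmatrix},$$

then

$$\bar{\boldsymbol{\sigma}} = \begin{bmatrix}\bar{\sigma}^{11}\\ \bar{\sigma}^{22}\\ \bar{\sigma}^{12}\end{bmatrix} = \begin{bmatrix} R_{11}^2 & R_{12}^2 & 2R_{11}R_{12}\\ R_{21}^2 & R_{22}^2 & 2R_{22}R_{21}\\ R_{11}R_{21} & R_{22}R_{12} & R_{11}R_{22} + R_{12}R_{21}\end{bmatrix}\begin{bmatrix}\sigma^{11}\\ \sigma^{22}\\ \sigma^{12}\end{bmatrix}.$$

In general, $\bar{\mathbf{u}} \ne \mathbf{u}$, $\bar{\boldsymbol{\sigma}} \ne \boldsymbol{\sigma}$
and $\bar{\mathbf{e}} \ne \mathbf{e}$ based on the matrix definition. Based on
the tensor definition, $\bar{\mathbf{u}} = \mathbf{u}$,
$\bar{\boldsymbol{\sigma}} = \boldsymbol{\sigma}$ and $\bar{\mathbf{e}} = \mathbf{e}$,
but their components will be different in different coordinate systems. Note
that $\boldsymbol{\sigma}^T\mathbf{e} = \mathbf{e}^T\boldsymbol{\sigma}$ in the
present matrix definition and is equal to
$\boldsymbol{\sigma}^T : \mathbf{e}\ (= \mathbf{e}^T : \boldsymbol{\sigma})$ if
$\boldsymbol{\sigma}$ and $\mathbf{e}$ are the stress and strain tensors of the
previous chapters. The engineering shear strain $\gamma_{xy}$ rather than the
tensor shear strain $e_{xy}\ [= (\partial u/\partial y + \partial v/\partial x)/2]$
[Eq. (4.2:7)] is used in Eq. (1) to make the elastic modulus matrix
$\mathbf{D}_e$ of Eq. (2) below symmetric.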
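The 3 x 3 matrix acting on the stress column can be checked against the tensor rule numerically. The sketch below is only an illustration: the function names are hypothetical, and a plane rotation is assumed for $\mathbf{R}$ in the usage example.

```python
from math import cos, sin

def stress_transform_matrix(R):
    """3 x 3 matrix that maps [s11, s22, s12] to the barred components."""
    (r11, r12), (r21, r22) = R
    return [
        [r11 * r11, r12 * r12, 2 * r11 * r12],
        [r21 * r21, r22 * r22, 2 * r22 * r21],
        [r11 * r21, r22 * r12, r11 * r22 + r12 * r21],
    ]

def transform_by_tensor_rule(R, s):
    """Same transformation computed as R S R^T with the full 2 x 2 stress array."""
    s11, s22, s12 = s
    S = [[s11, s12], [s12, s22]]
    RS = [[sum(R[i][m] * S[m][j] for m in range(2)) for j in range(2)]
          for i in range(2)]
    out = [[sum(RS[i][m] * R[j][m] for m in range(2)) for j in range(2)]
           for i in range(2)]
    return [out[0][0], out[1][1], out[0][1]]

# Usage: rotate the axes by 0.3 rad and transform a sample stress state.
theta = 0.3
R = [[cos(theta), sin(theta)], [-sin(theta), cos(theta)]]
sigma = [1.0, 2.0, 0.5]
T = stress_transform_matrix(R)
sigma_bar = [sum(T[i][j] * sigma[j] for j in range(3)) for i in range(3)]
```

For a rotation the sum of the two normal stresses is unchanged, which offers a quick sanity check on the first two rows of the matrix.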

The generalized Hooke's law can be written in the matrix form

$$\boldsymbol{\sigma} = \mathbf{D}_e\mathbf{e},$$

where

(2)  $$\mathbf{D}_e = \begin{bmatrix} D_{11} & D_{12} & D_{13}\\  & D_{22} & D_{23}\\ \text{sym} &  & D_{33}\end{bmatrix}$$

is the elastic constant, or modulus, matrix. The matrix $\mathbf{D}_e$ is also
not a tensor, but its components are related to the elastic modulus
tensor $C^{ijkl}$ of Eq. (6.1:1). The strains can be expressed in terms of
displacements

(3)  $$\mathbf{e} = \mathbf{d}_e\mathbf{u},$$

where $\mathbf{d}_e$ is the differential operator defined in Eq. (18.11:6).

Consider a plane elasticity problem over a domain A with prescribed 
boundary conditions 

(4) $\mathbf{u} = \bar{\mathbf{u}}$ on $\partial A_u$,

(5) $\mathbf{T} = \bar{\mathbf{T}}$ on $\partial A_\sigma$,

where $\mathbf{T}$ is the boundary traction whose components are

$T_1 = n_1\sigma_{xx} + n_2\sigma_{xy}$,

$T_2 = n_1\sigma_{xy} + n_2\sigma_{yy}$,

in which $\mathbf{n}$ is a unit normal to $\partial A_\sigma$. In this and subsequent chapters, we
shall use the capital $\mathbf{T}$ to denote traction rather than the Lagrangian stress
as used in the previous chapters.

In integral formulation, the functional for the potential energy of an 
element is 

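(6) $\Pi_e = \int_{A_e} \left( \tfrac{1}{2}\mathbf{e}^T\mathbf{D}_e\mathbf{e} - \mathbf{u}^T\mathbf{b} \right) dA - \int_{\partial A_{\sigma e}} \mathbf{u}^T\bar{\mathbf{T}}\, ds$

$\quad = \int_{A_e} \left[ \tfrac{1}{2}(\mathbf{d}_e\mathbf{u})^T\mathbf{D}_e(\mathbf{d}_e\mathbf{u}) - \mathbf{u}^T\mathbf{b} \right] dA - \int_{\partial A_{\sigma e}} \mathbf{u}^T\bar{\mathbf{T}}\, ds$,

where $\mathbf{b}$ is the distributed body force vector. The integral formulation
corresponds to the vanishing of the first variation of the functional for the
whole domain with respect to the field variable $\mathbf{u}$ subjected to the
admissibility constraint. The prescribed displacement condition Eq. (4), a rigid
condition, must be enforced whereas the prescribed traction condition Eq. (5), a
natural condition, is accounted for by the integral over $\partial A_\sigma$.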

Note that the form of Eq. (6) would have to be modified slightly for
mixed boundary value problems. Suppose the mixed boundary conditions are

$u_n = \bar{u}_n$, $T_t = \bar{T}_t$ on $\partial A'_{\sigma e}$,
$u_t = \bar{u}_t$, $T_n = \bar{T}_n$ on $\partial A''_{\sigma e}$,

where $n$ and $t$ denote the normal and tangential directions along the element
boundaries. Additional terms must be added to $\Pi_e$ of Eq. (6) to account
for the prescribed traction components. We have



We can easily establish the finite element equations using the integral
formulation. Consider an element of $p$ nodes with nodal coordinates $x_j$, $y_j$
for the $j$th node. Let us denote the nodal values of $u$ and $v$ as

$q_{2j-1} = u(x_j, y_j)$, $q_{2j} = v(x_j, y_j)$,

where the $q$'s are the generalized coordinates with their subscripts being the
local label. Each node now has two degrees-of-freedom, as opposed to one
considered earlier. Utilizing the shape functions derived earlier, we can
interpolate both field variables in the form

(7) $u = \sum_{j=1}^{p} h_j(x, y)\, q_{2j-1}$,

(8) $v = \sum_{j=1}^{p} h_j(x, y)\, q_{2j}$.

In matrix form, they can be written as

(9) $\mathbf{u} = \mathbf{h}\mathbf{q}$, $\mathbf{e} = \mathbf{d}_e\mathbf{u} = \mathbf{B}\mathbf{q}$,

where $\mathbf{d}_e$ is a differential operator matrix defined in Eq. (18.11:6),

(10) $$\mathbf{h} = \begin{bmatrix} h_1 & 0 & h_2 & 0 & \cdots & h_p & 0 \\ 0 & h_1 & 0 & h_2 & \cdots & 0 & h_p \end{bmatrix},$$

(11) $\mathbf{B} = \mathbf{d}_e\mathbf{h}$,

with $\mathbf{h}$ being a $2 \times 2p$ interpolation matrix [$\mathbf{h}$ is a row matrix for problems of
one field variable in Eq. (10.10:9)] and $\mathbf{B}$ is the strain-displacement matrix
relating the nodal displacement matrix

$\mathbf{q} = [q_1\ \ q_2\ \ \ldots\ \ q_{2p-1}\ \ q_{2p}]^T$ $\quad (2p \times 1)$

to the strain matrix $\mathbf{e}$.
A substitution of Eq. (9) into Eq. (6) yields the standard equation for the
element

(12) $\Pi_e = \tfrac{1}{2}\mathbf{q}^T\mathbf{k}\mathbf{q} - \mathbf{q}^T\mathbf{f}$,

where

(13) $\mathbf{k} = \int_{A_e} \mathbf{B}^T\mathbf{D}_e\mathbf{B}\, dA$,

(14) $\mathbf{f} = \int_{A_e} \mathbf{h}^T\mathbf{b}\, dA + \int_{\partial A_{\sigma e}} \mathbf{h}^T\bar{\mathbf{T}}\, ds$.

These equations are similar in form to Eqs. (18.10:7) and (18.10:8) except
that $\mathbf{D}_e$, $\mathbf{B}$ are now defined in Eqs. (2) and (11), respectively.

The stiffness matrix $\mathbf{k}$ is symmetric and has the rank of $2p - 3$, because
there are 3 rigid body motions for plane elasticity. However, the rank can
be higher if the shape functions are incapable of representing certain rigid
body modes. The rank can be lower also if the integration in Eq. (13) is
carried out only approximately, which can introduce spurious deformation.
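The symmetry and rigid-body properties of $\mathbf{k}$ can be checked numerically. The following is a minimal sketch for a three-node triangle in plane stress with unit thickness, for which the integrand of Eq. (13) is constant. The function names (`cst_stiffness`, `rigid_body_modes`, `apply`) and the counterclockwise node ordering are assumptions made for illustration, not notation from the text.

```python
# Constant strain triangle, plane stress, unit thickness: k = A * B^T D_e B.
# All names and sample values are illustrative assumptions.

def matmul(a, b):
    """Plain-list matrix product."""
    return [[sum(a[i][m] * b[m][j] for m in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def cst_stiffness(xy, E, nu):
    """6 x 6 stiffness of a 3-node triangle with nodes xy (counterclockwise)."""
    (x1, y1), (x2, y2), (x3, y3) = xy
    area2 = (x2 - x1) * (y3 - y1) - (x3 - x1) * (y2 - y1)  # twice the area
    # x- and y-derivatives of the linear shape functions, multiplied by 2A.
    bx = [y2 - y3, y3 - y1, y1 - y2]
    cy = [x3 - x2, x1 - x3, x2 - x1]
    B = [[0.0] * 6 for _ in range(3)]
    for j in range(3):
        B[0][2 * j] = bx[j] / area2        # e_xx = du/dx
        B[1][2 * j + 1] = cy[j] / area2    # e_yy = dv/dy
        B[2][2 * j] = cy[j] / area2        # gamma_xy = du/dy + dv/dx
        B[2][2 * j + 1] = bx[j] / area2
    c = E / (1.0 - nu * nu)                # isotropic plane stress D_e
    D = [[c, c * nu, 0.0], [c * nu, c, 0.0], [0.0, 0.0, c * (1.0 - nu) / 2.0]]
    Bt = [list(col) for col in zip(*B)]
    k = matmul(matmul(Bt, D), B)           # integrand is constant over A_e
    return [[0.5 * area2 * v for v in row] for row in k]


def rigid_body_modes(xy):
    """Two translations and one infinitesimal rotation as nodal vectors q."""
    tx = [1.0, 0.0] * 3
    ty = [0.0, 1.0] * 3
    rot = [v for (x, y) in xy for v in (-y, x)]
    return [tx, ty, rot]


def apply(k, q):
    """Matrix-vector product k q."""
    return [sum(kij * qj for kij, qj in zip(row, q)) for row in k]
```

Each of the three rigid body modes should give $\mathbf{k}\mathbf{q} = \mathbf{0}$, which is consistent with the rank $2p - 3 = 3$ stated above for $p = 3$.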

For a three-node triangle, $p = 3$, the $h_j$ are linear functions of $x$, $y$ [or
$\zeta_1, \zeta_2, \zeta_3$, Eq. (18.9:4)] and the strain $\mathbf{e}$ represented by $\mathbf{B}\mathbf{q}$ is constant.
Therefore, a three-node triangular element is also called the constant strain
triangle. A 6-node triangular element uses the quadratic shape functions of
Eq. (18.9:10) and has linear strains. It is called the linear strain triangle.

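For elements with curved sides, the element matrices can be written in
the similar form

(15) $\mathbf{k} = \int_{A_\xi} \mathbf{B}^T\mathbf{D}_e\mathbf{B}\, J\, d\xi\, d\eta$,

(16) $\mathbf{f} = \int_{A_\xi} \mathbf{h}^T\mathbf{b}\, J\, d\xi\, d\eta + \int_{\partial A_{\sigma e}} \mathbf{h}^T\bar{\mathbf{T}}\, ds$,

where $\xi$, $\eta$ are the transformed coordinates, $A_\xi$ is a right isosceles triangle
of area 1/2 for triangular elements and a square of area 4 for quadrilateral
elements, and

(17) $$\mathbf{B} = \mathbf{d}_e\mathbf{h} = \begin{bmatrix} \dfrac{\partial}{\partial x} & 0 \\ 0 & \dfrac{\partial}{\partial y} \\ \dfrac{\partial}{\partial y} & \dfrac{\partial}{\partial x} \end{bmatrix} \mathbf{h},$$

which can be written as

$$\mathbf{B} = \begin{bmatrix}
\dfrac{\partial\xi}{\partial x}\dfrac{\partial}{\partial\xi} + \dfrac{\partial\eta}{\partial x}\dfrac{\partial}{\partial\eta} & 0 \\
0 & \dfrac{\partial\xi}{\partial y}\dfrac{\partial}{\partial\xi} + \dfrac{\partial\eta}{\partial y}\dfrac{\partial}{\partial\eta} \\
\dfrac{\partial\xi}{\partial y}\dfrac{\partial}{\partial\xi} + \dfrac{\partial\eta}{\partial y}\dfrac{\partial}{\partial\eta} & \dfrac{\partial\xi}{\partial x}\dfrac{\partial}{\partial\xi} + \dfrac{\partial\eta}{\partial x}\dfrac{\partial}{\partial\eta}
\end{bmatrix} \mathbf{h},$$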

in which h is defined in Eq. (10). The boundary integral of Eq. (16) is 
in the form of Eq. (18.12:3) for triangular elements and Eq. (18.13:7) or 
(18.13:8) for quadrilateral elements with h T (f> a replaced by h T T. 

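The evaluation of the matrix $\mathbf{B}$ is straightforward but tedious. According
to Eqs. (18.11:3), (18.11:8) and (18.13:3),

(18) $$\mathbf{J} = \begin{bmatrix} \dfrac{\partial x}{\partial\xi} & \dfrac{\partial y}{\partial\xi} \\ \dfrac{\partial x}{\partial\eta} & \dfrac{\partial y}{\partial\eta} \end{bmatrix}
= \sum_{\text{all nodes}} \begin{bmatrix} x_i\dfrac{\partial h_i}{\partial\xi} & y_i\dfrac{\partial h_i}{\partial\xi} \\ x_i\dfrac{\partial h_i}{\partial\eta} & y_i\dfrac{\partial h_i}{\partial\eta} \end{bmatrix},$$

(19) $$\mathbf{J}^{-1} = \begin{bmatrix} \dfrac{\partial\xi}{\partial x} & \dfrac{\partial\eta}{\partial x} \\ \dfrac{\partial\xi}{\partial y} & \dfrac{\partial\eta}{\partial y} \end{bmatrix}
= \frac{1}{J} \sum_{\text{all nodes}} \begin{bmatrix} y_i\dfrac{\partial h_i}{\partial\eta} & -y_i\dfrac{\partial h_i}{\partial\xi} \\ -x_i\dfrac{\partial h_i}{\partial\eta} & x_i\dfrac{\partial h_i}{\partial\xi} \end{bmatrix},$$

(20) $J = |\mathbf{J}| = \dfrac{\partial x}{\partial\xi}\dfrac{\partial y}{\partial\eta} - \dfrac{\partial x}{\partial\eta}\dfrac{\partial y}{\partial\xi}$,

where repeated indices denote summation over all nodes involved. Then
from Eqs. (7)-(9), we obtain

(21) $$\mathbf{e} = \mathbf{B}\mathbf{q} = \begin{bmatrix} \dfrac{\partial u}{\partial x} \\ \dfrac{\partial v}{\partial y} \\ \dfrac{\partial u}{\partial y} + \dfrac{\partial v}{\partial x} \end{bmatrix}
= \sum_i \frac{1}{J} \begin{bmatrix}
\left( \dfrac{\partial h_i}{\partial\xi}\dfrac{\partial y}{\partial\eta} - \dfrac{\partial h_i}{\partial\eta}\dfrac{\partial y}{\partial\xi} \right) q_{2i-1} \\
\left( \dfrac{\partial h_i}{\partial\eta}\dfrac{\partial x}{\partial\xi} - \dfrac{\partial h_i}{\partial\xi}\dfrac{\partial x}{\partial\eta} \right) q_{2i} \\
\left( \dfrac{\partial h_i}{\partial\eta}\dfrac{\partial x}{\partial\xi} - \dfrac{\partial h_i}{\partial\xi}\dfrac{\partial x}{\partial\eta} \right) q_{2i-1} + \left( \dfrac{\partial h_i}{\partial\xi}\dfrac{\partial y}{\partial\eta} - \dfrac{\partial h_i}{\partial\eta}\dfrac{\partial y}{\partial\xi} \right) q_{2i}
\end{bmatrix}.$$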
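The range of summation in Eqs. (18) and (19) is the same as that of Eq. (21),
if the transformation Eq. (18.13:3) involves all nodes of the element. Note
that we have

(22) $B_{1,2i-1} = B_{3,2i} = \dfrac{\partial h_i}{\partial x} = \dfrac{1}{J}\left( \dfrac{\partial h_i}{\partial\xi}\dfrac{\partial y}{\partial\eta} - \dfrac{\partial h_i}{\partial\eta}\dfrac{\partial y}{\partial\xi} \right)$,

$\quad\ \ B_{2,2i} = B_{3,2i-1} = \dfrac{\partial h_i}{\partial y} = \dfrac{1}{J}\left( \dfrac{\partial h_i}{\partial\eta}\dfrac{\partial x}{\partial\xi} - \dfrac{\partial h_i}{\partial\xi}\dfrac{\partial x}{\partial\eta} \right)$,

with unlisted $B$'s being zero. The geometry of the element affects the
element matrices through the values of $J$ and $\mathbf{J}^{-1}$.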

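For a 4-node quadrilateral element, from Eq. (18.13:2), we have

$\dfrac{\partial h_i}{\partial\xi} = \tfrac{1}{4}\xi_i(1 + \eta_i\eta)$, $\quad \dfrac{\partial h_i}{\partial\eta} = \tfrac{1}{4}\eta_i(1 + \xi_i\xi)$ $\quad$ ($i$ not summed).

Then

(22a) $$\mathbf{J} = \frac{1}{4} \begin{bmatrix} x_i\xi_i(1 + \eta_i\eta) & y_i\xi_i(1 + \eta_i\eta) \\ x_i\eta_i(1 + \xi_i\xi) & y_i\eta_i(1 + \xi_i\xi) \end{bmatrix},$$

$$\mathbf{J}^{-1} = \frac{1}{4J} \begin{bmatrix} y_i\eta_i(1 + \xi_i\xi) & -y_i\xi_i(1 + \eta_i\eta) \\ -x_i\eta_i(1 + \xi_i\xi) & x_i\xi_i(1 + \eta_i\eta) \end{bmatrix},$$

$$J = \frac{1}{16} \sum_{i,j=1}^{4} (x_iy_j - x_jy_i)\,\xi_i\eta_j\,(1 + \eta_i\eta)(1 + \xi_j\xi),$$

in which the repeated index $i$ in the matrices is summed over the four nodes.
The Jacobian $J$ is not a constant for elements other than rectangle and
parallelogram. From the expressions above, one can conclude that it is
best to numerically integrate Eqs. (15) and (16) for given $\mathbf{D}_e$.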
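The dependence of $J$ on the element shape can be illustrated with a short calculation. The sketch below evaluates $J = x_\xi y_\eta - x_\eta y_\xi$ for a 4-node quadrilateral and integrates it with $2 \times 2$ Gauss points to recover the element area. The node numbering (counterclockwise, starting at $\xi = \eta = -1$) and the function names are assumptions made for this example.

```python
from math import sqrt

# Natural coordinates of the 4 nodes; counterclockwise numbering starting at
# (-1, -1) is an assumption made for this sketch.
XI = [-1.0, 1.0, 1.0, -1.0]
ETA = [-1.0, -1.0, 1.0, 1.0]


def jacobian_det(xy, xi, eta):
    """J = x_xi * y_eta - x_eta * y_xi for a 4-node isoparametric element."""
    x_xi = sum(x * XI[i] * (1 + ETA[i] * eta) for i, (x, y) in enumerate(xy)) / 4
    y_xi = sum(y * XI[i] * (1 + ETA[i] * eta) for i, (x, y) in enumerate(xy)) / 4
    x_eta = sum(x * ETA[i] * (1 + XI[i] * xi) for i, (x, y) in enumerate(xy)) / 4
    y_eta = sum(y * ETA[i] * (1 + XI[i] * xi) for i, (x, y) in enumerate(xy)) / 4
    return x_xi * y_eta - x_eta * y_xi


def area_by_gauss(xy):
    """Integrate J over -1 <= xi, eta <= 1 with 2 x 2 Gauss points (unit weights)."""
    g = 1.0 / sqrt(3.0)
    return sum(jacobian_det(xy, a, b) for a in (-g, g) for b in (-g, g))
```

For a parallelogram the function `jacobian_det` returns the same value at every point, namely one quarter of the area, while for a general quadrilateral it varies linearly with $\xi$ and $\eta$.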

In integral formulation, the sum of the variation $\delta\Pi_e$ of all elements
must be zero, i.e.,

$\sum_{\text{all elements}} \delta\Pi_e = \sum_{\text{all elements}} \delta\mathbf{q}^T(\mathbf{k}\mathbf{q} - \mathbf{f}) = 0$

with the components of $\delta\mathbf{q}$ corresponding to rigid constraints being zero.
Plane Stress. For plane stress problems, if the material is orthotropic, we
have

(23) $$\mathbf{D}_e = \frac{1}{1 - \nu_{xy}\nu_{yx}} \begin{bmatrix} E_x & E_x\nu_{xy} & 0 \\ & E_y & 0 \\ \text{sym} & & G_{xy}(1 - \nu_{xy}\nu_{yx}) \end{bmatrix},$$

in which $E_x\nu_{xy} = E_y\nu_{yx}$. If the material is isotropic then $E_x = E_y = E$,
$\nu_{xy} = \nu_{yx} = \nu$, and $G_{xy} = G = E/[2(1 + \nu)]$, where $E$ is Young's modulus
and $\nu$ is Poisson's ratio.

Plane Strain. For plane strain problems, if the material is isotropic

(24) $$\mathbf{D}_e = \frac{E}{(1 + \nu)(1 - 2\nu)} \begin{bmatrix} 1 - \nu & \nu & 0 \\ & 1 - \nu & 0 \\ \text{sym} & & (1 - 2\nu)/2 \end{bmatrix}.$$
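Equations (23) and (24) translate directly into small routines. The following sketch builds the $3 \times 3$ matrices as nested lists, using the relation $E_x\nu_{xy} = E_y\nu_{yx}$ stated above to obtain $\nu_{yx}$. The function names and argument order are assumptions for illustration only.

```python
def plane_stress_orthotropic(Ex, Ey, nu_xy, G_xy):
    """D_e of Eq. (23); nu_yx follows from Ex * nu_xy = Ey * nu_yx."""
    nu_yx = Ex * nu_xy / Ey
    c = 1.0 / (1.0 - nu_xy * nu_yx)
    return [[c * Ex, c * Ex * nu_xy, 0.0],
            [c * Ex * nu_xy, c * Ey, 0.0],
            [0.0, 0.0, G_xy]]          # c * G_xy * (1 - nu_xy * nu_yx) = G_xy


def plane_stress_isotropic(E, nu):
    """Isotropic special case: Ex = Ey = E, G = E / [2 (1 + nu)]."""
    return plane_stress_orthotropic(E, E, nu, E / (2.0 * (1.0 + nu)))


def plane_strain_isotropic(E, nu):
    """D_e of Eq. (24)."""
    c = E / ((1.0 + nu) * (1.0 - 2.0 * nu))
    return [[c * (1.0 - nu), c * nu, 0.0],
            [c * nu, c * (1.0 - nu), 0.0],
            [0.0, 0.0, c * (1.0 - 2.0 * nu) / 2.0]]
```

Both isotropic matrices share the same shear entry $G$, and they coincide when $\nu = 0$. The plane strain factor grows without bound as $\nu \to 1/2$, which is relevant to the stiffening discussed for nearly incompressible materials.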


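Axisymmetric Problems. For axisymmetric problems, there are four
components of strain

$e_r = \dfrac{\partial u}{\partial r}$, $\quad e_\theta = \dfrac{u}{r}$, $\quad e_z = \dfrac{\partial w}{\partial z}$, $\quad \gamma_{rz} = \dfrac{\partial u}{\partial z} + \dfrac{\partial w}{\partial r}$,

which can be written in the form

$\mathbf{e} = \mathbf{d}_a\mathbf{u} = \mathbf{d}_a \begin{bmatrix} u \\ w \end{bmatrix}$,

where

$$\mathbf{d}_a^T = \begin{bmatrix} \dfrac{\partial}{\partial r} & \dfrac{1}{r} & 0 & \dfrac{\partial}{\partial z} \\ 0 & 0 & \dfrac{\partial}{\partial z} & \dfrac{\partial}{\partial r} \end{bmatrix}.$$

Expressing $u$, $w$ in the form of Eqs. (7) and (8), we obtain the element
matrices

$\mathbf{k} = \int_{A_e} (\mathbf{d}_a\mathbf{h})^T\mathbf{D}_a(\mathbf{d}_a\mathbf{h})\, r\, dr\, dz$,

$\mathbf{f} = \int_{A_e} \mathbf{h}^T\mathbf{b}\, r\, dr\, dz + \int_{\partial A_{\sigma e}} \mathbf{h}^T\bar{\mathbf{T}}\, ds$,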
which are similar to Eqs. (13) and (14) except that $\mathbf{D}_a$ is a $4 \times 4$ elastic
constant matrix and $\mathbf{d}_a$ is a $4 \times 2$ differential operator defined above. For
isotropic materials

(25) $$\mathbf{D}_a = \frac{E}{(1 + \nu)(1 - 2\nu)} \begin{bmatrix} 1 - \nu & & & \text{sym} \\ \nu & 1 - \nu & & \\ \nu & \nu & 1 - \nu & \\ 0 & 0 & 0 & (1 - 2\nu)/2 \end{bmatrix}.$$

Problem 18.11. Determine the nominal rank of $\mathbf{k}$ for axially symmetric
deformation.

Problem 18.12. Consider the coordinate transform
$r = r(\xi, \eta)$, $z = z(\xi, \eta)$.

Find $\mathbf{d}_a$ in terms of the differentials $\partial/\partial\xi$, $\partial/\partial\eta$.

Incompatible Element. The accuracy of triangular and rectangular elements,
more so for triangles, deteriorates when the aspect ratio (the ratio of
one dimension of the element to the other) becomes large. In particular,
when Poisson's ratio approaches 1/2, i.e., the material approaches
incompressibility, the element will be overly stiff. The effect is especially
pronounced in simulating bending of slender beams using plane elements. If
only one element is used through the thickness, the numerical result is
practically worthless. It will be shown later that the overly stiff behavior is
caused by spurious shear. Wilson et al. (1973) introduced quadratic terms
in the deformation such that

(26) $\mathbf{u} = \mathbf{h}\mathbf{q} + (1 - \xi^2)\boldsymbol{\alpha}_5 + (1 - \eta^2)\boldsymbol{\alpha}_6$,

where $\mathbf{h}$ is the same as that for a 4-node rectangle given in Eq. (10) and the
$\boldsymbol{\alpha}$'s are unknown parameters. The additional terms are zero at the 4 nodes and,
therefore, do not affect the physical meaning of $\mathbf{q}$. But these terms result in
discontinuous displacements between two elements along the interelement
boundaries.

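The strain matrix can be written as

(27) $\mathbf{e} = \mathbf{B}\mathbf{q} + \dfrac{2\xi}{J}\mathbf{B}_5\boldsymbol{\alpha}_5 + \dfrac{2\eta}{J}\mathbf{B}_6\boldsymbol{\alpha}_6$,

where

(28) $$\mathbf{B}_5 = \begin{bmatrix} -y_\eta & 0 \\ 0 & x_\eta \\ x_\eta & -y_\eta \end{bmatrix}, \qquad
\mathbf{B}_6 = \begin{bmatrix} y_\xi & 0 \\ 0 & -x_\xi \\ -x_\xi & y_\xi \end{bmatrix},$$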
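in which $\mathbf{B}$ is the same as that of Eq. (11), $J$ is given in Eq. (22a), and
$(\ )_\xi$, $(\ )_\eta$ denote partial differentiation with respect to $\xi$, $\eta$, respectively.
Note that both $\mathbf{B}_5$ and $\mathbf{B}_6$ are constant for rectangle and parallelogram. A
substitution of Eq. (27) into Eq. (6) yields

(29) $\Pi_e = \tfrac{1}{2}\mathbf{q}^T\mathbf{k}\mathbf{q} + \boldsymbol{\alpha}^T\mathbf{k}_{\alpha q}\mathbf{q} + \tfrac{1}{2}\boldsymbol{\alpha}^T\mathbf{k}_{\alpha\alpha}\boldsymbol{\alpha} - \mathbf{q}^T\mathbf{f} - \boldsymbol{\alpha}^T\mathbf{f}_\alpha$,

where $\boldsymbol{\alpha}^T = [\boldsymbol{\alpha}_5^T\ \ \boldsymbol{\alpha}_6^T]$, $\mathbf{k}$ and $\mathbf{f}$ are the usual $8 \times 8$ stiffness and
$8 \times 1$ force matrices of the 4-node quadrilateral as those given in Eqs. (15)
and (16), $\mathbf{f}_\alpha$ is the force matrix associated with $\boldsymbol{\alpha}$, and

$\mathbf{k}_{\alpha q} = \int [2\xi\mathbf{B}_5\ \ 2\eta\mathbf{B}_6]^T\,\mathbf{D}_e\mathbf{B}\; d\xi\, d\eta$,

$\mathbf{k}_{\alpha\alpha} = \int [2\xi\mathbf{B}_5\ \ 2\eta\mathbf{B}_6]^T\,\mathbf{D}_e\,[2\xi\mathbf{B}_5\ \ 2\eta\mathbf{B}_6]\, \dfrac{d\xi\, d\eta}{J}$.

Defining $\mathbf{q}'^T = [\mathbf{q}^T\ \ \boldsymbol{\alpha}^T]$, we can write Eq. (29) in the standard form with
the element stiffness and force matrices associated with $\mathbf{q}'$ being

$$\begin{bmatrix} \mathbf{k} & \mathbf{k}_{\alpha q}^T \\ \mathbf{k}_{\alpha q} & \mathbf{k}_{\alpha\alpha} \end{bmatrix}, \qquad \begin{bmatrix} \mathbf{f} \\ \mathbf{f}_\alpha \end{bmatrix}.$$

By virtue of the fact that $\boldsymbol{\alpha}$ does not appear in other elements, the vanishing
of the first variation of the functional with respect to $\boldsymbol{\alpha}$ implies $\delta\Pi_e = 0$,
i.e.,

$\mathbf{k}_{\alpha q}\mathbf{q} + \mathbf{k}_{\alpha\alpha}\boldsymbol{\alpha} - \mathbf{f}_\alpha = 0$ $\quad$ or $\quad$ $\boldsymbol{\alpha} = -\mathbf{k}_{\alpha\alpha}^{-1}\mathbf{k}_{\alpha q}\mathbf{q} + \mathbf{k}_{\alpha\alpha}^{-1}\mathbf{f}_\alpha$,

which can be used to eliminate $\boldsymbol{\alpha}$ from Eq. (29). This static condensation
process leads to the following modified element stiffness and force matrices
associated with the column matrix $\mathbf{q}$

$\mathbf{k}^* = \mathbf{k} - \mathbf{k}_{\alpha q}^T\mathbf{k}_{\alpha\alpha}^{-1}\mathbf{k}_{\alpha q}$,

$\mathbf{f}^* = \mathbf{f} - \mathbf{k}_{\alpha q}^T\mathbf{k}_{\alpha\alpha}^{-1}\mathbf{f}_\alpha$,

for the incompatible element.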
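The static condensation step is purely algebraic and can be sketched independently of the element type. In the example below, the routines `solve` and `condense` and the small sample matrices are hypothetical; they only illustrate that eliminating $\boldsymbol{\alpha}$ first and then solving for $\mathbf{q}$ gives the same $\mathbf{q}$ as solving the full system.

```python
def solve(a, b):
    """Gauss-Jordan elimination with partial pivoting; b is a matrix (list of rows)."""
    n = len(a)
    m = [list(a[i]) + list(b[i]) for i in range(n)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(m[r][c]))
        m[c], m[p] = m[p], m[c]
        piv = m[c][c]
        m[c] = [v / piv for v in m[c]]
        for r in range(n):
            if r != c:
                fac = m[r][c]
                m[r] = [vr - fac * vc for vr, vc in zip(m[r], m[c])]
    return [row[n:] for row in m]


def condense(k, k_aq, k_aa, f, f_a):
    """Return k* = k - k_aq^T k_aa^-1 k_aq and f* = f - k_aq^T k_aa^-1 f_a."""
    nq, na = len(k), len(k_aa)
    x = solve(k_aa, k_aq)                    # k_aa^-1 k_aq, size na x nq
    y = solve(k_aa, [[v] for v in f_a])      # k_aa^-1 f_a, size na x 1
    k_star = [[k[i][j] - sum(k_aq[a][i] * x[a][j] for a in range(na))
               for j in range(nq)] for i in range(nq)]
    f_star = [f[i] - sum(k_aq[a][i] * y[a][0] for a in range(na))
              for i in range(nq)]
    return k_star, f_star


# Hypothetical data: two retained unknowns q and one internal parameter alpha.
k = [[4.0, 1.0], [1.0, 3.0]]
k_aq = [[1.0, 2.0]]
k_aa = [[5.0]]
f = [1.0, 2.0]
f_a = [3.0]
k_star, f_star = condense(k, k_aq, k_aa, f, f_a)
q = [row[0] for row in solve(k_star, [[v] for v in f_star])]
```

Because the condensation is carried out element by element, the assembled system retains only the nodal unknowns $\mathbf{q}$.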

The incompatible element provides improved performance in bending
applications. However, its behavior is erratic when the element is not a
rectangle or parallelogram. Taylor et al. (1976) modified the incompatible
modes by treating $\mathbf{B}_5$, $\mathbf{B}_6$ of Eq. (28) as constant with their values evaluated
at $\xi = \eta = 0$. The motivation of the modification is to remove the spurious
shear induced by the isoparametric transformation when the element is
not a rectangle or parallelogram. Approximating $\mathbf{B}_5$, $\mathbf{B}_6$ as constants in
the evaluation of element matrices is a form of reduced integration, which
is a technique often used to reduce the stiff behavior of certain elements.
Taylor's modification also makes the element satisfy the patch test, a
condition required for the convergence of incompatible elements. The details
of reduced integration and the patch test will be discussed in later sections.



Fig. 18.14:1. Coordinate rotation.

Rotation. In order to use the assembly procedures described in Sec. 18.4,
all generalized coordinates of different elements must have the same
physical meaning at their common nodes. For example, in plane elasticity
just considered, the components of the displacement vectors of different
elements at a common node must refer to a common coordinate system. In
practice, only one such coordinate system, the global system, is used
for all the nodes of the whole body. However, in generating the element
matrices, a local coordinate system may be more convenient. Modeling a
two dimensional truss by one-dimensional rods as discussed in Sec. 18.6 is
such an example. For orthotropic materials, it may be desirable to have
the local coordinates aligned with the material coordinates of the element
in deriving the element matrices. Let $x'$, $y'$ be the global coordinates and
$x$, $y$ be the local coordinates as shown in Fig. 18.14:1. The two systems are
related by

$\mathbf{x} = \boldsymbol{\beta}\mathbf{x}'$,

where

$$\boldsymbol{\beta} = \begin{bmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{bmatrix}.$$

The displacements at the $j$th node are transformed by
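$$\begin{bmatrix} q_{2j-1} \\ q_{2j} \end{bmatrix} = \boldsymbol{\beta}_j \begin{bmatrix} q'_{2j-1} \\ q'_{2j} \end{bmatrix},$$

where $\boldsymbol{\beta}_j$ is the rotation matrix for node $j$, which can be different for different
nodes. For an element with $p$ nodes, we have

(30) $\mathbf{q} = \mathbf{R}\mathbf{q}'$,

where

$$\underset{2p \times 2p}{\mathbf{R}} = \begin{bmatrix} \boldsymbol{\beta}_1 & & 0 \\ & \ddots & \\ 0 & & \boldsymbol{\beta}_p \end{bmatrix}$$

is the rotation matrix for the element. In general, the $\boldsymbol{\beta}$'s can be different from
each other. The number of submatrices on the diagonal of $\mathbf{R}$ is equal to
the number of nodes in the element. The element matrices associated with
$\mathbf{q}'$ are derived by substituting Eq. (30) into Eq. (12):

$\delta\Pi_e = \delta\mathbf{q}^T(\mathbf{k}\mathbf{q} - \mathbf{f}) = \delta\mathbf{q}'^T(\mathbf{k}'\mathbf{q}' - \mathbf{f}')$,

where

(31) $\mathbf{k}' = \mathbf{R}^T\mathbf{k}\mathbf{R}$, $\quad \mathbf{f}' = \mathbf{R}^T\mathbf{f}$.

The element matrices $\mathbf{k}'$ and $\mathbf{f}'$ are used for assembly.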
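The transformation of Eq. (31) can be sketched as follows for a two-node element with two degrees-of-freedom per node. The helper names (`node_rotation`, `element_rotation`, `rotate_element`) and the sample bar stiffness are assumptions for illustration. The rotation block follows the form of $\boldsymbol{\beta}$ given above, in which local components equal $\boldsymbol{\beta}$ times global components.

```python
from math import cos, sin, pi


def matmul(a, b):
    """Plain-list matrix product."""
    return [[sum(a[i][m] * b[m][j] for m in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def transpose(a):
    return [list(col) for col in zip(*a)]


def node_rotation(theta):
    """beta of the text: local components = beta * global components."""
    return [[cos(theta), -sin(theta)], [sin(theta), cos(theta)]]


def element_rotation(thetas):
    """Block-diagonal R of Eq. (30), one 2 x 2 block per node."""
    n = 2 * len(thetas)
    R = [[0.0] * n for _ in range(n)]
    for j, th in enumerate(thetas):
        beta = node_rotation(th)
        for a in range(2):
            for b in range(2):
                R[2 * j + a][2 * j + b] = beta[a][b]
    return R


def rotate_element(k, f, thetas):
    """Return k' = R^T k R and f' = R^T f of Eq. (31)."""
    R = element_rotation(thetas)
    Rt = transpose(R)
    k_g = matmul(matmul(Rt, k), R)
    f_g = [row[0] for row in matmul(Rt, [[v] for v in f])]
    return k_g, f_g


# Hypothetical local matrices of an axial bar (stiffness only along local x).
k_local = [[1.0, 0.0, -1.0, 0.0],
           [0.0, 0.0, 0.0, 0.0],
           [-1.0, 0.0, 1.0, 0.0],
           [0.0, 0.0, 0.0, 0.0]]
f_local = [0.0, 0.0, 1.0, 0.0]
k_global, f_global = rotate_element(k_local, f_local, [pi / 2, pi / 2])
```

Since $\mathbf{R}$ is orthogonal, the strain energy $\tfrac{1}{2}\mathbf{q}^T\mathbf{k}\mathbf{q}$ and the trace of the stiffness matrix are unchanged by the rotation.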

There are other circumstances in which one must rotate the coordinates.
In mixed-boundary problems, for instance, the components of the displacement
can be prescribed in certain directions while the components of traction
are prescribed in another direction, and these directions are not the
same as the global coordinates or the local coordinates of the element. This
will require a transformation of the coordinates of those boundary nodes
to the directions of prescribed displacements and tractions, and a
transformation of other nodes to the global directions, if needed. The element
matrices would have to be modified accordingly.

In summary, the finite element method is to derive an approximate
solution based on the weak form formulation, the principle of virtual work
or variational calculus. The method expresses all field variables in terms
of shape functions in the construction of the element matrices. For most
cases, the shape functions are simple polynomials independent of the type
of problems. Thus the method allows generating element matrices in a
generic way to approximate the behavior of elements. Then, before solving
for the nodal unknowns on high speed computers, the element matrices
are assembled systematically into a system of algebraic equations for the
unknowns and constraints are imposed to enforce the rigid boundary
conditions. The process of constructing and assembling element matrices, and
enforcing rigid constraints is based on the condition that the sum of the
first variation of the functionals of all elements is zero, i.e.,

$\sum_{\text{all elements}} \delta\Pi_e = \sum_{\text{all elements}} \delta\mathbf{q}^T(\mathbf{k}\mathbf{q} - \mathbf{f}) = 0$

with the components of $\delta\mathbf{q}$ corresponding to rigid constraints being zero.
We denote symbolically the process as

(32) $\mathbf{K}\mathbf{q} - \mathbf{F} = \int_A \mathbf{B}^T\mathbf{D}_e\mathbf{B}\mathbf{q}\, dA - \left( \int_A \mathbf{h}^T\mathbf{b}\, dA + \int_{\partial A_\sigma} \mathbf{h}^T\bar{\mathbf{T}}\, ds \right) = 0$.

Here $\mathbf{q}$ represents the nodal unknown matrix for the whole system.
Equation (32) is then solved using high speed computers. Thus in the
finite element method, solving a set of differential equations with prescribed
boundary conditions becomes the task of discretizing the domain by
elements, constructing the element matrices, labeling the nodes, assembling
the element matrices to form a set of algebraic equations, imposing
constraints to enforce the rigid boundary conditions, and finally solving the
algebraic equations.
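The sequence of steps summarized above can be sketched on the simplest possible case. The example below is not the plane elasticity element of this section; it uses a chain of two-node axial bar elements with one degree-of-freedom per node, chosen only to keep the assembly and constraint steps visible. All function names and numerical values are assumptions.

```python
def solve(a, b):
    """Gauss-Jordan elimination with partial pivoting for a x = b (b a vector)."""
    n = len(a)
    m = [list(a[i]) + [b[i]] for i in range(n)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(m[r][c]))
        m[c], m[p] = m[p], m[c]
        piv = m[c][c]
        m[c] = [v / piv for v in m[c]]
        for r in range(n):
            if r != c:
                fac = m[r][c]
                m[r] = [vr - fac * vc for vr, vc in zip(m[r], m[c])]
    return [row[n] for row in m]


def assemble_bar_chain(n_elem, length, EA):
    """Assemble K for a chain of equal two-node bars (1 DOF per node)."""
    n = n_elem + 1
    K = [[0.0] * n for _ in range(n)]
    ke = EA / length
    k_elem = [[ke, -ke], [-ke, ke]]
    for e in range(n_elem):
        dofs = (e, e + 1)                  # global labels of the element nodes
        for a in range(2):
            for b in range(2):
                K[dofs[a]][dofs[b]] += k_elem[a][b]
    return K


def solve_constrained(K, F, fixed):
    """Enforce zero prescribed values by dropping the constrained rows and columns."""
    free = [i for i in range(len(K)) if i not in fixed]
    Kff = [[K[i][j] for j in free] for i in free]
    qf = solve(Kff, [F[i] for i in free])
    q = [0.0] * len(K)
    for i, v in zip(free, qf):
        q[i] = v
    return q


# Four elements of unit length, EA = 10, fixed at node 0, end load P = 5.
K = assemble_bar_chain(4, 1.0, 10.0)
F = [0.0, 0.0, 0.0, 0.0, 5.0]
q = solve_constrained(K, F, {0})
```

Before the constraint is imposed the assembled matrix is singular, since the chain can translate rigidly; removing the constrained degree-of-freedom makes the system solvable.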

Problem 18.13. Show that rotation does not change the rank of k. 


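Problem 18.14. Let the constitutive law be in the form

$\boldsymbol{\sigma} = \mathbf{D}_t\,[e_{xx}\ \ e_{yy}\ \ e_{xy}]^T$,

where

$e_{xx} = \dfrac{\partial u}{\partial x}$, $\quad e_{yy} = \dfrac{\partial v}{\partial y}$, $\quad e_{xy} = \dfrac{1}{2}\left( \dfrac{\partial u}{\partial y} + \dfrac{\partial v}{\partial x} \right)$

are components of the strain tensor. Show that $\mathbf{D}_t$ is in general not symmetric.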


18.15. THREE-DIMENSIONAL SHAPE FUNCTIONS 

Generalizing the finite element process to three-dimensional continua is 
conceptually straightforward. It is just a matter of discretizing the three 
dimensional body into a finite number of solid elements, deriving the appro¬ 
priate three-dimensional shape functions, and generating the corresponding 
element matrices based on an appropriate integral formulation. The proce¬ 
dures of assembling, constraining, and solving algebraic equations for one-, 
two- or three-dimensional problems are all the same. The complication 
of three-dimensional problems is more in the implementation such as the
description of the geometry, loading, higher semi-bandwidth, many more 
algebraic equations, etc. Discretizing a complex three-dimensional body 
into a finite number of solids, especially the hexahedral ones, is not a trivial 
matter. Labeling all the nodes to give an optimum average semi-bandwidth 
is much more complicated than that for the two dimensional case. 

In this section we shall limit ourselves to the derivation of three dimen¬ 
sional shape functions and the corresponding element matrices. 

Shape Function Variation on Element Boundaries. The variation of 
shape functions is restricted by the requirement of full compatibility , i.e., the 
compatibility of a field variable at the inter-connecting nodes of two adja¬ 
cent elements guarantees compatibility over the entire inter-element bound¬ 
ary containing these nodes. If the shape functions on an element boundary 
surface are polynomials of the surface coordinates and if the highest order 
of the polynomials is $j$, it will require $(j + 1)(j + 2)/2$, the number of
independent terms in a complete $j$th order polynomial, matching conditions
to assure that the interpolated functions of the adjacent elements are fully
matched, i.e., continuous everywhere over the inter-element boundary. In
other words, if the shape function involves $j$th order terms of polynomial
on element boundaries, it needs $(j + 1)(j + 2)/2$ inter-connecting nodes on
each inter-element surface to guarantee continuity of two functions over the
common boundary of two elements. For example, continuity is assured for
a 3-node surface if all shape functions are linear, for a 6-node surface if they
are quadratic, for a 10-node surface if they are cubic, and so on.

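Fig. 18.15:1. Tetrahedral elements.

Shape Functions for Tetrahedron. Consider a family of progressively
higher order tetrahedral elements with node $j$ at $(x_j, y_j, z_j)$ as shown in
Fig. 18.15:1. For convenience, we shall describe the elements using the
volume coordinates,

(1) $\zeta_j = \dfrac{a_j + b_jx + c_jy + d_jz}{6\Delta_v}$,

where $\Delta_v$ is the volume of the tetrahedral element

(2) $$\Delta_v = \frac{1}{6} \begin{vmatrix} 1 & 1 & 1 & 1 \\ x_1 & x_2 & x_3 & x_4 \\ y_1 & y_2 & y_3 & y_4 \\ z_1 & z_2 & z_3 & z_4 \end{vmatrix},$$

(3) $$a_1 = \begin{vmatrix} x_2 & x_3 & x_4 \\ y_2 & y_3 & y_4 \\ z_2 & z_3 & z_4 \end{vmatrix}, \qquad
b_1 = -\begin{vmatrix} 1 & 1 & 1 \\ y_2 & y_3 & y_4 \\ z_2 & z_3 & z_4 \end{vmatrix},$$

$$c_1 = \begin{vmatrix} 1 & 1 & 1 \\ x_2 & x_3 & x_4 \\ z_2 & z_3 & z_4 \end{vmatrix}, \qquad
d_1 = -\begin{vmatrix} 1 & 1 & 1 \\ x_2 & x_3 & x_4 \\ y_2 & y_3 & y_4 \end{vmatrix},$$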

with the other $a$'s, $b$'s, $c$'s and $d$'s obtainable through the cyclic permutation
of the subscripts 1, 2, 3, 4 and $\Delta_v$ being the volume of the tetrahedron. The $\zeta$'s
are the natural coordinates for tetrahedron and, similar to the triangular
coordinates for triangle, are normalized volumes of certain tetrahedra, e.g.,

(4) $\zeta_1 = \dfrac{\Delta_{P234}}{\Delta_v}$,

where $\Delta_{P234}$ is the volume of the tetrahedron with nodes $P$, 2, 3, 4 in which
$P$ is a point within the tetrahedron. The other volume coordinates $\zeta_2$, $\zeta_3$,
$\zeta_4$ are obtained by the cyclic permutations of the subscripts. Note that the
$\zeta$'s are not independent; they satisfy

$\zeta_1 + \zeta_2 + \zeta_3 + \zeta_4 = 1$.

The evaluation of the element matrices of higher order elements will involve
integration of quantities defined in terms of the volume coordinates. The
following exact integration formula is useful

$\int_{\Delta_v} \zeta_1^\alpha\zeta_2^\beta\zeta_3^\gamma\zeta_4^\lambda\, dV = 6\Delta_v\, \dfrac{\alpha!\,\beta!\,\gamma!\,\lambda!}{(\alpha + \beta + \gamma + \lambda + 3)!}$,
which is similar to the formula for the triangular coordinates in two
dimensions. A field variable $\phi$ can be represented over an element by a
three-dimensional function that interpolates the nodal values $q_j$ of the field
variable, i.e.,

(5) $\phi = \sum_{j=1}^{p} h_j(x, y, z)\, q_j$,

where $p$ is the number of nodes of the element and $h_j$ are shape functions.
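The volume coordinates and the integration formula above lend themselves to a direct numerical check. The sketch below computes $\zeta_j$ as the ratio of signed volumes, in the sense of Eq. (4), and evaluates the factorial formula. The function names are assumptions, and the signed volume is computed from the edge vectors, which is equivalent to the $4 \times 4$ determinant of Eq. (2) after subtracting the first column from the others.

```python
from math import factorial


def det3(m):
    """Determinant of a 3 x 3 matrix given as nested lists."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def tet_volume(nodes):
    """Signed volume; equals the 4 x 4 determinant of Eq. (2) divided by 6."""
    p1 = nodes[0]
    edges = [[p[i] - p1[i] for i in range(3)] for p in nodes[1:]]
    return det3(edges) / 6.0


def volume_coords(nodes, p):
    """zeta_j = volume of the tetrahedron with node j replaced by P, over the total."""
    total = tet_volume(nodes)
    zetas = []
    for j in range(4):
        sub = list(nodes)
        sub[j] = p
        zetas.append(tet_volume(sub) / total)
    return zetas


def integrate_monomial(volume, a, b, c, d):
    """Exact integral of zeta1^a zeta2^b zeta3^c zeta4^d over the tetrahedron."""
    num = factorial(a) * factorial(b) * factorial(c) * factorial(d)
    return 6.0 * volume * num / factorial(a + b + c + d + 3)
```

With all exponents zero the formula returns the volume itself, and with a single unit exponent it returns one quarter of the volume, as expected from symmetry.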

Four-node Tetrahedron. A four-node tetrahedron is the simplest solid
element [Fig. 18.15:1(a)]. The shape functions are simply

(6) $h_j = \zeta_j$, $\quad j = 1, 2, 3, 4$,

which are linear functions of $x$, $y$, $z$. The shape functions are linear on
the element boundary and thus require 3 inter-connecting nodes between
adjacent elements for full compatibility. Since each side of the tetrahedron has
3 nodes, full compatibility can be assured. The strains in the element are
constant. Therefore, a 4-node tetrahedral element is also called a constant
strain tetrahedron.

Ten-node Tetrahedron [Fig. 18.15:1(b)]. The shape function $h_j$, which
equals 1 at node $j$ and zero at all other nodes, can be generated from the
product of two functions, each of which represents a plane passing through
nodes other than node $j$. The shape functions are

$h_j = \zeta_j(2\zeta_j - 1)$, $\quad j = 1, 2, 3, 4$,

(7) $h_5 = 4\zeta_1\zeta_2$, $\ h_6 = 4\zeta_1\zeta_3$, $\ h_7 = 4\zeta_1\zeta_4$, $\ h_8 = 4\zeta_2\zeta_3$,
$\quad\ \ h_9 = 4\zeta_3\zeta_4$, $\ h_{10} = 4\zeta_4\zeta_2$.

These shape functions can represent a complete quadratic polynomial.
Since a complete quadratic polynomial on a surface has 6 terms, it
requires 6 inter-connecting nodes for full compatibility. This condition is met
by the 10-node tetrahedron.
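The defining property of the shape functions of Eq. (7), unity at their own node and zero at the other nine, can be verified with a few lines. In this sketch the midside node locations in volume coordinates (node 5 between nodes 1 and 2, and so on) are inferred from the products in Eq. (7); the names `tet10_shape` and `TET10_NODES` are assumptions.

```python
def tet10_shape(z):
    """Shape functions h_1..h_10 of Eq. (7) at volume coordinates z = (z1, z2, z3, z4)."""
    z1, z2, z3, z4 = z
    corners = [zj * (2.0 * zj - 1.0) for zj in z]
    mids = [4.0 * z1 * z2, 4.0 * z1 * z3, 4.0 * z1 * z4,
            4.0 * z2 * z3, 4.0 * z3 * z4, 4.0 * z4 * z2]
    return corners + mids


# Volume coordinates of the 10 nodes: 4 corners followed by 6 midside nodes.
TET10_NODES = [
    (1.0, 0.0, 0.0, 0.0), (0.0, 1.0, 0.0, 0.0),
    (0.0, 0.0, 1.0, 0.0), (0.0, 0.0, 0.0, 1.0),
    (0.5, 0.5, 0.0, 0.0), (0.5, 0.0, 0.5, 0.0), (0.5, 0.0, 0.0, 0.5),
    (0.0, 0.5, 0.5, 0.0), (0.0, 0.0, 0.5, 0.5), (0.0, 0.5, 0.0, 0.5),
]
```

The ten functions also sum to one at every point, since $2(\sum\zeta_j)^2 - \sum\zeta_j = 1$ when the $\zeta$'s sum to one.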

Higher order shape functions can be derived from the products of three
plane functions, the products of four plane functions, and so on. For a
20-node tetrahedron, there are 16 boundary nodes on the edges and 4 at
the centroids of the boundary faces. The shape functions are cubic. The
quartic tetrahedron has 35 nodes, 22 of which are on the six edges, 12 on
the four faces, and one, an interior node, at the volume centroid of the
tetrahedron. One can show that polynomial shape functions can also meet
the full compatibility requirement.
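The node counts quoted above follow from the number of terms in a complete polynomial of order $n$, namely $(n + 1)(n + 2)(n + 3)/6$ in three dimensions and $(n + 1)(n + 2)/2$ on a face. The small sketch below tabulates the counts for a tetrahedron with equally spaced nodes; the function name and the returned dictionary keys are assumptions.

```python
def tet_node_counts(n):
    """Node counts of an order-n tetrahedron with equally spaced nodes."""
    total = (n + 1) * (n + 2) * (n + 3) // 6        # complete 3D polynomial terms
    per_face = (n + 1) * (n + 2) // 2               # complete 2D polynomial terms
    on_edges = 4 + 6 * (n - 1)                      # corners plus edge-interior nodes
    face_interior = 4 * ((n - 1) * (n - 2) // 2)    # nodes strictly inside the faces
    interior = (n - 1) * (n - 2) * (n - 3) // 6     # nodes strictly inside the volume
    return {"total": total, "per_face": per_face, "on_edges": on_edges,
            "face_interior": face_interior, "interior": interior}
```

The three partial counts always add up to the total, and the number of nodes per face equals the number of matching conditions required for full compatibility.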

Fig. 18.15:2. 8- and 27-node brick elements.

Shape Functions for Hexahedron. Two hexahedral elements are shown
in Fig. 18.15:2. The shape functions for hexahedron are very much like
those of quadrilateral elements except that now we have three independent
variables $x$, $y$, $z$. The shape functions can be derived easily by the use
of the natural coordinates $\xi$, $\eta$, $\zeta$. In natural coordinates, the element is a
cube with faces bounded by $\xi = \pm 1$, $\eta = \pm 1$, $\zeta = \pm 1$. The shape functions
are simply the product of a minimum number of functions of planes passing
through one or more nodes of the element. For the shape function associated
with node $j$, all nodes except node $j$ must be passed through by at least one of
these planes and none of these planes should pass through node $j$. For an
8-node cube, the shape functions in terms of the natural coordinates are

(8) $h_j(\xi, \eta, \zeta) = (1 + \xi_j\xi)(1 + \eta_j\eta)(1 + \zeta_j\zeta)/8$,

($j$ not summed), where

$\xi_1 = \xi_4 = \xi_5 = \xi_8 = \eta_1 = \eta_2 = \eta_5 = \eta_6 = \zeta_1 = \zeta_2 = \zeta_3 = \zeta_4 = -1$,

$\xi_2 = \xi_3 = \xi_6 = \xi_7 = \eta_3 = \eta_4 = \eta_7 = \eta_8 = \zeta_5 = \zeta_6 = \zeta_7 = \zeta_8 = 1$.

Equation (8) is a product of three plane functions $1 + \xi_j\xi = 0$, $1 + \eta_j\eta = 0$
and $1 + \zeta_j\zeta = 0$. The shape functions on an element boundary surface are
bilinear and can have at most 4 independent terms. Since each boundary
surface has four nodes, full compatibility is assured.
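Equation (8) and the nodal sign pattern listed with it can be coded directly. The following sketch stores the signs $\xi_j$, $\eta_j$, $\zeta_j$ in three lists and evaluates all eight shape functions at a point; the names `XI`, `ETA`, `ZETA` and `brick8_shape` are assumptions.

```python
# Natural coordinates of the 8 nodes, following the sign pattern given with Eq. (8).
XI = [-1.0, 1.0, 1.0, -1.0, -1.0, 1.0, 1.0, -1.0]
ETA = [-1.0, -1.0, 1.0, 1.0, -1.0, -1.0, 1.0, 1.0]
ZETA = [-1.0, -1.0, -1.0, -1.0, 1.0, 1.0, 1.0, 1.0]


def brick8_shape(xi, eta, zeta):
    """Shape functions h_1..h_8 of Eq. (8) at the point (xi, eta, zeta)."""
    return [(1.0 + XI[j] * xi) * (1.0 + ETA[j] * eta) * (1.0 + ZETA[j] * zeta) / 8.0
            for j in range(8)]
```

On the face $\zeta = -1$ only $h_1$ to $h_4$ are nonzero and they reduce to the bilinear functions of the 4-node quadrilateral, which is the compatibility property noted above.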

The shape functions for a 27-node cube can be derived also. The shape
function associated with node 27, the centroid node, is a bulb function

$h_{27} = (1 - \xi^2)(1 - \eta^2)(1 - \zeta^2)$.

The shape functions associated with nodes 21, 22 and 23, the midface nodes
adjacent to node 1 at $\zeta = -1$, $\xi = -1$ and $\eta = -1$, respectively, are

$h_{21} = (1 - \xi^2)(1 - \eta^2)(1 - \zeta)/2 - h_{27}/2$,
$h_{22} = (1 - \eta^2)(1 - \zeta^2)(1 - \xi)/2 - h_{27}/2$,
$h_{23} = (1 - \zeta^2)(1 - \xi^2)(1 - \eta)/2 - h_{27}/2$.

The shape functions associated with nodes 9, 12 and 13, the midedge nodes
adjacent to node 1, are

$h_9 = (1 - \xi)(1 + \xi)(1 - \eta)(1 - \zeta)/4 - (h_{21} + h_{23})/2 - h_{27}/4$,

$h_{12} = (1 - \eta)(1 + \eta)(1 - \zeta)(1 - \xi)/4 - (h_{22} + h_{21})/2 - h_{27}/4$,

$h_{13} = (1 - \zeta)(1 + \zeta)(1 - \xi)(1 - \eta)/4 - (h_{23} + h_{22})/2 - h_{27}/4$.

The shape function $h_9$ is the product of the functions of four planes
normalized to unity at $\xi = 0$, $\eta = \zeta = -1$ minus a quarter of $h_{27}$ and
one half of $h_{21}$ and $h_{23}$. The shape function associated with node 1, a
corner-node, is

(9) $h_1 = \dfrac{(1 - \xi)(1 - \eta)(1 - \zeta)}{8} - \dfrac{h_9 + h_{12} + h_{13}}{2} - \dfrac{h_{21} + h_{22} + h_{23}}{4} - \dfrac{h_{27}}{8}$,

which equals the corresponding shape function of the 8-node brick element
minus one eighth of the bulb function, one half of the sum of the
shape functions of the three midedge nodes and one quarter of the sum of the
shape functions of the three midface nodes adjacent to node 1. These
coefficients follow from requiring $h_1$ to vanish at the midedge nodes, where the
8-node function equals 1/2, and at the midface nodes, where it equals 1/4. The shape
functions are bi-quadratic on the element boundaries. The bi-quadratic
functions have 9 independent terms. The 27-node hexahedron has 9 nodes
on each face and thus assures full compatibility. The shape functions for
a 20-node element can be obtained from those of the 27-node element by
setting $h_{21} = \cdots = h_{27} = 0$.
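The hierarchical construction of the 27-node shape functions can be checked at the nodes adjacent to node 1. The sketch below codes the bulb, midface, midedge and corner functions in that order and compares the corner function with the product of one-dimensional quadratic Lagrange polynomials, $\xi(\xi - 1)\eta(\eta - 1)\zeta(\zeta - 1)/8$. The function names and the coefficients 1/2 (midedge) and 1/4 (midface) in `h1` are assumptions of this sketch, chosen so that $h_1$ vanishes at those nodes.

```python
def h27(x, y, z):
    """Bulb function of the centroid node."""
    return (1 - x * x) * (1 - y * y) * (1 - z * z)


def h21(x, y, z):  # midface node at zeta = -1
    return (1 - x * x) * (1 - y * y) * (1 - z) / 2 - h27(x, y, z) / 2


def h22(x, y, z):  # midface node at xi = -1
    return (1 - y * y) * (1 - z * z) * (1 - x) / 2 - h27(x, y, z) / 2


def h23(x, y, z):  # midface node at eta = -1
    return (1 - z * z) * (1 - x * x) * (1 - y) / 2 - h27(x, y, z) / 2


def h9(x, y, z):   # midedge node at (0, -1, -1)
    return ((1 - x) * (1 + x) * (1 - y) * (1 - z) / 4
            - (h21(x, y, z) + h23(x, y, z)) / 2 - h27(x, y, z) / 4)


def h12(x, y, z):  # midedge node at (-1, 0, -1)
    return ((1 - y) * (1 + y) * (1 - z) * (1 - x) / 4
            - (h22(x, y, z) + h21(x, y, z)) / 2 - h27(x, y, z) / 4)


def h13(x, y, z):  # midedge node at (-1, -1, 0)
    return ((1 - z) * (1 + z) * (1 - x) * (1 - y) / 4
            - (h23(x, y, z) + h22(x, y, z)) / 2 - h27(x, y, z) / 4)


def h1(x, y, z):   # corner node at (-1, -1, -1)
    return ((1 - x) * (1 - y) * (1 - z) / 8
            - (h9(x, y, z) + h12(x, y, z) + h13(x, y, z)) / 2
            - (h21(x, y, z) + h22(x, y, z) + h23(x, y, z)) / 4
            - h27(x, y, z) / 8)
```

Each function equals one at its own node and zero at the other seven nodes listed, which is the property required of interpolation functions.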


Problem 18.15. Derive the shape functions for triangular prisms as shown. 
The heavy dots are nodes of the element. The prism has two triangular and three 
rectangular surfaces. 



Fig. P18.15. Triangular prism: (a) 6-node prism and (b) 15-node prism.


Problem 18.16. Derive the shape functions for a quartic tetrahedral ele¬ 
ment. Show that an interior node inside the tetrahedron is needed to achieve 
complete quartic polynomial for the shape functions. 


18.16. THREE-DIMENSIONAL ELASTICITY 

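We shall express the field variables of three-dimensional elasticity in
matrix form

(1) $\mathbf{u}^T = [u\ \ v\ \ w]$,

(2) $\boldsymbol{\sigma}^T = [\sigma_{xx}\ \ \sigma_{yy}\ \ \sigma_{zz}\ \ \sigma_{xy}\ \ \sigma_{yz}\ \ \sigma_{zx}]$,

(3) $\mathbf{e}^T = [e_{xx}\ \ e_{yy}\ \ e_{zz}\ \ \gamma_{xy}\ \ \gamma_{yz}\ \ \gamma_{zx}]$,

where $\mathbf{u}$ is the displacement matrix, $\boldsymbol{\sigma}$ the stress matrix (not tensor), and
$\mathbf{e}$ the strain matrix (not tensor) in which $\gamma_{xy}$, $\gamma_{yz}$, $\gamma_{zx}$ are the engineering
shear strains. The stresses and strains are related by the constitutive law

(4) $\boldsymbol{\sigma} = \mathbf{D}_e\mathbf{e}$,

where

$$\mathbf{D}_e = \begin{bmatrix} D_{11} & & & \text{sym} \\ D_{12} & D_{22} & & \\ \vdots & \vdots & \ddots & \\ D_{16} & D_{26} & \cdots & D_{66} \end{bmatrix}$$

is the elastic modulus matrix. The components of $\mathbf{D}_e$ are related to the
coefficients of Hooke's law ($\sigma_{ij} = C_{ijkl}e_{kl}$) by the following equations:

(4a) $D_{ij} = D_{ji} = C_{iijj}$, $\quad D_{i,j+3} = D_{j+3,i} = C_{iijm(j)}$, $\quad D_{i+3,j+3} = C_{im(i)jm(j)}$,

where $i, j = 1, 2, 3$ and repeated indices are not summed, and $m(j) = j + 1$
for $j = 1, 2$; $m(j) = 1$ for $j = 3$. The use of engineering shear strains in $\mathbf{e}$
makes $\mathbf{D}_e$ symmetric. In general, it will not be symmetric if the tensorial
shear strains are used. For isotropic materials,

$D_{11} = D_{22} = D_{33} = \dfrac{E(1 - \nu)}{(1 + \nu)(1 - 2\nu)}$,

$D_{44} = D_{55} = D_{66} = \dfrac{E}{2(1 + \nu)}$,

$D_{12} = D_{21} = D_{31} = D_{13} = D_{32} = D_{23} = \dfrac{E\nu}{(1 + \nu)(1 - 2\nu)}$,

while all unlisted $D$'s are zero. The strain can be expressed in terms of the
displacement as

(5) $\mathbf{e} = \mathbf{d}_e\mathbf{u}$,

where $\mathbf{d}_e$ is a matrix differential operator whose transpose is

(6) $$\mathbf{d}_e^T = \begin{bmatrix}
\dfrac{\partial}{\partial x} & 0 & 0 & \dfrac{\partial}{\partial y} & 0 & \dfrac{\partial}{\partial z} \\
0 & \dfrac{\partial}{\partial y} & 0 & \dfrac{\partial}{\partial x} & \dfrac{\partial}{\partial z} & 0 \\
0 & 0 & \dfrac{\partial}{\partial z} & 0 & \dfrac{\partial}{\partial y} & \dfrac{\partial}{\partial x}
\end{bmatrix}.$$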
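The isotropic coefficients listed above can be assembled into the $6 \times 6$ matrix and checked against a uniaxial stress state. The sketch below is a minimal illustration; the function names `isotropic_D` and `stress` are assumptions, and the ordering of the components follows Eqs. (2) and (3).

```python
def isotropic_D(E, nu):
    """6 x 6 elastic modulus matrix for an isotropic material."""
    c = E / ((1.0 + nu) * (1.0 - 2.0 * nu))
    G = E / (2.0 * (1.0 + nu))
    D = [[0.0] * 6 for _ in range(6)]
    for i in range(3):
        for j in range(3):
            D[i][j] = c * (1.0 - nu) if i == j else c * nu
        D[i + 3][i + 3] = G          # D44 = D55 = D66
    return D


def stress(D, e):
    """sigma = D_e e with e = [e_xx, e_yy, e_zz, gamma_xy, gamma_yz, gamma_zx]."""
    return [sum(dij * ej for dij, ej in zip(row, e)) for row in D]


# Uniaxial stress s along x: strains s/E, -nu s/E, -nu s/E and no shear.
E, nu, s = 200.0, 0.25, 3.0
sigma = stress(isotropic_D(E, nu), [s / E, -nu * s / E, -nu * s / E, 0.0, 0.0, 0.0])
```

The computed stress should have $\sigma_{xx} = s$ with all other components zero, which confirms that the listed coefficients are consistent with the definitions of $E$ and $\nu$.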

Consider a problem over a domain $V$ with prescribed boundary conditions

(7) $\mathbf{u} = \bar{\mathbf{u}}$ on $\partial V_u$,

(8) $\mathbf{T} = \bar{\mathbf{T}}$ on $\partial V_\sigma$,

where $\partial V_u + \partial V_\sigma = \partial V$ is the boundary of $V$ and $\mathbf{T}$ is the boundary traction
with components

$T_1 = n_1\sigma_{xx} + n_2\sigma_{xy} + n_3\sigma_{xz}$,

$T_2 = n_1\sigma_{xy} + n_2\sigma_{yy} + n_3\sigma_{yz}$,

$T_3 = n_1\sigma_{xz} + n_2\sigma_{yz} + n_3\sigma_{zz}$,

in which $\mathbf{n}$ is a unit normal to $\partial V_\sigma$. The potential energy for an element is

(9) $\Pi_e = \int_{V_e} \left( \tfrac{1}{2}\mathbf{e}^T\mathbf{D}_e\mathbf{e} - \mathbf{u}^T\mathbf{b} \right) dV - \int_{\partial V_{\sigma e}} \mathbf{u}^T\bar{\mathbf{T}}\, dS$,

where $V_e$ is the volume of the element and $\mathbf{b}$ is the distributed body force
matrix. Here, $\mathbf{b}$ has three components as opposed to two in Eq. (18.14:6)
for plane elasticity. If there are mixed boundary conditions, i.e., a
combination of components of the displacement and traction are prescribed, terms
similar to those included in Eq. (18.14:6a) must be added to Eq. (9) to
account for the prescribed traction component(s). The admissibility
conditions are that $\mathbf{u}$ is $C^0$ continuous and $\mathbf{u} = \bar{\mathbf{u}}$ on $\partial V_{ue}$. The first variation
of the potential energy with respect to the field variable $\mathbf{u}$ is

(10) $\delta\Pi_e = \int_{V_e} (\delta\mathbf{e}^T\mathbf{D}_e\mathbf{e} - \delta\mathbf{u}^T\mathbf{b})\, dV - \int_{\partial V_{\sigma e}} \delta\mathbf{u}^T\bar{\mathbf{T}}\, dS$

$\quad\ = \int_{V_e} [(\mathbf{d}_e\delta\mathbf{u})^T\mathbf{D}_e(\mathbf{d}_e\mathbf{u}) - \delta\mathbf{u}^T\mathbf{b}]\, dV - \int_{\partial V_{\sigma e}} \delta\mathbf{u}^T\bar{\mathbf{T}}\, dS$.

In integral formulation, the sum of the first variation $\delta\Pi_e$ of all elements is
required to be zero

$\sum_{\text{all elements}} \delta\Pi_e = 0$

for all admissible $\delta\mathbf{u}$. The prescribed displacement Eq. (7) is a rigid
condition, which must be enforced, i.e., the components of $\delta\mathbf{u}$ corresponding to
rigid constraints are set to zero. The prescribed traction Eq. (8) is a natural
condition, which is accounted for automatically by the integral term over
$\partial V_{\sigma e}$.

The procedure for establishing the finite element equations using
integral formulation is identical to that for plane elasticity with the
two-dimensional quantities being replaced by the corresponding
three-dimensional ones. Consider an element with $p$ nodes at $x_j$, $y_j$, $z_j$, $j = 1, \ldots, p$.
Let the nodal values of $u$, $v$ and $w$ be

$q_{3j-2} = u(x_j, y_j, z_j)$, $\quad q_{3j-1} = v(x_j, y_j, z_j)$, $\quad q_{3j} = w(x_j, y_j, z_j)$,
where the $q$'s are the generalized coordinates. Each node now has three
degrees-of-freedom as opposed to one or two considered earlier. We interpolate the
field variables as

(11) $u = \sum_{j=1}^{p} h_j(x, y, z)\, q_{3j-2}$, $\quad v = \sum_{j=1}^{p} h_j(x, y, z)\, q_{3j-1}$, $\quad w = \sum_{j=1}^{p} h_j(x, y, z)\, q_{3j}$,

or in the matrix form

(12) $\mathbf{u} = \mathbf{h}\mathbf{q}$, $\quad \mathbf{e} = \mathbf{d}_e\mathbf{u} = \mathbf{d}_e\mathbf{h}\mathbf{q} = \mathbf{B}\mathbf{q}$,

where $\mathbf{h}$ is the $3 \times 3p$ interpolation matrix and $\mathbf{q}$ is the nodal displacement
matrix of $3p$ components

(13) $$\underset{3 \times 3p}{\mathbf{h}} = \begin{bmatrix} h_1 & 0 & 0 & \cdots & h_p & 0 & 0 \\ 0 & h_1 & 0 & \cdots & 0 & h_p & 0 \\ 0 & 0 & h_1 & \cdots & 0 & 0 & h_p \end{bmatrix}, \qquad
\underset{1 \times 3p}{\mathbf{q}^T} = [q_1\ \ q_2\ \ q_3\ \ \cdots\ \ q_{3p-1}\ \ q_{3p}].$$

A substitution of Eq. (12) into Eq. (10) yields the standard equation for
the element

(14) $\delta\Pi_e = \delta\mathbf{q}^T(\mathbf{k}\mathbf{q} - \mathbf{f})$,

where

(15) $\mathbf{k} = \int_{V_e} \mathbf{B}^T\mathbf{D}_e\mathbf{B}\, dV$,

(16) $\mathbf{f} = \int_{V_e} \mathbf{h}^T\mathbf{b}\, dV + \int_{\partial V_{\sigma e}} \mathbf{h}^T\bar{\mathbf{T}}\, dS$.

These equations are similar in form to Eqs. (18.14:13) and (18.14:14)
except that $\mathbf{D}_e$, $\mathbf{d}_e$, $\mathbf{B}$ and $\mathbf{h}$ are defined in Eqs. (4), (6), (12) and (13),
respectively, for a 3-dimensional body. The $h$'s of a four-node tetrahedron
are linear functions of $x$, $y$, $z$, and the corresponding strain $\mathbf{e}\ (= \mathbf{B}\mathbf{q})$ is
constant. Thus, a four-node tetrahedral element is called a constant strain
tetrahedron. A 10-node tetrahedral element has quadratic displacements
and linear strains and is, therefore, called a linear strain tetrahedron. The
rank of $\mathbf{k}$ is $3p - 6$ if the shape functions are capable of representing all 6
rigid body modes. Approximate integration of Eq. (15) can lower the rank
of $\mathbf{k}$ and introduce spurious deformation mode(s).
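For the constant strain tetrahedron the integrand of Eq. (15) is constant, so $\mathbf{k} = V_e\mathbf{B}^T\mathbf{D}_e\mathbf{B}$. The sketch below builds this matrix and tests it against the six rigid body modes. The function names, the way the shape function gradients are obtained (by inverting the $4 \times 4$ matrix of nodal values of $1, x, y, z$), and the sample data are assumptions made for illustration.

```python
def solve(a, b):
    """Gauss-Jordan elimination with partial pivoting; b is a matrix."""
    n = len(a)
    m = [list(a[i]) + list(b[i]) for i in range(n)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(m[r][c]))
        m[c], m[p] = m[p], m[c]
        piv = m[c][c]
        m[c] = [v / piv for v in m[c]]
        for r in range(n):
            if r != c:
                fac = m[r][c]
                m[r] = [vr - fac * vc for vr, vc in zip(m[r], m[c])]
    return [row[n:] for row in m]


def matmul(a, b):
    return [[sum(a[i][m] * b[m][j] for m in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def isotropic_D(E, nu):
    c = E / ((1.0 + nu) * (1.0 - 2.0 * nu))
    D = [[0.0] * 6 for _ in range(6)]
    for i in range(3):
        for j in range(3):
            D[i][j] = c * (1.0 - nu) if i == j else c * nu
        D[i + 3][i + 3] = E / (2.0 * (1.0 + nu))
    return D


def tet4_stiffness(nodes, D):
    """12 x 12 stiffness k = V * B^T D B of a 4-node tetrahedron."""
    M = [[1.0, x, y, z] for (x, y, z) in nodes]
    eye = [[1.0 if i == j else 0.0 for j in range(4)] for i in range(4)]
    C = solve(M, eye)    # column j holds the coefficients of the linear h_j
    B = [[0.0] * 12 for _ in range(6)]
    for j in range(4):
        hx, hy, hz = C[1][j], C[2][j], C[3][j]
        B[0][3 * j], B[1][3 * j + 1], B[2][3 * j + 2] = hx, hy, hz
        B[3][3 * j], B[3][3 * j + 1] = hy, hx          # gamma_xy
        B[4][3 * j + 1], B[4][3 * j + 2] = hz, hy      # gamma_yz
        B[5][3 * j], B[5][3 * j + 2] = hz, hx          # gamma_zx
    p1 = nodes[0]
    e = [[p[i] - p1[i] for i in range(3)] for p in nodes[1:]]
    vol = abs(e[0][0] * (e[1][1] * e[2][2] - e[1][2] * e[2][1])
              - e[0][1] * (e[1][0] * e[2][2] - e[1][2] * e[2][0])
              + e[0][2] * (e[1][0] * e[2][1] - e[1][1] * e[2][0])) / 6.0
    Bt = [list(col) for col in zip(*B)]
    return [[vol * v for v in row] for row in matmul(matmul(Bt, D), B)]


def rigid_body_modes(nodes):
    """Three translations and three infinitesimal rotations as nodal vectors."""
    modes = [[1.0 if i == d else 0.0 for _ in nodes for i in range(3)]
             for d in range(3)]
    modes.append([v for (x, y, z) in nodes for v in (-y, x, 0.0)])
    modes.append([v for (x, y, z) in nodes for v in (0.0, -z, y)])
    modes.append([v for (x, y, z) in nodes for v in (z, 0.0, -x)])
    return modes
```

All six modes should produce zero nodal forces, which is consistent with the rank $3p - 6 = 6$ stated above for $p = 4$.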

For an element with curved surfaces, we introduce the transformation 

all element nodes 

y = yj fi j(^ r hO . 

all element nodes 

* = z i h Mi r hQ 

all element nodes 

to map a tetrahedral element V e into a right angle tetrahedron in the first 
quadrant with 0 < £, 77 , £ < 1 and a hexahedral element into a cube with 
~1 < £,r],( < 1. The element matrices can be written in the similar form 
as Eqs. (18.14:15) and (18.14:16) 

(17) k = f B T DBJd£ drfdC ,, 

Jv t 


( 18 ) f = / h T b Jd£dr]d(+ [ h r T dS, 

JV' Jov„' 

in terms of the natural coordinates £, 77 , £ where 

B = d e h 

with the components of d e [Eq. (18.6:6)] calculated from 

0 d£ d dr] d d<; d 

da da d£ **" da dr] da dr,' a x '^' z ' 

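Since the transformation $\xi = \xi(x,y,z)$, $\eta = \eta(x,y,z)$ and $\zeta = \zeta(x,y,z)$ is
generally not available, we have to evaluate $\partial\xi/\partial x$, etc., using the following
relations:

(19) $\mathbf{J} = \begin{bmatrix} \dfrac{\partial x}{\partial \xi} & \dfrac{\partial y}{\partial \xi} & \dfrac{\partial z}{\partial \xi} \\ \dfrac{\partial x}{\partial \eta} & \dfrac{\partial y}{\partial \eta} & \dfrac{\partial z}{\partial \eta} \\ \dfrac{\partial x}{\partial \zeta} & \dfrac{\partial y}{\partial \zeta} & \dfrac{\partial z}{\partial \zeta} \end{bmatrix}, \qquad \mathbf{J}^{-1} = \begin{bmatrix} \dfrac{\partial \xi}{\partial x} & \dfrac{\partial \eta}{\partial x} & \dfrac{\partial \zeta}{\partial x} \\ \dfrac{\partial \xi}{\partial y} & \dfrac{\partial \eta}{\partial y} & \dfrac{\partial \zeta}{\partial y} \\ \dfrac{\partial \xi}{\partial z} & \dfrac{\partial \eta}{\partial z} & \dfrac{\partial \zeta}{\partial z} \end{bmatrix}, \qquad J = |\mathbf{J}|.$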
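As a small illustration of Eq. (19), the following sketch evaluates the Jacobian matrix and its determinant for a straight-sided four-node tetrahedron. It assumes linear shape functions equal to the volume coordinates, with node 4 placed at the origin of the natural coordinates; the function name `jacobian_tet4` is hypothetical and chosen only for this example.

```python
def jacobian_tet4(nodes):
    """Return (J, det J) for a straight-sided 4-node tetrahedron.

    nodes: four (x, y, z) tuples. Nodes 1-3 correspond to xi = 1,
    eta = 1, zeta = 1 and node 4 to the origin of natural coordinates.
    """
    (x1, y1, z1), (x2, y2, z2), (x3, y3, z3), (x4, y4, z4) = nodes
    # Row i holds d(x, y, z)/d(natural coordinate i); for linear shape
    # functions these are coordinate differences relative to node 4.
    J = [
        [x1 - x4, y1 - y4, z1 - z4],
        [x2 - x4, y2 - y4, z2 - z4],
        [x3 - x4, y3 - y4, z3 - z4],
    ]
    det = (J[0][0] * (J[1][1] * J[2][2] - J[1][2] * J[2][1])
           - J[0][1] * (J[1][0] * J[2][2] - J[1][2] * J[2][0])
           + J[0][2] * (J[1][0] * J[2][1] - J[1][1] * J[2][0]))
    return J, det


def tet4_volume(nodes):
    """Element volume: det J times the reference volume 1/6."""
    return jacobian_tet4(nodes)[1] / 6.0
```

For such an element J is constant, so the volume integral in Eq. (17) reduces to J times an integral over the reference tetrahedron.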


The integration over $\partial V_{\sigma e}$ becomes integration over plane surface(s)
in the transformed coordinates, each of which is a right isosceles triangle of area
1/2 for tetrahedral elements or a square of area 4 for cubic elements.
If the surface coordinates of the boundary are $\xi_1, \xi_2$, which are two of the
three natural coordinates $\xi, \eta, \zeta$, then

$dS = \left[ \frac{\partial x_k}{\partial \xi_1}\frac{\partial x_k}{\partial \xi_1}\, \frac{\partial x_l}{\partial \xi_2}\frac{\partial x_l}{\partial \xi_2} - \left( \frac{\partial x_k}{\partial \xi_1}\frac{\partial x_k}{\partial \xi_2} \right)^2 \right]^{1/2} d\xi_1\, d\xi_2,$

where $x = x_1$, $y = x_2$, $z = x_3$ and repeated indices denote summation from
1 to 3. For a boundary surface with the normal in the $\zeta$-direction, $\xi_1, \xi_2$
are $\xi, \eta$, and so on.

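For tetrahedral elements, the shape functions are actually in terms of
the volume coordinates $\zeta_1, \zeta_2, \zeta_3, \zeta_4$. We have

$\xi = \zeta_1, \qquad \eta = \zeta_2, \qquad \zeta = \zeta_3, \qquad \zeta_4 = 1 - \zeta_1 - \zeta_2 - \zeta_3.$

Differentiations with respect to the coordinates of the two systems are
related by

$\frac{\partial}{\partial \xi} = \frac{\partial}{\partial \zeta_1} - \frac{\partial}{\partial \zeta_4}, \qquad \frac{\partial}{\partial \eta} = \frac{\partial}{\partial \zeta_2} - \frac{\partial}{\partial \zeta_4}, \qquad \frac{\partial}{\partial \zeta} = \frac{\partial}{\partial \zeta_3} - \frac{\partial}{\partial \zeta_4}.$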

We see the similarity among the element matrices for one-, two- and
three-dimensional problems. For an n-dimensional continuum, the evaluation
of the element stiffness matrix and the portion of the element force matrix due
to the distributed body force requires n-dimensional integration. The
evaluation of the element force matrix associated with the applied boundary
loads requires (n − 1)-dimensional integration. The latter element force is
a quantity evaluated at one or more boundary points for one-dimensional
problems, while it involves line and surface integrals, respectively, for two-
and three-dimensional problems. The finite element formulation for
elasticity discussed so far involves only displacements as field variables and is
thus called the displacement model. The number of degrees-of-freedom per
node in the displacement model is normally n for n-dimensional elasticity.

The finite element method permits a systematic process for selecting
interpolation functions, constructing element matrices, assembling them into
a system of simultaneous algebraic equations and determining the solution.
Interpolation functions, the assembling process, and solution methods for
a large system of equations are usually generic, i.e., they can be applied
to almost any type of problem. The element stiffness matrix depends on the
problem type rather than the specifics of the problem. The element applied
force matrix depends on the distributed loads, but the process of determining
it is again generic. For example, we have derived element stiffness
matrices for beams, plane problems, 3-dimensional solids, etc. We can use
the element stiffness matrices for plane elasticity to solve a plane problem
with a square hole or with a circular hole. The nature of the problem is
reflected in the geometry (element sizes and shapes), the rigid constraints
on the boundaries, and the value of the distributed load. These inherent
generic characteristics of the finite element method make it easy to apply
to almost any problem.

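In integral formulation, the sum of the variations $\delta\Pi_e$ of all elements
is zero,

(20) $\sum_{\text{all elements}} \delta\Pi_e = \sum_{\text{all elements}} \left[ \int_{V_e} (\delta\boldsymbol{\varepsilon}^T \mathbf{D}_e \boldsymbol{\varepsilon} - \delta\mathbf{u}^T \mathbf{b})\, dV - \int_{\partial V_{\sigma e}} \delta\mathbf{u}^T \mathbf{T}\, dS \right] = \sum_{\text{all elements}} \delta\mathbf{q}^T (\mathbf{k}\mathbf{q} - \mathbf{f}) = 0,$

with the components of $\delta\mathbf{q}$ corresponding to rigid constraints being zero.
We denote symbolically the summation as

(21) $\mathbf{K}\mathbf{q} - \mathbf{F} = \int_V \mathbf{B}^T \mathbf{D}_e \mathbf{B}\mathbf{q}\, dV - \left( \int_V \mathbf{h}^T \mathbf{b}\, dV + \int_{\partial V_\sigma} \mathbf{h}^T \mathbf{T}\, dS \right) = 0,$

which is the same as Eq. (18.14:32) except that the area and line integrals
are replaced by volume and surface integrals, respectively. Since $\boldsymbol{\sigma} = \mathbf{D}_e \boldsymbol{\varepsilon} = \mathbf{D}_e \mathbf{B}\mathbf{q}$, Eq. (18.16:21) can be written as

(22) $\int_V \mathbf{B}^T \boldsymbol{\sigma}\, dV - \mathbf{F} = 0.$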

18.17. DYNAMIC PROBLEMS OF ELASTIC SOLIDS 

The finite-element formulation given in this section can be generalized
to time-dependent problems involving the dynamic behavior of structures. In
such problems the finite-element idealization leads to a set of simultaneous
dynamic equations of the form

(1) $\mathbf{M}\ddot{\mathbf{u}} + \mathbf{C}\dot{\mathbf{u}} + \mathbf{K}\mathbf{u} = \mathbf{F},$

where u represents the matrix of generalized coordinates or unknown
parameters for the entire structure and dot(s) over a character denote
differentiation with respect to time t. The matrices M and C are the assembled
mass and damping matrices, respectively. For time-independent problems
these matrices do not appear because $\ddot{\mathbf{u}} = \dot{\mathbf{u}} = 0$, and Eq. (1) reduces to the
equation governing the static elastic response of the structure, where K is
the familiar constrained assembled stiffness matrix introduced earlier. As
will be seen, the matrix M is symmetric and positive definite, and when
linear viscous damping prevails, the matrix C is also symmetric. The
dynamic problem now becomes an initial-value problem of finding a column
matrix u = u(t) satisfying Eq. (1) and the given initial conditions:

$\mathbf{u}(0) = \mathbf{u}_0, \qquad \dot{\mathbf{u}}(0) = \mathbf{v}_0.$

A variational formulation can be used in conjunction with the finite
element discretization to derive the appropriate mass and damping matrices.
The process is an application of Hamilton's principle to a dynamic system.
We consider the integral

(2) $H = \int_{t_1}^{t_2} (U - W - T)\, dt,$

where T is the kinetic energy, U the strain energy, and W the work of the
applied loads. Hamilton's principle may be written as

(3) $\delta H = 0,$

and stated as follows: among all possible time histories of displacement
configurations that satisfy compatibility and rigid boundary conditions and the
conditions at times $t_1$ and $t_2$, the actual solution makes H a stationary value.

The following simple example can illustrate these ideas further. Consider
the spatially one-dimensional dynamic problem of the axial displacement
u(x,t) of a rod with prescribed loads $F_1$, $F_2$ at both ends. The
governing equation is

(4) 

where $\rho A$ is the mass per unit length, p is an applied distributed axial load,
and AE is the axial stiffness of the rod. All these quantities can be functions
of x and t. Also assume that $u(x, t_1)$ and $u(x, t_2)$ are prescribed values of u at
$t_1$ and $t_2$. Let the rod be divided into N elements. For the eth element
between points $x_e$ and $x_{e+1}$, the nodal values at these points are denoted
by $u(x_e, t) = q_e$. An integral formulation of Eq. (4), which can also be
identified as Hamilton's principle, is expressed by $\delta H = 0$ with respect
to u, where

(5) $H = \int_{t_1}^{t_2} \left\{ \sum_{e=1}^{N} \int_{x_e}^{x_{e+1}} \left[ \frac{1}{2} AE \left( \frac{\partial u}{\partial x} \right)^2 - p(x,t)\, u - \frac{1}{2} \rho(x,t) A\, \dot{u}^2 \right] dx + F_1(t)\, u|_{x_1} - F_2(t)\, u|_{x_{N+1}} \right\} dt.$

In most elasticity applications, the density $\rho A$ and the equivalent modulus
AE are independent of time. However, this is not true for biological or
living materials, where the time variation of the material plays an essential
role in influencing the behavior of the system.

We assume u in the form

(6) $u = \mathbf{h}(x)\, \mathbf{q}_e(t), \qquad \delta u = \mathbf{h}(x)\, \delta\mathbf{q}_e(t),$

where h(x) is the shape function matrix, which is the same as that of the
static problems, and $\mathbf{q}_e$ is the generalized-coordinate matrix for the eth
element. The assumption of a field variable in the form of Eq. (6) is most
fundamental in the finite element approach to dynamic problems. It
separates the field variable into the product of a space-like function h(x) and a
time-like function $\mathbf{q}_e(t)$, where h(x) has exactly the same form as in static
problems. This allows us to construct the element stiffness and force
matrices in the same manner discussed before. We then have to construct, in
addition to the stiffness matrix, only the mass and damping matrices for
dynamic applications.

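We can write H in the form

$H = \int_{t_1}^{t_2} \left\{ \sum_{e} \left[ \frac{1}{2} \mathbf{q}_e^T \mathbf{k}_e(t)\, \mathbf{q}_e - \mathbf{q}_e^T \mathbf{f}_e(t) - \frac{1}{2} \dot{\mathbf{q}}_e^T \mathbf{m}_e(t)\, \dot{\mathbf{q}}_e \right] + F_1(t)\, q_1 - F_2(t)\, q_{N+1} \right\} dt,$

where $\mathbf{k}_e$ and $\mathbf{f}_e$ are the element stiffness matrix and the element force
matrix, which are derived in the same manner as before with the exception
that $\mathbf{k}_e$ and $\mathbf{f}_e$ can be functions of time for dynamic problems. For this
example, $\mathbf{m}_e$ is the element-mass matrix defined by

(7) $\int_{x_e}^{x_{e+1}} \frac{1}{2} \rho A(x,t)\, \dot{u}^2\, dx = \frac{1}{2} \dot{\mathbf{q}}_e^T \mathbf{m}_e \dot{\mathbf{q}}_e.$

After all elements are summed, we have

(8) $H = \int_{t_1}^{t_2} \left( \frac{1}{2} \mathbf{u}^T \mathbf{K} \mathbf{u} - \mathbf{u}^T \mathbf{F} - \frac{1}{2} \dot{\mathbf{u}}^T \mathbf{M} \dot{\mathbf{u}} + F_1 q_1 - F_2 q_{N+1} \right) dt,$

where u is the nodal parameter matrix of u. Setting the first variation $\delta H$
with respect to the u's to zero, integrating by parts, and noting that $\delta\mathbf{u}(t_1) = \delta\mathbf{u}(t_2) = 0$, we obtain the Euler equations,$^1$

$\mathbf{M}\ddot{\mathbf{u}} + \mathbf{K}\mathbf{u} = \mathbf{F},$

for the dynamic system. The matrices K and F are from the strain energy
and work terms, respectively, in Eq. (5), and M results from the
consideration of the kinetic energy.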

When the interpolation functions used in Eq. (7) are the same as those
used to obtain $\mathbf{k}_e$, the mass matrix is referred to as a consistent mass
matrix. A consistent element-mass matrix is usually fully populated. In
practice, diagonal or lumped mass matrices are often employed because of their
general economy, especially in explicit time-integration methods. As
will be seen later in this section, each approach has certain advantages and
disadvantages with regard to computer implementation.

The lumped mass formulation assumes that a certain amount of structural
mass surrounding a given node is concentrated or lumped at that node.
There are many ways to construct lumped mass matrices (Tong et al. 1971,
Hughes et al. 1976, Fried and Malkus 1976, Hinton et al. 1976). However,
most techniques are ad hoc. One approach is to integrate the element
mass matrix using an integration formula with integration points at nodes
only. This effectively diagonalizes the mass matrix. Another technique is
to sum the quantities of each row of the consistent matrix to the diagonal
if all generalized coordinates have the same physical dimension. Hinton
et al. proposed setting the entries of the lumped mass matrix proportional
to the diagonal entries of the consistent mass matrix. Goudreau (1970) showed that averaging the
consistent and lumped mass matrices for the one-dimensional second order
equation improves the rate of convergence of the finite element method.
No general theory of obtaining higher-order accurate mass matrices has yet
been established.

$^1$ If the displacements are prescribed at the ends, the $\delta q$'s corresponding to the prescribed
values of u are zero to fulfill the rigid constraint conditions. One must then use the
constrained M and K in the Euler equations.

To continue with the example of Eq. (4), we can use the same linear
interpolation as before at the element level:

$u = q_1 (1 - \xi)/2 + q_2 (1 + \xi)/2,$

where $\xi = (2x - x_e - x_{e+1})/\epsilon$, $\epsilon = x_{e+1} - x_e$, and the q's are the nodal
values of u at the first and second nodes. To derive a consistent mass matrix
for constant density over an element from Eq. (7), we integrate

$\rho A \int \dot{u}^2\, dx = \rho A \int [\dot{q}_1 (1 - \xi)/2 + \dot{q}_2 (1 + \xi)/2]^2\, dx = \dot{\mathbf{q}}_e^T \mathbf{m}_e \dot{\mathbf{q}}_e,$

where the subscript e denotes values associated with the eth element and $\mathbf{m}_e$ is
the element mass matrix

(9) $\mathbf{m}_e = \frac{\rho A \epsilon}{3} \begin{bmatrix} 1 & 1/2 \\ 1/2 & 1 \end{bmatrix}.$

In the lumped mass approach, we lump half of the element mass to each
node. The matrix becomes

(10) $\mathbf{m}_e = \frac{\rho A \epsilon}{2} \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}.$

The consistent mass matrices for beam and triangular plane-stress elements
can be derived in similar fashion. For a beam [Eq. (18.3:1)], we approximate
the deflection in terms of u (= $q_1$) and du/dx (= $q_2$) at $\xi = -1$ and the
corresponding values $q_3$, $q_4$ at $\xi = 1$. We can then employ the same cubic
interpolation, Eq. (18.3:6), used to derive the element-stiffness matrix k to
calculate the kinetic energy. The consistent element mass matrix is given by

(11) $\mathbf{m}_e = \frac{\rho A \epsilon}{420} \begin{bmatrix} 156 & & & \text{sym} \\ 22\epsilon & 4\epsilon^2 & & \\ 54 & 13\epsilon & 156 & \\ -13\epsilon & -3\epsilon^2 & -22\epsilon & 4\epsilon^2 \end{bmatrix}$

for an element having a uniform mass distribution, where again $\rho A$ is the mass
per unit length. The corresponding lumped mass matrix can be obtained
by lumping half the beam element mass and rotary inertia onto each node,

(12) $\mathbf{m}_e = \frac{\rho A \epsilon}{2} \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & \epsilon^2/12 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & \epsilon^2/12 \end{bmatrix}.$

The rotary inertia is associated with the angular deflections du/dx at the
nodes. Lumping the rotary inertia at a node as $\epsilon^2/12$ is ad hoc at best.
One can even neglect it completely in Eq. (12). In this case, the mass
matrix becomes singular. This makes the numerical integration of Eq. (1)
by an explicit integration scheme more cumbersome.
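A quick consistency check of Eqs. (11) and (12) is to evaluate the quadratic form $\mathbf{q}^T \mathbf{m}_e \mathbf{q}$ for rigid-body motions. The sketch below assumes integer `rho_A` and element length `e` so that exact fractions can be used; the helper names are hypothetical.

```python
from fractions import Fraction


def beam_mass_matrices(rho_A, e):
    """Consistent (Eq. 11) and lumped (Eq. 12) beam element mass matrices."""
    c = Fraction(rho_A * e, 420)
    consistent = [[c * v for v in row] for row in (
        (156,     22 * e,     54,      -13 * e),
        (22 * e,  4 * e * e,  13 * e,  -3 * e * e),
        (54,      13 * e,     156,     -22 * e),
        (-13 * e, -3 * e * e, -22 * e, 4 * e * e),
    )]
    half = Fraction(rho_A * e, 2)
    diag = (1, Fraction(e * e, 12), 1, Fraction(e * e, 12))
    lumped = [[half * diag[i] if i == j else 0 for j in range(4)]
              for i in range(4)]
    return consistent, lumped


def quad_form(m, q):
    """Return q^T m q for a 4 x 4 matrix m."""
    return sum(q[i] * m[i][j] * q[j] for i in range(4) for j in range(4))
```

For a rigid translation q = (1, 0, 1, 0) both matrices return the total element mass $\rho A \epsilon$. For a rigid rotation about the mid-point, q = ($-\epsilon/2$, 1, $\epsilon/2$, 1), the consistent matrix returns the exact moment of inertia $\rho A \epsilon^3/12$, while the lumped matrix does not, which illustrates the ad hoc nature of the lumping.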

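For the triangular elements in plane elasticity, we use the same
interpolation functions as those in Sec. 18.9 for evaluating the element-stiffness
matrix, i.e.,

$\mathbf{u} = \mathbf{h}\mathbf{q},$

where the h's are defined in Eqs. (18.9:4) and (18.9:10). The expression for
the kinetic energy of an element of thickness d, area $A_e$, and mass per unit
volume $\rho$ is

(13) $\frac{1}{2} \int_{A_e} \rho d\, \dot{\mathbf{u}}^T \dot{\mathbf{u}}\, dA = \frac{1}{2} \dot{\mathbf{q}}_e^T \left( \int_{A_e} \rho d\, \mathbf{h}^T \mathbf{h}\, dA \right) \dot{\mathbf{q}}_e = \frac{1}{2} \dot{\mathbf{q}}_e^T \mathbf{m}_e \dot{\mathbf{q}}_e.$

Thus, the consistent mass matrix is given by

(14) $\mathbf{m}_e = \frac{\rho d A_e}{12} \begin{bmatrix} 2 & & & & & \\ 0 & 2 & & & \text{sym} & \\ 1 & 0 & 2 & & & \\ 0 & 1 & 0 & 2 & & \\ 1 & 0 & 1 & 0 & 2 & \\ 0 & 1 & 0 & 1 & 0 & 2 \end{bmatrix}$

for an element of uniform mass and thickness. One defines the corresponding
lumped mass matrix by lumping one-third of the element mass onto
each node,

(15) $\mathbf{m}_e = \frac{\rho d A_e}{3} \begin{bmatrix} 1 & & & & & \\ 0 & 1 & & & \text{sym} & \\ 0 & 0 & 1 & & & \\ 0 & 0 & 0 & 1 & & \\ 0 & 0 & 0 & 0 & 1 & \\ 0 & 0 & 0 & 0 & 0 & 1 \end{bmatrix}.$

Note that the total mass of the element is equally divided among the
nodes.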
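The entries of the triangular-element mass matrices can be reproduced numerically. The sketch below assumes a constant-strain triangle whose shape functions are the area coordinates, a nodal ordering $(u_1, v_1, u_2, v_2, u_3, v_3)$, and the three-point mid-side integration rule, which is exact for the quadratic integrands involved. The function name and argument names are hypothetical.

```python
from fractions import Fraction

# Mid-side integration points in area coordinates, each with weight 1/3
MIDPOINTS = [
    (Fraction(1, 2), Fraction(1, 2), 0),
    (0, Fraction(1, 2), Fraction(1, 2)),
    (Fraction(1, 2), 0, Fraction(1, 2)),
]


def cst_mass_matrices(rho, d, area):
    """Consistent and lumped 6 x 6 mass matrices of a 3-node triangle."""
    total = Fraction(rho * d * area)  # total element mass
    # s[i][j] = rho * d * integral of (zeta_i * zeta_j) over the element
    s = [[total * sum(Fraction(1, 3) * p[i] * p[j] for p in MIDPOINTS)
          for j in range(3)] for i in range(3)]
    # u- and v-components do not couple, hence the zero entries
    consistent = [[s[r // 2][c // 2] if r % 2 == c % 2 else 0
                   for c in range(6)] for r in range(6)]
    lumped = [[total / 3 if r == c else 0 for c in range(6)]
              for r in range(6)]
    return consistent, lumped
```

With `rho * d * area = 12` the consistent matrix has exactly the integer pattern of Eq. (14), and each of its row sums equals the corresponding diagonal entry of the lumped matrix.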

In Eqs. (13)-(15), the mass matrix is based on the total mass of the
element. In Eqs. (18.14:13) and (18.14:14), the element stiffness and loading
matrices are values per unit thickness. For plane stress, before assembling
them for dynamic study, if the thickness is uniform, we simply multiply
those matrices by the element thickness. If the thickness varies over the
element, we must include the thickness as a multiplication factor in the
elastic modulus matrix $\mathbf{D}_e$ in Eq. (18.14:13) or (18.14:15), the body force b,
and the boundary traction T as in Eq. (18.14:14) or (18.14:16). In plane
strain, one only has to deal with problems per unit thickness. In this case,
one may simply put d = 1 in Eqs. (14) and (15).

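In order to determine the damping matrix C, we need to consider the
virtual work done by damping forces. As an example, consider the case
where the damping force is proportional to velocity (i.e., damping force
= $\gamma \dot{\mathbf{u}}\, dA$). The virtual work done is then

$\int \delta\mathbf{u}^T \gamma \dot{\mathbf{u}}\, dA = \delta\mathbf{q}_e^T \left( \int \gamma\, \mathbf{h}^T \mathbf{h}\, dA \right) \dot{\mathbf{q}}_e = \delta\mathbf{q}_e^T \mathbf{c}_e \dot{\mathbf{q}}_e,$

where the element damping matrix $\mathbf{c}_e$ is similar in form to the element mass matrix. An often-used form of C is
the Rayleigh damping matrix

$\mathbf{C} = a\mathbf{M} + b\mathbf{K},$

where a and b are constant parameters.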
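As a minimal sketch of the Rayleigh form, the function below builds C from given M and K stored as nested lists. The second helper evaluates the modal damping ratio $\xi = \frac{1}{2}(a/\omega + b\omega)$ that this form implies for a mode of undamped frequency $\omega$. Both function names are hypothetical.

```python
def rayleigh_damping(M, K, a, b):
    """Return C = a*M + b*K for square matrices given as nested lists."""
    n = len(M)
    return [[a * M[i][j] + b * K[i][j] for j in range(n)] for i in range(n)]


def damping_ratio(a, b, omega):
    """Modal damping ratio of Rayleigh damping at undamped frequency omega."""
    return 0.5 * (a / omega + b * omega)
```

The mass-proportional part dominates the damping of low-frequency modes and the stiffness-proportional part that of high-frequency modes.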

Both matrices M and C for the entire structure are assembled from
element matrices $\mathbf{m}_e$ and $\mathbf{c}_e$ in a fashion analogous to that for obtaining K.
Once the system of equations implied by Eq. (1) is established, its solution
can be obtained by time integration or modal analysis. The eigenvalue
problems (free vibration) corresponding to the homogeneous system of Eq. (1)
can also be treated by well-established techniques (Clough and Bathe 1972).

From the point of view of computer implementation and the questions
of accuracy, let us examine the pros and cons of employing the consistent
and lumped mass matrices. The lumped mass matrix is often a diagonal
matrix. This requires less computer storage and less time to generate than
the corresponding consistent mass matrix (a consistent mass matrix requires
the same amount of storage space as the stiffness matrix). The diagonal
form also facilitates calculation when an explicit time-integration scheme is
used, where new vectors to be computed in a given time step are functions
only of known vectors computed in previous steps (Clough and Bathe 1972;
Tong and Rossettos 1974). As an example, integrating Eq. (18.17:1)
with C = 0 by the central difference scheme gives

$\mathbf{M}\mathbf{u}_{n+1} = \mathbf{M}(2\mathbf{u}_n - \mathbf{u}_{n-1}) + (\Delta t)^2 [\mathbf{F}(t_n) - \mathbf{K}\mathbf{u}_n], \qquad \mathbf{u}(0) = \mathbf{u}_0,$

where $\mathbf{u}_n$ denotes the solution for u at time $t = t_n$ (= $n\Delta t$), with $\Delta t$ being
the time increment of each integration step. If M is a diagonal matrix,
$\mathbf{u}_{n+1}$ can be evaluated rapidly from $\mathbf{u}_n$ and $\mathbf{u}_{n-1}$. Otherwise, solving for
$\mathbf{u}_{n+1}$ at each step requires considerably more computer time.
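The following sketch shows how a diagonal mass matrix makes each central difference step a simple division per degree of freedom. It assumes C = 0, a constant stiffness matrix given as nested lists, and a starting value $\mathbf{u}_{-1}$ obtained from a Taylor expansion about t = 0 using a given initial velocity; that start-up choice and the function name are assumptions of this example, not part of the text above.

```python
def central_difference(m_diag, K, F, u0, v0, dt, n_steps):
    """Integrate M u'' + K u = F(t), with diagonal M, by central differences.

    m_diag: diagonal entries of M; F: callable returning the load list.
    Returns the list of displacement vectors u_0 ... u_{n_steps}.
    """
    n = len(u0)

    def accel(u, t):
        f = F(t)
        # With diagonal M, solving for the acceleration is one division per DOF
        return [(f[i] - sum(K[i][j] * u[j] for j in range(n))) / m_diag[i]
                for i in range(n)]

    a0 = accel(u0, 0.0)
    # Fictitious value u_{-1} from a Taylor expansion (assumed start-up)
    u_prev = [u0[i] - dt * v0[i] + 0.5 * dt * dt * a0[i] for i in range(n)]
    u = list(u0)
    history = [list(u0)]
    for step in range(n_steps):
        a = accel(u, step * dt)
        u_next = [2 * u[i] - u_prev[i] + dt * dt * a[i] for i in range(n)]
        u_prev, u = u, u_next
        history.append(u)
    return history
```

For a single degree of freedom with unit mass and stiffness, a step of 0.01 reproduces the cosine response closely, while a step larger than 2 (twice the inverse of the natural frequency) makes the computed response grow without bound.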

A particularly useful feature and the principal advantage of using a
consistent mass matrix is that, when the interpolation is conforming and the
evaluation of all element matrices is carried out by full integration, the natural
frequencies obtained for vibration problems are upper bounds. The use of a
lumped mass matrix in a compatible model tends to lower the frequencies.
There may be instances where more accurate frequencies are obtained by
the lumped mass approach.

In regard to the rate of convergence of mode shapes and frequencies by
the finite-element method using consistent and lumped mass formulations,
the following results have been established (Tong, Pian and Bucciarelli
1971). In cases where certain lowest-order elements are used, such as the
constant-strain triangle or four-node quadrilateral elements, a lumped mass
matrix provides the same order of approximation as a consistent mass matrix
for second-order differential equations. However, no systematic procedure
has been found to lump the mass that guarantees improvement in the
convergence of higher order elements or for problems involving higher order
differential equations. As a matter of fact, a consistent mass matrix often
gives a higher-order approximation. In general, what can be recommended is
that a lumped mass matrix be used for the low-order elements described above,
where it provides the same order of approximation.

In order to treat problems in real structures, however, such as auto
or rail vehicles, buildings, or bridges, where the mass distribution itself is
often distinctly concentrated in certain areas, one often uses a gross
approximation for the stiffness description. In these cases, one should use a
lumped-mass approach. An example where this concept has been used to
advantage concerns the structural deformation of vehicles in a crash
(Rossettos and Weinstock 1974; Tong and Rossettos 1974). The vehicle can
be conveniently divided into individual modules, each with its own stiffness
and mass characteristics. The modules are connected by the standard
finite-element methodology. Mass lumping becomes a clear choice for such
physical problems (e.g., engine mass, etc.).

One may solve Eq. (1) by modal analysis or by step-by-step integration
in time. Readers are referred to any book on dynamics regarding modal
analysis. We will discuss only the time integration methods, also called the
direct methods. The accuracy of the direct methods is measured in terms
of the truncation error. If the truncation error is of the order $\Delta t^{m+1}$, the
integration scheme is said to be mth-order accurate.

Newmark β-Method (Newmark 1959). This is one of the most widely
used direct integration methods for second-order ordinary differential
equations. The Newmark β-method rewrites Eq. (1) in the approximate form:

(16) $\mathbf{M}_{n+1}\mathbf{a}_{n+1} + \mathbf{C}_{n+1}\mathbf{v}_{n+1} + \mathbf{K}_{n+1}\mathbf{q}_{n+1} = \mathbf{F}_{n+1},$
$\mathbf{q}_{n+1} = \mathbf{q}_n + \Delta t\, \mathbf{v}_n + \Delta t^2 [(1 - 2\beta)\mathbf{a}_n + 2\beta\mathbf{a}_{n+1}]/2,$
$\mathbf{v}_{n+1} = \mathbf{v}_n + \Delta t [(1 - \gamma)\mathbf{a}_n + \gamma\mathbf{a}_{n+1}],$

where $\Delta t$ is the time step size and $(\cdot)_n$ denotes the value of the concerned
quantity at $t = t_n$, so that $\mathbf{q}_n$, $\mathbf{v}_n$, and $\mathbf{a}_n$ are the approximations of $\mathbf{q}(t_n)$,
$\dot{\mathbf{q}}(t_n)$, $\ddot{\mathbf{q}}(t_n)$, respectively. In most structural problems, M and C are
independent of time. This is not the case for biological systems.

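The following recursive formula for $\mathbf{a}_{n+1}$ can be derived from Eq. (16):

(17) $(\mathbf{M} + \gamma\Delta t\, \mathbf{C} + \beta\Delta t^2 \mathbf{K})_{n+1}\, \mathbf{a}_{n+1} = \mathbf{F}_{n+1} - \mathbf{C}_{n+1}\tilde{\mathbf{v}}_{n+1} - \mathbf{K}_{n+1}\tilde{\mathbf{q}}_{n+1},$

and

(17a) $\mathbf{q}_{n+1} = \tilde{\mathbf{q}}_{n+1} + \beta\Delta t^2 \mathbf{a}_{n+1}, \qquad \mathbf{v}_{n+1} = \tilde{\mathbf{v}}_{n+1} + \gamma\Delta t\, \mathbf{a}_{n+1},$

where

(17b) $\tilde{\mathbf{q}}_{n+1} = \mathbf{q}_n + \Delta t\, \mathbf{v}_n + \Delta t^2 (1 - 2\beta)\mathbf{a}_n/2, \qquad \tilde{\mathbf{v}}_{n+1} = \mathbf{v}_n + \Delta t (1 - \gamma)\mathbf{a}_n.$

To start the integration from the initial conditions $\mathbf{q} = \mathbf{q}_0$ and $\mathbf{v} = \mathbf{v}_0$, one
first calculates $\mathbf{a}_0$ from the first equation of Eq. (16),

$\mathbf{M}_0 \mathbf{a}_0 = \mathbf{F}_0 - \mathbf{C}_0 \mathbf{v}_0 - \mathbf{K}_0 \mathbf{q}_0,$

and evaluates $\tilde{\mathbf{q}}_1$ and $\tilde{\mathbf{v}}_1$ from Eq. (17b). One then solves for $\mathbf{a}_1$ from
Eq. (17), determines $\mathbf{q}_1$, $\mathbf{v}_1$ from Eq. (17a), proceeds to determine $\mathbf{a}_2$, $\mathbf{q}_2$
and $\mathbf{v}_2$ of the next time increment, and so on.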
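The predictor, solve and corrector sequence of Eqs. (17), (17a) and (17b) can be sketched for a single degree of freedom, where the solve in Eq. (17) is a scalar division. The sketch assumes constant m, c and k; the function name `newmark_sdof` and its argument names are hypothetical.

```python
def newmark_sdof(m, c, k, force, q0, v0, dt, n_steps, beta=0.25, gamma=0.5):
    """Newmark-beta integration of m q'' + c q' + k q = force(t), one DOF.

    Returns a list of (q, v, a) tuples for steps 0 ... n_steps.
    """
    q, v = q0, v0
    a = (force(0.0) - c * v - k * q) / m              # a_0 from Eq. (16)
    m_eff = m + gamma * dt * c + beta * dt * dt * k   # left side of Eq. (17)
    out = [(q, v, a)]
    for n in range(n_steps):
        t_next = (n + 1) * dt
        # Predictors, Eq. (17b)
        q_pred = q + dt * v + dt * dt * (1 - 2 * beta) * a / 2
        v_pred = v + dt * (1 - gamma) * a
        # Eq. (17): solve for the new acceleration
        a = (force(t_next) - c * v_pred - k * q_pred) / m_eff
        # Correctors, Eq. (17a)
        q = q_pred + beta * dt * dt * a
        v = v_pred + gamma * dt * a
        out.append((q, v, a))
    return out
```

With the default values $\beta = 1/4$, $\gamma = 1/2$ and no damping, the discrete energy $\frac{1}{2}mv^2 + \frac{1}{2}kq^2$ of a free vibration is preserved from step to step, even for a large time step. For a matrix system the division by `m_eff` becomes the solution of a linear system.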

An integration scheme is called explicit if $\mathbf{a}_{n+1}$ can be obtained without
solving the system of algebraic equations Eq. (17). Otherwise the scheme is
called implicit. Obviously, if M and C are diagonal and $\beta = 0$ (K is usually
not diagonal), we have an explicit scheme. In practice, we often call the
scheme explicit as long as $\beta = 0$, even when M and C are not diagonal.
Many well-known integration schemes are special cases of the Newmark
method. For example, the trapezoidal rule corresponds to $\beta = 1/4$ and
$\gamma = 1/2$, which is an unconditionally stable implicit scheme; $\beta = 1/6$ and
$\gamma = 1/2$ gives the linear acceleration method, an implicit scheme; and
$\beta = 0$ and $\gamma = 1/2$ gives the central difference method, which is an explicit
scheme. It can be shown that the accuracy is of the order $\Delta t^2$, said to
be second order accurate, in each of these cases. An integration algorithm that is
stable regardless of the size of $\Delta t$ is said to be unconditionally stable.

For some values of $\beta$, $\gamma$ and $\Delta t$, the integrated solution grows
exponentially. This phenomenon is called numerical instability. The parameters $\beta$,
$\gamma$ and $\Delta t$ also affect the accuracy of the solution. In practice, one must
choose proper values for $\beta$, $\gamma$ and $\Delta t$ to control both the accuracy and
stability of the algorithm.

One can determine the condition when instability will occur. For the
Newmark method with C = aM + bK, the stability conditions (Goudreau
and Taylor 1972, Krieg and Key 1973) are

Unconditionally stable: $2\beta \ge \gamma \ge 0.5$

Conditionally stable: $2\beta < \gamma$, $\omega\Delta t \le \Omega_{crit}$,

where

$\Omega_{crit} = \frac{\xi(\gamma - 0.5) + [0.5\gamma - \beta + \xi^2 (\gamma - 0.5)^2]^{1/2}}{0.5\gamma - \beta}, \qquad \xi = \frac{1}{2}\left( \frac{a}{\omega} + b\omega \right).$

The quantity $\omega$ is the maximum undamped frequency of the system and $\xi$ is
the corresponding damping ratio. For $\gamma > 1/2$, one can use $\Omega_{crit} = (0.5\gamma - \beta)^{-1/2}$ as a conservative value when a realistic estimate of the damping
coefficient is not available. The maximum frequency of the system can
be obtained from the maximum eigenvalues of individual elements (Tong
1970b). Hughes (1987) listed the maximum frequencies for a number of
elements.

For explicit procedures, the time increment $\Delta t$ required for stability is
inversely proportional to the maximum frequency (Tong et al. 1971, Clough
and Bathe 1972, Hughes 1987). Since the lumped mass approach usually
gives a lower maximum frequency than the consistent
mass formulation, a larger time step can be used for integration with a
lumped mass matrix. This particular advantage is no longer clear when an
implicit integration scheme is employed.
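The stability limit can be evaluated directly from the expression for $\Omega_{crit}$. The sketch below returns infinity for parameter pairs in the unconditionally stable range; the function name and the treatment of $\gamma < 0.5$ as an error are choices made for this example.

```python
import math


def omega_crit(beta, gamma, xi=0.0):
    """Critical value of omega * dt for the Newmark method.

    xi is the damping ratio at the maximum undamped frequency omega.
    """
    if gamma < 0.5:
        raise ValueError("gamma >= 0.5 is assumed here")
    if 2 * beta >= gamma:
        return math.inf  # unconditionally stable
    g = gamma - 0.5
    d = 0.5 * gamma - beta
    return (xi * g + math.sqrt(d + xi * xi * g * g)) / d


def max_stable_step(beta, gamma, omega_max, xi=0.0):
    """Largest stable time step for a system with maximum frequency omega_max."""
    return omega_crit(beta, gamma, xi) / omega_max
```

For the central difference method ($\beta = 0$, $\gamma = 1/2$) this gives $\omega\Delta t \le 2$, and for the linear acceleration method ($\beta = 1/6$, $\gamma = 1/2$) it gives $\omega\Delta t \le 2\sqrt{3}$.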

Multi-Step Methods for First-Order Equations (Gear 1971). The
multi-step methods for first-order equations are used to integrate equations
of the form

(18) $\dot{\mathbf{y}} = \mathbf{f}(\mathbf{y}, t).$

One can use the multi-step method to integrate Eq. (1) by converting it to
a set of first-order equations.

The integration is defined by

(19) $\sum_{i=0}^{k} [\alpha_i \mathbf{y}_{n+1-i} + \beta_i \Delta t\, \mathbf{f}(\mathbf{y}_{n+1-i}, t_{n+1-i})] = 0,$

where the $\alpha$'s and $\beta$'s are constant parameters. The method is called explicit
if $\beta_0 = 0$; otherwise it is implicit.

Many algorithms of practical interest take the form of Eq. (19). For
example, with k = 1, $-\alpha_0 = \alpha_1 = 1$, $\beta_0 = \alpha$ and $\beta_1 = 1 - \alpha$, Eq. (19)
becomes a one-step trapezoidal method in the form

(20) $\mathbf{y}_{n+1} - \mathbf{y}_n = \Delta t [\alpha\, \mathbf{f}(\mathbf{y}_{n+1}, t_{n+1}) + (1 - \alpha)\, \mathbf{f}(\mathbf{y}_n, t_n)].$

The forward difference method corresponds to $\alpha = 0$ and is an explicit
method; and the backward difference method corresponds to $\alpha = 1$ and is
an implicit method.

For a linear system, f can be written in the form

$\mathbf{f}(\mathbf{y}, t) = \mathbf{H}\mathbf{y} + \mathbf{F},$

where H and F are matrix functions independent of y. The stability of the
multi-step method for constant H can be determined as follows: let

(21) $\mathbf{y}_j = \mathbf{a}_\lambda \eta^j,$

where $\mathbf{a}_\lambda$ is the eigenvector of H associated with the eigenvalue $\lambda$ and $\eta$
is a parameter. Substituting Eq. (21) into Eq. (19) and setting F = 0
(the particular solution has no effect on stability), we obtain the stability
equation

$\sum_{i=0}^{k} (\alpha_i + \Delta t\, \lambda \beta_i)\, \eta^{n+1-i} = 0.$

If the magnitude of all the roots $\eta$ of the stability equation is less than
one for all $\lambda$, the integration scheme is stable. Usually it is the maximum
eigenvalue of H that controls stability. Dahlquist (1963) showed that

• No explicit unconditionally stable multi-step method exists.

• No 3rd-order accurate unconditionally stable multi-step method exists.

• The trapezoidal rule has the smallest error constant among the
unconditionally stable multi-step methods that are 2nd order accurate.

For the one-step method, the stability equation gives

$\eta = [1 + (1 - \alpha)\lambda\Delta t]/(1 - \alpha\lambda\Delta t).$

We see that the backward difference method ($\alpha = 1$) is unconditionally
stable if Re($\lambda$) < 0 for all the eigenvalues $\lambda$ of H, while the forward difference
method ($\alpha = 0$) is stable only if $|1 + \lambda\Delta t| < 1$ for all eigenvalues.
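The amplification factor $\eta$ can be used both to test stability and to advance the scalar test equation $\dot{y} = \lambda y$ in time, since each step multiplies y by $\eta$. The following is a minimal sketch with hypothetical function names; $\lambda$ may be real or complex.

```python
import math


def amplification_factor(alpha, lam, dt):
    """eta = y_{n+1} / y_n of the one-step method (20) applied to y' = lam * y."""
    return (1 + (1 - alpha) * lam * dt) / (1 - alpha * lam * dt)


def integrate_test_equation(alpha, lam, y0, dt, n_steps):
    """Advance y' = lam * y over n_steps steps of size dt."""
    eta = amplification_factor(alpha, lam, dt)
    y = y0
    for _ in range(n_steps):
        y *= eta
    return y


def exact_solution(lam, y0, t):
    """Exact solution y0 * exp(lam * t), for comparison (real lam)."""
    return y0 * math.exp(lam * t)
```

For $\lambda = -100$ and $\Delta t = 0.1$ the forward difference method gives $|\eta| = 9$ and is unstable, whereas $\alpha = 1/2$ and $\alpha = 1$ give $|\eta| < 1$.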

Park (1975) proposed a second-order accurate 3-step implicit method,
which is unconditionally stable with good accuracy in the low frequencies
and strong dissipation in high frequencies. The Park method is defined by

$k = 3, \quad \alpha_0 = -1, \quad \alpha_1 = 1.5, \quad \alpha_2 = -0.6, \quad \alpha_3 = 0.1, \quad \beta_0 = 0.6, \quad \beta_1 = \beta_2 = \beta_3 = 0.$
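Written out with these coefficients, Eq. (19) reads $\mathbf{y}_{n+1} = 1.5\mathbf{y}_n - 0.6\mathbf{y}_{n-1} + 0.1\mathbf{y}_{n-2} + 0.6\Delta t\, \mathbf{f}(\mathbf{y}_{n+1}, t_{n+1})$. The sketch below applies it to the scalar test equation $\dot{y} = \lambda y$, for which the implicit solve is a division. A three-step method needs extra starting values; here two trapezoidal steps are used, which is an assumption of this example rather than part of the method as stated.

```python
def park_linear(lam, y0, dt, n_steps):
    """Park's 3-step method for the scalar test equation y' = lam * y.

    Returns the list y_0 ... y_{n_steps}; lam may be real or complex.
    """
    # Start-up: two trapezoidal steps supply y_1 and y_2 (assumed choice)
    eta_trap = (1 + 0.5 * lam * dt) / (1 - 0.5 * lam * dt)
    ys = [y0, y0 * eta_trap, y0 * eta_trap ** 2]
    while len(ys) <= n_steps:
        rhs = 1.5 * ys[-1] - 0.6 * ys[-2] + 0.1 * ys[-3]
        # Implicit solve with beta_0 = 0.6: (1 - 0.6 * lam * dt) y_{n+1} = rhs
        ys.append(rhs / (1 - 0.6 * lam * dt))
    return ys[: n_steps + 1]
```

For a slowly decaying mode the result follows the exponential solution closely, while for a purely oscillatory mode with $|\lambda|\Delta t$ very large the computed amplitude is damped out within a few steps, consistent with the strong high-frequency dissipation noted above.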

Multi-Step Methods for Second-Order Equations. Consider a system
of second-order equations in the form

(22) $\ddot{\mathbf{y}} = \mathbf{f}_1(\mathbf{y}, t)\, \dot{\mathbf{y}} + \mathbf{f}_0(\mathbf{y}, t).$

Most structural dynamics equations can be put in this form. The multi-step
method is defined by (Geradin 1974)

(23) $\sum_{i=0}^{k} [\alpha_i \mathbf{y}_{n+1-i} + \beta_i \Delta t\, \mathbf{f}_1(\mathbf{y}_{n+1-i}, t_{n+1-i})\, \mathbf{y}_{n+1-i} + \gamma_i \Delta t^2\, \mathbf{f}_0(\mathbf{y}_{n+1-i}, t_{n+1-i})] = 0.$

One sees that the Newmark method is a two-step method. The stability
properties of Eq. (23) can be examined in a similar fashion (Krieg 1973).

Generally, conditionally stable algorithms require a time step that is of
the order of the shortest vibration period of the structure (the inverse of
the highest frequency). Thus one often is forced to use time steps much
smaller than those needed for accuracy. Therefore, unconditionally stable
algorithms may be preferable, especially if only the low-frequency response is of
interest. In this case, it would be desirable if the algorithm can effectively
damp out the high-frequency modes. An assessment of numerical dissipation
can be found in Hughes (1983).

There are many other time integration schemes, such as the Wilson θ-method
(Wilson 1968), the Runge-Kutta and predictor-corrector methods, and the α-method
(Hilber et al. 1977). Readers are referred to the literature (e.g., Hughes 1987)
for the properties of various integration schemes.


18.18. NUMERICAL INTEGRATION 

Derivation of element matrices involves line, area, or volume integrals.
With the exception of simple elements, exact integration can be very
complicated. Numerical integration is essential for almost all practical
applications. We shall discuss briefly the principle of numerical integration and
give tables of numerical coefficients for commonly used schemes.

If a function u(x) is approximated by shape functions such that $u(x) \approx \sum_j h_j(x)\, q_j$, with the q's being parameters associated with the values of u at
selected points, then the integral of u over a domain $-1 \le x \le 1$ is
approximately

$\int_{-1}^{1} u(x)\, dx \approx \sum_j W_j q_j,$

where

$W_j = \int_{-1}^{1} h_j(x)\, dx.$

In the ordinary rules of integration, generally called quadrature formulas, the
q's are the values of u at the selected locations. There are many quadrature
rules. The Gauss quadrature stands out as being generally more accurate
for a fixed number of integration points and is described below.


One-Dimensional Integration. The limits of integration can always be
transformed to run from −1 to +1. If we approximate u by the mth
order Hermite interpolation, we have

$\int_{-1}^{1} u(\xi)\, d\xi \approx \sum_{j=1}^{m} W_j\, u(\xi_j) + \sum_{j=1}^{m} \bar{W}_j \left( \frac{du}{d\xi} \right)_{\xi_j},$

where

$W_j = \int_{-1}^{1} h_{1j}(\xi)\, d\xi, \qquad \bar{W}_j = \int_{-1}^{1} h_{2j}(\xi)\, d\xi,$

in which the h's are defined in Eq. (18.3:12) and the W's are called the weighting
factors. If the stations $\xi_1, \ldots, \xi_m$ are chosen such that all $\bar{W}_j$ vanish, then

$\int_{-1}^{1} u(\xi)\, d\xi \approx \sum_{j=1}^{m} W_j\, u(\xi_j),$

Table 18.18:1. Stations and weighting factors of Gauss quadrature.

No. of      $\xi_j$ (station         Weighting          Order      Error
stations    locations)               factors ($W_j$)
1           0                        2                  linear     $O(\epsilon^2)$
2           $\pm 1/\sqrt{3}$         1                  cubic      $O(\epsilon^4)$
3           0                        8/9                quintic    $O(\epsilon^6)$
            $\pm\sqrt{3/5}$          5/9


and the resulting formula is called the mth order Gauss quadrature. Such
a quadrature represents a function by a polynomial of degree 2m − 1. The
degree-of-accuracy of a quadrature is the degree of the highest order
complete polynomial that can be integrated exactly by the formula. Hence
2m − 1 is the degree-of-accuracy of an m-point quadrature, and the error of
the approximated integration is thus of the order $\epsilon^{2m}$. Table 18.18:1 gives
the stations and the weighting factors of the Gauss quadrature of different
orders. Any integration of a one-dimensional function can be approximated
now as the summation of the products of the function's value at selected
stations and their corresponding weighting factors.
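The stations and weighting factors of Table 18.18:1 translate directly into a short routine. The sketch below also maps a general interval [a, b] onto −1 to +1; the names `GAUSS` and `gauss_integrate` are hypothetical.

```python
import math

# Stations and weighting factors of the 1-, 2- and 3-point Gauss rules
GAUSS = {
    1: [(0.0, 2.0)],
    2: [(-1 / math.sqrt(3), 1.0), (1 / math.sqrt(3), 1.0)],
    3: [(-math.sqrt(0.6), 5 / 9), (0.0, 8 / 9), (math.sqrt(0.6), 5 / 9)],
}


def gauss_integrate(u, m, a=-1.0, b=1.0):
    """Integrate u over [a, b] with the m-point Gauss rule."""
    half, mid = 0.5 * (b - a), 0.5 * (a + b)
    # Map each station xi in [-1, 1] to x = mid + half * xi
    return half * sum(w * u(mid + half * xi) for xi, w in GAUSS[m])
```

The two-point rule integrates any cubic exactly and the three-point rule any quintic, in line with the degree-of-accuracy 2m − 1. Applying the two-point rule to $\xi^4$ gives 2/9 instead of 2/5, which shows where exactness ends.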


Two- and Three-Dimensional Integration. Multi-dimensional Gauss
integration rules are formed by successive application of the
one-dimensional Gauss rule. In two dimensions, for a square domain, we have

(2) $\int_{-1}^{1} \int_{-1}^{1} u(\xi, \eta)\, d\xi\, d\eta \approx \int_{-1}^{1} \sum_{i} W_i\, u(\xi_i, \eta)\, d\eta \approx \sum_{i} \sum_{j} W_i W_j\, u(\xi_i, \eta_j),$

where the W's are the same as those of one-dimensional integration. For
Eq. (18.14:15), the integrand $\mathbf{B}^T \mathbf{D}_e \mathbf{B} J$ is an n × n symmetric matrix for
an element of n degrees-of-freedom. The matrix has n(n + 1)/2 distinct
entries, which are functions of $\xi$ and $\eta$. We would have to integrate
all entries individually. A 4-node quadrilateral in plane elasticity has 8
degrees-of-freedom and would require 36 separate integrations for the
determination of k.
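The successive application of the one-dimensional rule in Eq. (2), and the count n(n + 1)/2 of entries to be integrated, can be sketched as follows. The function names are hypothetical, and the rule table repeats the one-dimensional stations and weights so that the block is self-contained.

```python
import math


def distinct_entries(n_dof):
    """Number of distinct entries of a symmetric n_dof x n_dof matrix."""
    return n_dof * (n_dof + 1) // 2


def gauss_2d(u, m):
    """Tensor-product m x m Gauss rule on the square -1 <= xi, eta <= 1."""
    rules = {
        1: [(0.0, 2.0)],
        2: [(-1 / math.sqrt(3), 1.0), (1 / math.sqrt(3), 1.0)],
        3: [(-math.sqrt(0.6), 5 / 9), (0.0, 8 / 9), (math.sqrt(0.6), 5 / 9)],
    }
    rule = rules[m]
    # Double sum of W_i * W_j * u(xi_i, eta_j)
    return sum(wi * wj * u(xi, eta) for xi, wi in rule for eta, wj in rule)
```

For an 8 degree-of-freedom quadrilateral, `distinct_entries(8)` returns 36, and each of those entries would be passed to a routine such as `gauss_2d` as the integrand.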

In three dimensions, the integration rule has a similar form,

(3) $\int_{-1}^{1} \int_{-1}^{1} \int_{-1}^{1} u(\xi, \eta, \zeta)\, d\xi\, d\eta\, d\zeta \approx \sum_{i} \sum_{j} \sum_{k} W_i W_j W_k\, u(\xi_i, \eta_j, \zeta_k).$

An mth order Gauss quadrature can correctly integrate any term $\xi^i \eta^j \zeta^k$ with
$i, j, k \le 2m - 1$. For the integral of Eq. (18.16:17), an 8-node hexahedron
for 3-dimensional elasticity has 24 degrees-of-freedom. There are 300
distinct entries of $\mathbf{B}^T \mathbf{D}_e \mathbf{B} J$ to be integrated individually. A 20-node
hexahedron has 60 degrees-of-freedom with 1830 distinct entries. The
required number of integrations increases rapidly for 3-dimensional elements
of high order. In practice, it is not necessary to use the same number of
integration stations in all directions of integration. One may use the
appropriate weights and number of Gauss points according to needs.

The Gauss quadrature is most popular for the integration of
one-dimensional, quadrilateral and brick elements, where the domains of
integration in natural coordinates are $-1 \le \xi \le 1$, $-1 \le \xi, \eta \le 1$ and
$-1 \le \xi, \eta, \zeta \le 1$, respectively. More efficient integration rules can be
designed to integrate complete polynomials (Irons 1971) of order 2m − 1. The
integration takes the form

(4) $\begin{aligned} \int_{-1}^{1} \int_{-1}^{1} \int_{-1}^{1} u(\xi, \eta, \zeta)\, d\xi\, d\eta\, d\zeta ={}& A\, u(0, 0, 0) && \text{(1 term)} \\ &+ B[u(-b, 0, 0) + u(b, 0, 0) + u(0, -b, 0) + \cdots] && \text{(6 terms)} \\ &+ C[u(-c, -c, -c) + u(c, -c, -c) + u(c, c, -c) + \cdots] && \text{(8 terms)} \\ &+ D[u(-d, -d, 0) + u(d, -d, 0) + \cdots + u(d, 0, -d) + u(-d, 0, -d) + \cdots] && \text{(12 terms).} \end{aligned}$


The weighting factors A, B, C, D and the corresponding coordinates are
listed in Table 18.18:2. From the truncation errors, we see that, to
correctly integrate a complete polynomial of a given order, the Irons
formulae require fewer integration points than the Gauss quadrature.


Table 18.18:2. Weighting factors and coordinates for Eq. (18.18:4).

Number of
integration
points        Coordinates        Weighting factors*   Order     Error

  1           -                  A = 8                Linear    $O(\epsilon^2)$

  6           b = 1              B = 8/6              Cubic     $O(\epsilon^4)$

 14           b = 0.795822426    B = 0.886426593      Quintic   $O(\epsilon^6)$
              c = 0.758786911    C = 0.335180055

 27           b = 0.848418001    A = 0.788073483      Heptic    $O(\epsilon^8)$
              c = 0.652816472    B = 0.499362002
              d = 1.106412899    C = 0.478508449
                                 D = 0.032303742

* Unlisted weighting factors are zero.
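As an illustration of the point count, the 14-point entry of Table 18.18:2 can be coded directly and tested on monomials. This is a minimal sketch: the function names are illustrative, the constants are copied from the table, and agreement is therefore limited to about the nine digits tabulated.

```python
import itertools

# 14-point Irons rule, constants copied from Table 18.18:2
b, c = 0.795822426, 0.758786911
B, C = 0.886426593, 0.335180055

def irons_14(f):
    """Integrate f(xi, eta, zeta) over [-1, 1]^3 with 6 axis points and 8 diagonal points."""
    total = 0.0
    for axis in range(3):                       # points (+-b, 0, 0), (0, +-b, 0), (0, 0, +-b)
        for s in (-b, b):
            p = [0.0, 0.0, 0.0]
            p[axis] = s
            total += B * f(*p)
    for p in itertools.product((-c, c), repeat=3):   # points (+-c, +-c, +-c)
        total += C * f(*p)
    return total

def exact_monomial(i, j, k):
    """Exact integral of xi^i * eta^j * zeta^k over [-1, 1]^3."""
    def one_d(n):
        return 0.0 if n % 2 else 2.0 / (n + 1)
    return one_d(i) * one_d(j) * one_d(k)
```

The rule reproduces every monomial of total degree 5 or less, whereas the smallest tensor-product Gauss rule that is exact for quintic terms (3 x 3 x 3) uses 27 points.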

Problem 18.17. Derive the equivalent of the Irons formulae for a square.
Determine (a) the number of points needed for integration; (b) the weight
factors; and (c) the corresponding coordinates of the integration points.


Table 18.18:3. Numerical integration stations and weights for triangles.

Integration   Triangular
points        coordinates            Weights         Order       Error

a             1/3, 1/3, 1/3          1               Linear      $O(\epsilon^2)$

a             1/2, 1/2, 0            1/3             Quadratic   $O(\epsilon^3)$
b             0, 1/2, 1/2            1/3
c             1/2, 0, 1/2            1/3

a             1/3, 1/3, 1/3          -27/48          Cubic       $O(\epsilon^4)$
b             0.6, 0.2, 0.2          25/48
c             0.2, 0.6, 0.2          25/48
d             0.2, 0.2, 0.6          25/48

a             1/3, 1/3, 1/3          0.2250000000    Quintic     $O(\epsilon^6)$
b             $a_1, b_1, b_1$        0.1323941527
c             $b_1, a_1, b_1$        0.1323941527
d             $b_1, b_1, a_1$        0.1323941527
e             $a_2, b_2, b_2$        0.1259391805
f             $b_2, a_2, b_2$        0.1259391805
g             $b_2, b_2, a_2$        0.1259391805

$a_1 = 0.0597158717$
$b_1 = 0.4701420641$
$a_2 = 0.7974269853$
$b_2 = 0.1012865073$

Numerical Integration for Triangles or Tetrahedrons. A triangle
can be treated as a degenerate quadrilateral. Hence, an integral over a
triangular area can be obtained by the Gauss quadrature. However, it is
not advisable to use the degenerate integration involving many integration
stations because a singularity occurs in mapping a triangle to a square,
which will lead to loss of accuracy. An alternative is to apply the numerical
integration approach described for one-dimensional integration directly to
a triangular area

$I = \int_0^1 \int_0^{1-\eta} u\,d\xi\,d\eta \approx \sum_j w_j u_j$.

The limits of integration now involve the independent variables themselves. 
A series of sampling stations, weights and degrees of precision for different 
integration formulae (Hammer et al. 1956, Felippa 1966, Cowper 1973) is 
given in Table 18.18:3. Similarly one can derive the formulae for tetrahe¬ 
drons. The results are presented in Table 18.18:4 (Hammer et al. 1956). 
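The triangle rules of Table 18.18:3 can be exercised with a small sketch. Two assumptions are made here and are not stated in the text: the first two triangular coordinates are identified with $\xi$ and $\eta$ on the unit triangle, and, because the tabulated weights sum to one, the weighted sum is multiplied by the triangle area (1/2 for the unit triangle). The names `RULES`, `integrate_unit_triangle` and `exact_monomial` are illustrative.

```python
from math import factorial

A1, B1 = 0.0597158717, 0.4701420641
A2, B2 = 0.7974269853, 0.1012865073

# order of the rule -> list of ((L1, L2, L3), weight), copied from Table 18.18:3
RULES = {
    1: [((1/3, 1/3, 1/3), 1.0)],
    2: [((0.5, 0.5, 0.0), 1/3), ((0.0, 0.5, 0.5), 1/3), ((0.5, 0.0, 0.5), 1/3)],
    3: [((1/3, 1/3, 1/3), -27/48), ((0.6, 0.2, 0.2), 25/48),
        ((0.2, 0.6, 0.2), 25/48), ((0.2, 0.2, 0.6), 25/48)],
    5: [((1/3, 1/3, 1/3), 0.2250000000),
        ((A1, B1, B1), 0.1323941527), ((B1, A1, B1), 0.1323941527),
        ((B1, B1, A1), 0.1323941527),
        ((A2, B2, B2), 0.1259391805), ((B2, A2, B2), 0.1259391805),
        ((B2, B2, A2), 0.1259391805)],
}

def integrate_unit_triangle(f, order):
    """Approximate the integral of f(xi, eta) over 0 <= xi, 0 <= eta, xi + eta <= 1."""
    area = 0.5   # assumed normalization: tabulated weights sum to 1
    return area * sum(w * f(L1, L2) for (L1, L2, _L3), w in RULES[order])

def exact_monomial(a, b):
    """Exact integral of xi^a * eta^b over the unit triangle: a! b! / (a + b + 2)!."""
    return factorial(a) * factorial(b) / factorial(a + b + 2)
```

Each rule reproduces all monomials $\xi^a\eta^b$ with $a + b$ up to its stated order, to within the number of digits given in the table.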


Table 18.18:4. Numerical integration stations and weights for tetrahedrons.

Integration   Tetrahedral
points        coordinates                          Weights   Order       Error

a             1/4, 1/4, 1/4, 1/4                   1         Linear      $O(\epsilon^2)$

a             $\alpha, \beta, \beta, \beta$        1/4       Quadratic   $O(\epsilon^3)$
b             $\beta, \alpha, \beta, \beta$        1/4
c             $\beta, \beta, \alpha, \beta$        1/4
d             $\beta, \beta, \beta, \alpha$        1/4

              $\alpha = 0.58541021$
              $\beta = 0.13819660$

a             1/4, 1/4, 1/4, 1/4                   -4/5      Cubic       $O(\epsilon^4)$
b             1/2, 1/6, 1/6, 1/6                   9/20
c             1/6, 1/2, 1/6, 1/6                   9/20
d             1/6, 1/6, 1/2, 1/6                   9/20
e             1/6, 1/6, 1/6, 1/2                   9/20


Required Order of Numerical Integration. Numerical integration is a
source of errors in the finite element method. The accuracy of integration
increases (not necessarily linearly) with the number of integration points.
As numerical integration can consume a significant amount of computer
time, it is of interest to determine (a) the minimum number of integration
points required for convergence and (b) the number of integration points
necessary to preserve the order of convergence of the method that would
result if exact integration were used.

To establish the minimum number of integration points, we first note
that, for convergence, the integration must be able to reproduce the size
of the domain as the size of all elements approaches zero. This is obviously
a necessary condition; otherwise the functional will not converge to a
constant when the integrand of the functional is a constant. This means
that the error in evaluating $\int_A d\xi\,d\eta$ or
$\int_V d\xi\,d\eta\,d\zeta$ by the integration formulae must be of the
order $O(\epsilon)$ or higher. All quadrature formulae cited in the tables
meet this requirement. Convergence of the finite element method based on
an integral formulation can occur provided that an arbitrary constant value
of the integrand of the functional can be reproduced. Thus, any integration
with order of error $O(\epsilon)$ suffices.

The finite element method approximates the functional to the order
$2(p - m)$, where p is the order of the complete polynomial of the shape
functions and m the order of differentials appearing in the functional. If
the error of the integration is at least $O[\epsilon^{2(p-m)+1}]$, no loss
of the order of convergence will occur. Thus, integration with an order of
accuracy higher than the approximation of the functional by the shape
functions will not change the order of convergence of the overall solution.
In fact, it will be shown later that using integration of an order higher
than actually needed under (b) may sometimes adversely affect the accuracy
of the finite element solution.
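A minimal sketch of how criterion (b) might translate into a Gauss point count per direction. It rests on two readings of the text that should be treated as assumptions: a rule that is exact to polynomial degree q has error $O(\epsilon^{q+1})$, as the Order and Error columns of the tables suggest, and an n-point Gauss rule is exact to degree 2n - 1. The function name `min_gauss_points` is illustrative.

```python
def min_gauss_points(p, m):
    """Smallest number of Gauss points per direction whose error is O(eps^(2(p-m)+1)).

    p: order of the complete polynomial in the shape functions
    m: order of the derivatives appearing in the functional
    Assumes an n-point Gauss rule is exact to degree 2n - 1 and that exactness
    to degree q corresponds to an error of O(eps^(q+1)).
    """
    required_degree = 2 * (p - m)
    n = 1
    while 2 * n - 1 < required_degree:
        n += 1
    return n
```

Under these assumptions a bilinear element in plane elasticity (p = 1, m = 1) needs one point per direction and a quadratic element (p = 2, m = 1) needs two, which are the reduced rules discussed for the 4-node and 8-node elements.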

18.19. PATCH TESTS 

We have presented finite element methods involving displacements that
violate the admissibility conditions of the weak form formulation. The
general theory can no longer assure the convergence of the weak form
solution. Irons (1966, 1966a) first introduced the concept of the patch test
to examine the correctness of the finite element formulation. The patch test
has now been generalized to determine whether or not an element employing
nonstandard features, such as reduced integration or incompatible
interpolations, satisfies the condition for convergence. Compatible (i.e.,
all assumed field variables meet the admissibility requirements)
displacement, hybrid and mixed elements automatically pass the patch test.
Even so, the assumed field variables may contain deformation modes not
called for, and one can still use the test to check the correctness of the
finite element formulation and computer implementation, as Irons originally
intended.



Fig. 18.19:1. A two-dimensional patch of elements: panels (a) and (b).

A patch is a small group of interconnected, irregularly shaped elements.
Figure 18.19:1 shows two typical 2-dimensional patches frequently used in
benchmark tests (Robinson and Blackham 1979, Hughes 1987). An element
passes the patch test if its shape functions can represent exactly the linear
states of deformation for a second order system. Plane elasticity is a
two-dimensional second order system and the linear states of deformation
are simply 1, x, and y.

For plane elasticity, there are two versions of the patch test. Version 1
specifies the displacements

(1)  $u = a_1 + a_2 x + a_3 y$,  $v = a_4 + a_5 x + a_6 y$

at all exterior nodes [nodes 5-8 for Fig. 18.19:1(a) or nodes 1-4 and 6-9
for Fig. 18.19:1(b)] for arbitrary a's. There are six independent choices of
these constants. Thus one has to solve six displacement-boundary-value
problems in order to examine all deformations of the six linear states.
The patch test is passed if, in the limit of the element size approaching
zero, (1) the displacement solutions at all interior nodes [nodes 1-4 for
Fig. 18.19:1(a) or node 5 for Fig. 18.19:1(b)] are exact, i.e., give the
values u, v of Eq. (1) evaluated at these nodes, and (2) the displacement
gradients, and consequently the strains and stresses, are exact within each
element, i.e., $\partial u/\partial x = a_2$, $\partial u/\partial y = a_3$,
$\partial v/\partial x = a_5$, $\partial v/\partial y = a_6$.
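A version 1 patch test can be sketched in a few lines for the standard 4-node isoparametric quadrilateral. Everything specific in this sketch is an assumption made for illustration: the 3 x 3 node layout with one displaced interior node, the isotropic plane stress modulus matrix, the full 2 x 2 Gauss rule, and the names `q4_stiffness` and `patch_test`. It checks only condition (1), the displacement at the interior node.

```python
import numpy as np

def q4_stiffness(xy, D):
    """8 x 8 stiffness of a 4-node quadrilateral (nodes counterclockwise, 2 x 2 Gauss)."""
    g = 1.0 / np.sqrt(3.0)
    K = np.zeros((8, 8))
    for xi in (-g, g):
        for eta in (-g, g):
            # derivatives of the bilinear shape functions with respect to (xi, eta)
            dN = 0.25 * np.array([
                [-(1 - eta), (1 - eta), (1 + eta), -(1 + eta)],
                [-(1 - xi), -(1 + xi), (1 + xi), (1 - xi)]])
            J = dN @ xy                      # Jacobian of the isoparametric map
            dNdx = np.linalg.solve(J, dN)    # derivatives with respect to (x, y)
            B = np.zeros((3, 8))
            B[0, 0::2] = dNdx[0]
            B[1, 1::2] = dNdx[1]
            B[2, 0::2] = dNdx[1]
            B[2, 1::2] = dNdx[0]
            K += B.T @ D @ B * np.linalg.det(J)   # both Gauss weights equal 1
    return K

def patch_test(a, E=1.0, nu=0.3):
    """Impose Eq. (1) on the exterior nodes; return the error at the interior node."""
    D = E / (1 - nu**2) * np.array([[1, nu, 0], [nu, 1, 0], [0, 0, (1 - nu) / 2]])
    nodes = np.array([[0, 0], [1, 0], [2, 0], [0, 1], [1.2, 0.8],
                      [2, 1], [0, 2], [1, 2], [2, 2]], dtype=float)
    elems = [[0, 1, 4, 3], [1, 2, 5, 4], [3, 4, 7, 6], [4, 5, 8, 7]]
    K = np.zeros((18, 18))
    for conn in elems:
        dofs = np.array([[2 * n, 2 * n + 1] for n in conn]).ravel()
        K[np.ix_(dofs, dofs)] += q4_stiffness(nodes[conn], D)
    x, y = nodes[:, 0], nodes[:, 1]
    exact = np.column_stack([a[0] + a[1] * x + a[2] * y,
                             a[3] + a[4] * x + a[5] * y]).ravel()
    interior = np.array([8, 9])              # both dofs of the single interior node
    exterior = np.setdiff1d(np.arange(18), interior)
    rhs = -K[np.ix_(interior, exterior)] @ exact[exterior]
    u_int = np.linalg.solve(K[np.ix_(interior, interior)], rhs)
    return np.abs(u_int - exact[interior]).max()
```

Running `patch_test` with each of the six unit choices of the constants solves the six displacement-boundary-value problems; for this compatible element the returned error is at round-off level.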

For the strains and stresses to be exact, the interior nodal forces must
be zero, i.e.,

(2)  $\sum_{\text{all patch elements}} \int_{A_e} \mathbf{q}^T (\partial_e \mathbf{h})^T \boldsymbol{\sigma}\,dA = 0$

for any arbitrary q's associated with the interior nodes of the patch, where
$\boldsymbol{\sigma}^T$ (= $[\sigma_{xx}\ \sigma_{yy}\ \sigma_{xy}]$) is
constant. This is equivalent to the requirement of self-equilibrium at all
interior nodes. If the generalized coordinates $q'_j$ with their associated
shape functions $\mathbf{h}'$ are defined within a single element only,
from Eq. (2), we obtain

(3)  $\int_{A_e} \partial_e \mathbf{h}'\,dA\ \mathbf{q}' = 0$

for the element. This is a necessary condition for any arbitrary q'
parameters and their associated shape functions to pass the constant strain
patch test. Parameters associated with bubble functions or some incompatible
shape functions are generalized coordinates of this type.

In version 2, the test involves two parts. First, rigid body modes are
tested. For a patch in plane elasticity as shown in Fig. 18.19:2, the rigid
body motion

$u = a_1 + a_2 y$,  $v = a_3 - a_2 x$

is specified at all nodes, interior as well as exterior. The resulting nodal
forces must be zero at all nodes for arbitrary a's [Eq. (2)]. In the second
part of the test, we impose a minimum number of exterior constraints, just
enough to restrain the rigid body motion, and apply appropriate loads to the
remaining exterior nodes that will produce a constant state of stress over
the patch. The patch test is passed if the finite element solution achieves
the desired constant stress for all elements.

The second part of the test requires the calculation of the appropriate
nodal loads corresponding to the desired state of deformation. These loads
are applied to the unconstrained exterior nodes. This is quite easy for linear
deformation. For plane elasticity, the linear deformation that satisfies the
displacement constraints shown in Fig. 18.19:2 is

$u = e_{xx} x + \alpha e_{xy} y$,  $v = e_{yy} y + (1 - \alpha) e_{xy} x$,

where e's are the three independent constant engineering strains and
$\alpha$ is an arbitrary constant. The nodal forces at the exterior nodes are

$\sum_{\text{all patch elements}} \int_{\partial A_{\sigma e}} \mathbf{h}^T \begin{bmatrix} n_x\sigma_{xx} + n_y\sigma_{xy} \\ n_x\sigma_{xy} + n_y\sigma_{yy} \end{bmatrix} ds$,

where $\sigma$'s are the stresses associated with the three strains
$e_{xx}$, $e_{yy}$, $e_{xy}$ given above, n's are the components of a unit
normal to the exterior boundaries and $\partial A_{\sigma e}$ is the portion
of element boundaries coinciding with the exterior boundaries of the patch.
The h's involve only the shape functions associated with nodes on
$\partial A_{\sigma e}$. For elements with straight edges,
$n_x\sigma_{xx} + n_y\sigma_{xy}$ and $n_x\sigma_{xy} + n_y\sigma_{yy}$ are
constant. The integration above can be carried out easily.

A patch test can be applied to a single element. In this case, the test can 
determine whether strains within the element are correctly evaluated from 
nodal displacements or forces but it cannot determine whether the element 
is compatible with its adjacent element. Therefore, a single element patch 
test is less useful than a multi-element patch test. However, a single element 
test is good to detect the existence of spurious zero energy modes, which 
are usually induced by the approximate integration of element matrices. 
Zero-energy modes will be discussed in more detail later. 

We can similarly establish higher order patch tests, e.g., requiring that
the shape functions exactly represent quadratic states of deformation. This
is needed for higher order equations such as bending of thin plates and
shells. In this case, the curvatures and twists are proportional to second
derivatives of the out-of-plane displacement. The quadratic test is a minimum
requirement for fourth order equations since quadratic deformation
represents only constant curvature and twist. One generally has to solve many
more boundary-value problems for a higher order patch test.

If the material is homogeneous, the patch test will be independent of 
element size for plane elasticity. 

18.20. LOCKING-FREE ELEMENTS 

The incorrect interpolation of a displacement field and its derivatives
from its values at nodes causes the failure of patch tests. There are two
sources of errors. Approximation of higher order functions occurs naturally
since the finite element method uses mostly polynomials of finite order.
Errors are also introduced through the nonlinearity of the isoparametric
coordinate transformation (Tong and Rossettos 1977, MacNeal 1994). For
example, for an 8-node quadrilateral element, the shape functions contain
only complete quadratic polynomials in $\xi$, $\eta$. However, the
transformation

$x = \sum_{j=1}^{8} x_j h_j(\xi, \eta)$, etc.,

given in Eq. (18.13:3) includes all quadratic and some higher order terms of
$\xi$, $\eta$. Then $x^2$, $xy$, $y^2$ will involve terms like
$\xi^2\eta^2$, $\eta\xi^3$, etc. Thus the shape functions cannot represent
$x^2$, $xy$, $y^2$ exactly within a general quadrilateral.

Interpolation failure has other consequences. Among the more serious
ones is the phenomenon of locking, a condition of excessive stiffness.
Locking often occurs in bending, caused by spurious shear. It also occurs in
problems involving incompressible or nearly incompressible materials, as a
result of incorrect representation of dilatation. Discussions of the subject
can be found in Hughes (1987), Babuska and Suri (1990), MacNeal
(1994). Babuska's work identifies parameters characterizing locking and its
effects on convergence rates.

Increasing the order of shape functions generally lowers the interpolation
errors and thus reduces the severity of locking or eliminates it completely
for regular elements. Unfortunately, this is usually not true for elements
involving nonlinear coordinate transformation. For example, isoparametric
transformation introduces quadratic interpolation errors for the 8-node
quadrilateral element (Tong and Rossettos 1977). Barlow (1989) gave a
comprehensive analysis of errors due to shape distortions for 8-node
quadrilaterals and 20-node bricks.

As mentioned before, locking is the result of incorrect representation
of some, not all, strain components. However, there are locations within
the element where the values of strains are correct. Thus, evaluating the
incorrectly represented strain components only at the locations where their
values are correct could eliminate locking. This procedure is called
selectively reduced integration. If one reduces the number of integration
points for all strain components, the procedure is called uniformly
reduced integration. Uniformly reduced integration generally introduces
spurious zero energy modes, i.e., nonrigid body motions that contribute
zero energy, which will be discussed in the next section, while selectively
reduced integration may retain a sufficient number of evaluations of strain
components to prevent spurious modes. In both cases, reduced integration
reduces the number of evaluations in the integration.

Reduced integration for the 4-node quadrilateral was first introduced by
Doherty et al. (1969). More discussion of selectively reduced integration can
be found in MacNeal (1978), Malkus and Hughes (1978), Hughes (1980),
Belytschko et al. (1981, 1984). Hybrid and mixed formulations are
alternatives to circumvent locking. The full potential of the hybrid and
mixed approaches is yet to be explored. We shall illustrate shear and
dilatation locking by examples below and discuss locking-free elements.

Shear Locking. Consider a 4-node orthotropic rectangle ($-L \le x \le L$,
$-1 \le y \le 1$) subjected to bending, as shown in Fig. 18.20:1. A bending
displacement field is

(1)  $u = xy$,  $v = -x^2/2$,

which gives a constant curvature along the x-axis. The corresponding
strains, stresses, and strain energy per unit thickness for the plane stress
element are

$e_x = u_{,x} = y$,  $e_y = v_{,y} = 0$,  $\gamma_{xy} = u_{,y} + v_{,x} = 0$,

$\sigma_x = E_x y/(1 - \nu_{xy}\nu_{yx})$,  $\sigma_y = E_x \nu_{xy} y/(1 - \nu_{xy}\nu_{yx})$,  $\sigma_{xy} = 0$,

$W = \frac{1}{2}\int_{-1}^{1}\int_{-L}^{L} \boldsymbol{\sigma}^T \mathbf{e}\,dx\,dy = \frac{2E_x L}{3(1 - \nu_{xy}\nu_{yx})}$,

Fig. 18.20:1. Four-node rectangular element subjected to in-plane bending.

where $E_x$ is Young's modulus and $\nu_{xy}$, $\nu_{yx}$ are Poisson's
ratios. The 4-node finite element (bilinear) representation of the
displacement field is

(2)  $u_e = xy$,  $v_e = -L^2/2$.

One can verify that Eqs. (1) and (2) have the same displacements at the four
corner nodes. The finite element representation gives

$e_x = y$,  $e_y = 0$,  $\gamma_{xy} = x$,

$\sigma_x = \frac{E_x y}{1 - \nu_{xy}\nu_{yx}}$,  $\sigma_y = \frac{E_x \nu_{xy} y}{1 - \nu_{xy}\nu_{yx}}$,  $\sigma_{xy} = G_{xy} x$,

$W_e = \frac{1}{2}\int_{-1}^{1}\int_{-L}^{L} \boldsymbol{\sigma}^T \mathbf{e}\,dx\,dy = \frac{2E_x L}{3(1 - \nu_{xy}\nu_{yx})}\left[1 + \frac{G_{xy}}{E_x}(1 - \nu_{xy}\nu_{yx})L^2\right]$,

where $G_{xy}$ is the shear modulus. Obviously, the shear strain, the shear
stress and the strain energy per unit thickness are not correct. The ratio of
the strain energy of the finite element representation to the correct strain
energy is $1 + (G_{xy}/E_x)(1 - \nu_{xy}\nu_{yx})L^2$, which represents the
ratio of the finite element bending stiffness to the correct bending
stiffness. The ratio grows in proportion to $L^2$ and thus is large even for
a moderately slender element (moderately large L). It is also large if
$G_{xy}/E_x$ is large, even if the slenderness ratio is of order 1. The high
stiffness of the element comes mainly from the incorrect shear strain. This
phenomenon is called shear locking, which precludes the use of regular
3-dimensional displacement elements to represent thin plates or shells by
simply reducing the dimension of the 3-dimensional element in the thickness
direction. This is due to the characteristic of bending deformation that the
maximum displacement ratio, Max (v/u), is of the same order as the
slenderness ratio of the element. The situation is further exacerbated if
$G_{xy}/E_x$ is large for orthotropic materials. More on eliminating or
minimizing shear locking in plate and shell applications will be discussed
in the next chapter.

An effective remedy for shear locking is to use the value of $\gamma_{xy}$
at x = y = 0 for all integration points (Doherty et al. 1969), since at
x = y = 0, the finite element representation of shear is correct for such
deformation. This technique is called selectively reduced integration
because, in effect, we employ a single point to evaluate the shear strain
energy and the 2 x 2 or higher order Gauss integration rule to evaluate the
strain energy contribution from $e_x$ and $e_y$. A complication arises if the
shear strain is elastically coupled to the direct strains, i.e.,
$D_{13}, D_{23} \ne 0$, where D's are the components of the elastic modulus
matrix defined in Eq. (18.14:2). In evaluating
$(D_{13} e_x + D_{23} e_y)\gamma_{xy}$, one would have to use the shear
strain value at the center for those at the 2 x 2 Gauss points. Another
remedy is to use a nonconforming element by adding a deformation mode
proportional to $x^2$ (Bazeley et al. 1966, Taylor et al. 1976) in the
y-direction. One can also use the hybrid formulation to avoid the spurious
shear strain modes (Razzaque 1973).
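The energy ratio for the 4-node rectangle, and the effect of sampling the shear strain only at the element center, can be reproduced numerically. This is a minimal sketch assuming NumPy; the function name `energy_ratio` and its `shear_rule` argument are illustrative, and the strain fields are the ones quoted in the text for the bending field $u = xy$, $v = -x^2/2$.

```python
import numpy as np

def energy_ratio(L, Ex, Gxy, nu_xy, nu_yx, shear_rule="full"):
    """Ratio of finite element to exact strain energy for the rectangle |x| <= L, |y| <= 1.

    shear_rule="full":      shear strain gamma_xy = x sampled at the 2 x 2 Gauss points
    shear_rule="selective": shear strain replaced by its value at x = y = 0
    """
    c = Ex / (1.0 - nu_xy * nu_yx)
    pts, wts = np.polynomial.legendre.leggauss(2)
    W_exact = 0.0
    W_fe = 0.0
    for xi, wx in zip(pts, wts):
        for eta, wy in zip(pts, wts):
            x, y = L * xi, eta
            dA = wx * wy * L                     # Jacobian of x = L*xi, y = eta
            bending = 0.5 * c * y**2             # 0.5 * sigma_x * e_x, with e_y = 0
            gamma = x if shear_rule == "full" else 0.0
            W_exact += bending * dA
            W_fe += (bending + 0.5 * Gxy * gamma**2) * dA
    return W_fe / W_exact
```

With the full rule the ratio follows $1 + (G_{xy}/E_x)(1 - \nu_{xy}\nu_{yx})L^2$; with the shear strain taken at the center the ratio is one, so the spurious stiffness disappears for this deformation.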

Similarly, one can show that the 8-node rectangle also locks, but to a
much lesser degree. For the bending deformation

$u = x^2 y$,  $v = -x^3/3$,  $e_x = 2xy$,  $e_y = \gamma_{xy} = 0$

the finite element representation is

$u = x^2 y$,  $v = -L^2 x/3$,  $e_x = 2xy$,  $e_y = 0$,  $\gamma_{xy} = x^2 - L^2/3$.

The dominant locking mode is a result of the spurious shear strain. Since
the finite element representation of the shear strain is correct at the Gauss
stations ($\xi_i = \pm 1/\sqrt{3}$) of the 2 x 2 integration rule, the
problem can be corrected by using the 2 x 2 uniformly reduced integration
(Zienkiewicz et al. 1971) for all strains. The situation can be corrected
also by the 2 x 2 selectively reduced rule for the shear energy and the
3 x 3 integration rule for the bending energy. Note that uniformly reduced
integration introduces rank deficiency. If the shear strain couples
elastically with the tensile strains, one would have to extrapolate the shear
strain from the 2 x 2 Gauss points to the 3 x 3 Gauss points
($\xi_i = 0, \pm\sqrt{0.6}$). Unfortunately, the extrapolation makes the
element fail the patch test. However, it generally still provides good
results in many practical applications.

The 8-node brick element also locks when its dimension in one direction
is much less than the other two dimensions. Shear locking can be avoided
by using one-point integration for the shear strain energy. The constant
strain triangle experiences interpolation errors for all quadratic or higher
order displacements. As a result, it locks for in-plane bending deformation.
The constant strain triangle is of little practical use in such situations.
The 6-node triangular element does not lock for any quadratic deformation
because it can correctly interpolate such displacements. However, for
elements with curved sides, locking returns due to errors in the nonlinear
isoparametric transformation.



Fig. 18.20:2. Parallelogram element: (a) parallelogram; (b) isoparametric
square element.

Reduced integration often fails to fix the locking of general distorted
elements. For example, consider the parallelogram shown in Fig. 18.20:2
with the displacement field (MacNeal 1994)

(3)  $u = xy$,  $v = -x^2/2$.

The strains are $e_x = y$ and $e_y = \gamma_{xy} = 0$. The transformation to
the natural coordinates gives

$x = L\xi + \eta\tan\theta$,  $y = \eta$,

$u = L\xi\eta + \eta^2\tan\theta$,  $v = -(L^2\xi^2 + 2L\xi\eta\tan\theta + \eta^2\tan^2\theta)/2$.

The 4-node isoparametric finite element representation is

$u = L\xi\eta + \tan\theta$,  $v = -(L^2 + 2L\xi\eta\tan\theta + \tan^2\theta)/2$,

with the strain field

$e_x = \eta$,  $e_y = \tan\theta\,(-L\xi + \eta\tan\theta)$,  $\gamma_{xy} = L\xi - 2\eta\tan\theta$.

Clearly $\gamma_{xy}$ is in error and will lead to locking if L is large
compared to one. Meanwhile, the error in $e_y$ will also be significant and
contribute to locking if $L\tan\theta$ is large. The error in strains cannot
be eliminated by the selectively reduced integration of $\gamma_{xy}$ at
$\xi = \eta = 0$. On the other hand, if one evaluates all strains at the
origin to eliminate the error of $e_y$ and $\gamma_{xy}$, the element would
have no stiffness at all for in-plane bending. Similar results can be shown
for a 4-node trapezoid.
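The strain field of the 4-node parallelogram can be checked numerically by interpolating the nodal values of $u = xy$, $v = -x^2/2$ with the bilinear shape functions. This sketch assumes NumPy and a counterclockwise node ordering starting at $(\xi, \eta) = (-1, -1)$; the function name `q4_parallelogram_strains` is illustrative.

```python
import numpy as np

def q4_parallelogram_strains(L, theta, xi, eta):
    """Strains (e_x, e_y, gamma_xy) of the bilinear interpolant at the point (xi, eta)."""
    t = np.tan(theta)
    nat = np.array([[-1, -1], [1, -1], [1, 1], [-1, 1]], dtype=float)
    # corner coordinates from x = L*xi + eta*tan(theta), y = eta
    xy = np.column_stack([L * nat[:, 0] + nat[:, 1] * t, nat[:, 1]])
    u = xy[:, 0] * xy[:, 1]          # nodal values of u = x*y
    v = -0.5 * xy[:, 0] ** 2         # nodal values of v = -x^2/2
    dN = 0.25 * np.array([
        [-(1 - eta), (1 - eta), (1 + eta), -(1 + eta)],
        [-(1 - xi), -(1 + xi), (1 + xi), (1 - xi)]])
    dNdx = np.linalg.solve(dN @ xy, dN)   # shape function derivatives in (x, y)
    e_x = dNdx[0] @ u
    e_y = dNdx[1] @ v
    gamma_xy = dNdx[1] @ u + dNdx[0] @ v
    return e_x, e_y, gamma_xy
```

At any sampling point the returned values agree with $e_x = \eta$, $e_y = \tan\theta(-L\xi + \eta\tan\theta)$ and $\gamma_{xy} = L\xi - 2\eta\tan\theta$, and all three vanish at $\xi = \eta = 0$, which is why one-point evaluation of all strains leaves no bending stiffness.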

Mixed Formulation. An alternative to reduced integration is to use a
mixed formulation. To simplify the discussion, we shall consider isotropic
materials only. We separate the functional of Eq. (18.14:6) into three parts

(4)  $\Pi_e = \Pi_{de} + \Pi_{se} - \Pi_{fe}$,

where $\Pi_{de}$, $\Pi_{se}$, $\Pi_{fe}$ are the distortion energy, the shear
energy and the external work, respectively. For plane stress,

(4a)  $\Pi_{de} = \int_{A_e} \frac{E}{2(1 - \nu^2)}\left(u_{,x}^2 + v_{,y}^2 + 2\nu u_{,x} v_{,y}\right) dx\,dy$,

(4b)  $\Pi_{se} = \int_{A_e} \frac{G}{2}\left(u_{,y} + v_{,x}\right)^2 dx\,dy$,

(4c)  $\Pi_{fe} = \int_{A_e} (u b_x + v b_y)\,dx\,dy + \int_{\partial A_{\sigma e}} (u T_x + v T_y)\,ds$.

Consider a 4-node rectangular element of height 2l and length 2h. The
displacements can be written in terms of natural coordinates ($x = h\xi$,
$y = l\eta$) as

(5)  $u = a_1 + a_2\xi + a_3\eta + a_4\xi\eta$,  $v = b_1 + b_2\xi + b_3\eta + b_4\xi\eta$,

where

(6)  $a_1 = (u_1 + u_2 + u_3 + u_4)/4$,   $a_2 = (-u_1 + u_2 + u_3 - u_4)/4$,
     $a_3 = (-u_1 - u_2 + u_3 + u_4)/4$,  $a_4 = (u_1 - u_2 + u_3 - u_4)/4$,

in which u's are the nodal values of u. Similarly one can express b's in terms
of the nodal values of v. From the strain displacement relation, one finds

(7)  $\gamma_{xy} = \frac{a_3}{l} + \frac{b_2}{h} + \frac{a_4}{l}\xi + \frac{b_4}{h}\eta$.

The shear energy within the element is

(8)  $\Pi_{se} = 2Ghl\left[\left(\frac{a_3}{l} + \frac{b_2}{h}\right)^2 + \frac{a_4^2}{3l^2} + \frac{b_4^2}{3h^2}\right]$.

At the high aspect ratio ($h/l \to \infty$) limit, the shear energy
approaches zero. This is equivalent to forcing

(9a)  $a_3/l + b_2/h \to 0$,

(9b)  $a_4 \to 0$,  $b_4 \to 0$.

These three conditions together overconstrain the system and lead to shear
locking. Prathap (1993) argued that if a constraint, such as Eq. (9a),
involves coefficients from all related field variables, it can be correctly
enforced in the limit. The representation of the strains by those terms (like
$a_3$, $b_2$) is said to be field-consistent. If a constraint, such as
Eq. (9b), has no contribution from one or more related field variables, it
may incorrectly constrain the deformation and cause locking. Such terms are
called field-inconsistent representations.

We can remove the spurious constraints using the mixed or hybrid
formulation. Let us replace the shear energy $\Pi_{se}$ of Eq. (4b) by the
Reissner-Hellinger functional [Eq. (10.10:9)]

(4d)  $\Pi_{se} = \int_{A_e} G\left[\bar{\gamma}_{xy}(u_{,y} + v_{,x}) - \frac{1}{2}\bar{\gamma}_{xy}^2\right] dx\,dy$,

in which $\bar{\gamma}_{xy}$ is a new independent field variable for the
element. From the stationary condition of $\Pi_{se}$ with respect to
$\bar{\gamma}_{xy}$,

(10)  $\delta\Pi_{se} = \int_{A_e} G\{u_{,y} + v_{,x} - \bar{\gamma}_{xy}\}\,\delta\bar{\gamma}_{xy}\,dx\,dy = 0$,

we can establish the relation between $\bar{\gamma}_{xy}$ and
$u_{,y} + v_{,x}$. Judicious selection of the shear strain
$\bar{\gamma}_{xy}$ can remove the spurious constraint(s) and alleviate
locking. An element with $\bar{\gamma}_{xy}$ being field-consistent is called
a field-consistent element, otherwise a field-inconsistent element.

4-Node Field-Consistent Rectangular Element. For a 4-node rectangle,
assuming constant $\bar{\gamma}_{xy}$ within the element, from Eqs. (10) and
(6), one can show that

(11)  $\bar{\gamma}_{xy} = \frac{a_3}{l} + \frac{b_2}{h}$  or

      $\bar{\gamma}_{xy} = \frac{1}{4l}(-u_1 - u_2 + u_3 + u_4) + \frac{1}{4h}(-v_1 + v_2 + v_3 - v_4)$.

A substitution of $\bar{\gamma}_{xy}$ into Eq. (4d) yields

(12)  $\Pi_{se} = 2Ghl\left(\frac{-u_1 - u_2 + u_3 + u_4}{4l} + \frac{-v_1 + v_2 + v_3 - v_4}{4h}\right)^2$.

Using $\Pi_{se}$ above, together with $\Pi_{de}$, $\Pi_{fe}$ of Eqs. (4a)
and (4c), one can derive the element matrices. Equation (12) is equivalent to
evaluating the shear strain energy $\Pi_{se}$ from Eq. (4b) by one-point
integration. The modified element performs well for $h/l \gg 1$ in modeling
bending of thin beams. For stress calculation, one should use the
field-consistent $\bar{\gamma}_{xy}$ since the shear strain derived from the
assumed displacements [Eq. (7)] often shows rapid linear variation within an
element. Note that Wilson's 4-node incompatible rectangular element discussed
in Sec. 18.14 is field-consistent.
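The equivalence between Eq. (12) and one-point integration of Eq. (4b) can be verified with a short sketch. It assumes NumPy and a node numbering that starts at $(\xi, \eta) = (-1, -1)$ and runs counterclockwise, which is consistent with the signs in Eq. (11); the function names are illustrative.

```python
import numpy as np

def bilinear_coeffs(nodal):
    """Coefficients of the bilinear expansion of Eq. (5) from the four corner values."""
    n1, n2, n3, n4 = nodal
    return np.array([n1 + n2 + n3 + n4, -n1 + n2 + n3 - n4,
                     -n1 - n2 + n3 + n4, n1 - n2 + n3 - n4]) / 4.0

def shear_energy_displacement(u, v, h, l, G, npts):
    """Shear energy of Eq. (4b) evaluated with an npts x npts Gauss rule."""
    a, b = bilinear_coeffs(u), bilinear_coeffs(v)
    pts, wts = np.polynomial.legendre.leggauss(npts)
    total = 0.0
    for xi, wx in zip(pts, wts):
        for eta, wy in zip(pts, wts):
            # shear strain of Eq. (7) derived from the assumed displacements
            gamma = a[2] / l + b[1] / h + a[3] / l * xi + b[3] / h * eta
            total += 0.5 * G * gamma**2 * wx * wy * h * l   # dx dy = h*l dxi deta
    return total

def shear_energy_consistent(u, v, h, l, G):
    """Shear energy of Eq. (12), using the constant shear strain of Eq. (11)."""
    a, b = bilinear_coeffs(u), bilinear_coeffs(v)
    gamma_bar = a[2] / l + b[1] / h
    return 2.0 * G * h * l * gamma_bar**2
```

For arbitrary nodal values, `shear_energy_displacement(..., npts=1)` matches `shear_energy_consistent`, while the 2 x 2 rule adds the terms in $a_4$ and $b_4$ that produce the spurious constraints of Eq. (9b).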

Generally, field-inconsistent quadrilateral elements lock badly in modeling
flexure of thin beams. In this case, optimum field-consistency is yet to
be found. The modified version of Wilson's 4-node incompatible element
(Sec. 18.14) provides a satisfactory solution. Field-consistency does not
guarantee locking-free behavior, as the shear strains of the constant strain
triangle and tetrahedron are field-consistent and yet they still lock in thin
beam, plate and shell applications due to insufficient deformation modes.

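8-Node Field-Consistent Rectangular Element. For 8-node elements,
the displacements are

(13)  $u = a_1 + a_2\xi + a_3\eta + a_4\xi\eta + a_5\xi^2 + a_6\eta^2 + a_7\xi^2\eta + a_8\xi\eta^2$,

      $v = b_1 + b_2\xi + b_3\eta + b_4\xi\eta + b_5\xi^2 + b_6\eta^2 + b_7\xi^2\eta + b_8\xi\eta^2$,

where a's and b's can be expressed in terms of the nodal values of u and v,
respectively. The shear strain derived from the displacements is

(14)  $\gamma_{xy} = \frac{a_3}{l} + \frac{b_2}{h} + \left(\frac{a_4}{l} + \frac{2b_5}{h}\right)\xi + \left(\frac{2a_6}{l} + \frac{b_4}{h}\right)\eta + \frac{a_7}{l}\xi^2 + 2\left(\frac{a_8}{l} + \frac{b_7}{h}\right)\xi\eta + \frac{b_8}{h}\eta^2$.

The spurious constraints are

$a_7 \to 0$,  $b_8 \to 0$.

Expanding Eq. (14) in terms of the Legendre polynomials

(15)  $\gamma_{xy} = \frac{a_3}{l} + \frac{b_2}{h} + \frac{a_7}{3l} + \frac{b_8}{3h} + \left(\frac{a_4}{l} + \frac{2b_5}{h}\right)\xi + \left(\frac{2a_6}{l} + \frac{b_4}{h}\right)\eta + 2\left(\frac{a_8}{l} + \frac{b_7}{h}\right)\xi\eta + \frac{a_7}{l}\left(\xi^2 - \frac{1}{3}\right) + \frac{b_8}{h}\left(\eta^2 - \frac{1}{3}\right)$

and using its orthogonal properties, one can show that the field-consistent
shear strain is

(16)  $\bar{\gamma}_{xy} = \frac{a_3}{l} + \frac{b_2}{h} + \frac{a_7}{3l} + \frac{b_8}{3h} + \left(\frac{a_4}{l} + \frac{2b_5}{h}\right)\xi + \left(\frac{2a_6}{l} + \frac{b_4}{h}\right)\eta + 2\left(\frac{a_8}{l} + \frac{b_7}{h}\right)\xi\eta$.

Equation (16) is also the field-consistent shear strain of 9-node rectangular
elements.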

The shear stress should be calculated from $\bar{\gamma}_{xy}$.
Field-consistency usually gives smoother stresses in modeling flexure of thin
beams. The shear stress calculated from the assumed displacements often shows
rapid quadratic oscillation within an element (Prathap 1993).
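A quick numerical check, under the assumption that Eqs. (14) and (16) have the polynomial forms implied by Eq. (13) with $x = h\xi$, $y = l\eta$: the displacement-derived shear strain and the field-consistent shear strain coincide at the 2 x 2 Gauss stations, where $\xi^2 = \eta^2 = 1/3$. The function names are illustrative and NumPy is assumed.

```python
import numpy as np

def gamma_displacement(a, b, h, l, xi, eta):
    """Shear strain from the 8-node displacements; a[0] is a_1, b[0] is b_1."""
    return (a[2] / l + b[1] / h
            + (a[3] / l + 2 * b[4] / h) * xi
            + (2 * a[5] / l + b[3] / h) * eta
            + a[6] / l * xi**2
            + 2 * (a[7] / l + b[6] / h) * xi * eta
            + b[7] / h * eta**2)

def gamma_consistent(a, b, h, l, xi, eta):
    """Field-consistent shear strain: the quadratic terms are replaced by their means."""
    return (a[2] / l + b[1] / h + a[6] / (3 * l) + b[7] / (3 * h)
            + (a[3] / l + 2 * b[4] / h) * xi
            + (2 * a[5] / l + b[3] / h) * eta
            + 2 * (a[7] / l + b[6] / h) * xi * eta)
```

Because the two expressions agree at the four Gauss stations, sampling the displacement-derived shear strain with the 2 x 2 rule gives the same values as the field-consistent strain at those stations.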


8-Node Field-Consistent Brick Element (Fig. 18.15:2). In the 3-dimensional
linear theory of isotropic materials, the shear strain functionals are

(17a)  $\Pi_{se} = \int_{V_e} \frac{G}{2}\left(\gamma_{xy}^2 + \gamma_{yz}^2 + \gamma_{zx}^2\right) dx\,dy\,dz$,

(17b)  $\Pi_{se} = \int_{V_e} G\left[\left(\bar{\gamma}_{xy}\gamma_{xy} + \bar{\gamma}_{yz}\gamma_{yz} + \bar{\gamma}_{zx}\gamma_{zx}\right) - \frac{1}{2}\left(\bar{\gamma}_{xy}^2 + \bar{\gamma}_{yz}^2 + \bar{\gamma}_{zx}^2\right)\right] dx\,dy\,dz$,

for displacement and mixed formulations, respectively, where $\gamma$'s are
shear strains derived from the assumed displacements. For an 8-node brick
(2h x 2l x 2d), the trilinear displacements are

(18)  $u = a_1 + a_2\xi + a_3\eta + a_4\zeta + a_5\xi\eta + a_6\eta\zeta + a_7\zeta\xi + a_8\xi\eta\zeta$,
      $v = b_1 + b_2\xi + b_3\eta + \cdots$,  $w = c_1 + c_2\xi + c_3\eta + \cdots$,

where a's, b's and c's are related to the nodal values of u, v and w,
respectively. The shear strain $\gamma_{xy}$ derived from the displacements is

$\gamma_{xy} = \frac{a_3}{l} + \frac{b_2}{h} + \left(\frac{a_6}{l} + \frac{b_7}{h}\right)\zeta + \frac{a_5}{l}\xi + \frac{b_5}{h}\eta + \frac{a_8}{l}\xi\zeta + \frac{b_8}{h}\eta\zeta$.

In the limit of $\gamma_{xy} \to 0$ in bending applications to slender beams,
all polynomial coefficients vanish. Of all the constraints,

$a_5 \to 0$,  $a_8 \to 0$,  $b_5 \to 0$,  $b_8 \to 0$

are spurious and can cause locking. One can similarly obtain other spurious
constraints by considering $\gamma_{yz}$ and $\gamma_{zx}$. It can be shown
that the field-consistent shear strains are

(19)  $\bar{\gamma}_{xy} = \frac{a_3}{l} + \frac{b_2}{h} + \left(\frac{a_6}{l} + \frac{b_7}{h}\right)\zeta$,

      $\bar{\gamma}_{yz} = \frac{b_4}{d} + \frac{c_3}{l} + \left(\frac{b_7}{d} + \frac{c_5}{l}\right)\xi$,

      $\bar{\gamma}_{zx} = \frac{c_2}{h} + \frac{a_4}{d} + \left(\frac{c_5}{h} + \frac{a_6}{d}\right)\eta$.

Another alternative to reduce locking is to use incompatible elements.
Introducing bubble functions in the displacements, one can write

(20)  $u = a_1 + a_2\xi + a_3\eta + a_4\zeta + a_5\xi\eta + a_6\eta\zeta + a_7\zeta\xi + a_8\xi\eta\zeta$
         $+\ a_9(1 - \eta^2) + a_{10}(1 - \zeta^2) + a_{11}(1 - \xi^2)$

and similar expressions for v and w. The last three terms are added to
mitigate dilatation locking, to be discussed later. It can be shown that the
displacement in Eq. (20) is incompatible but the derived shear and dilatation
strains are field-consistent.


27-Node Brick Element. The 20-node brick element performs poorly in
slender beam, thin plate and thin shell applications. It is difficult to find
the field-consistent strains that will make the element robust in those
applications. For a 27-node brick as shown in Fig. 18.15:2, the displacements
are triquadratic functions of the natural coordinates $\xi$, $\eta$,
$\zeta$. The derived shear strains $\gamma$'s are also triquadratic. It can
be shown that the field-consistent shear strains are bilinear in $\xi$,
$\eta$ and quadratic in $\zeta$ for $\bar{\gamma}_{xy}$, bilinear in $\eta$,
$\zeta$ and quadratic in $\xi$ for $\bar{\gamma}_{yz}$, and so on. Similar to
Eq. (15), if $\gamma_{xy}$ is expanded in terms of the Legendre polynomials
of $\xi$, $\eta$, then, from the orthogonal properties of the polynomials,
$\bar{\gamma}_{xy}$ takes a form similar to Eq. (16), which involves the
zeroth and first order Legendre polynomials in $\xi$, $\eta$. Similarly one
can derive $\bar{\gamma}_{yz}$ and $\bar{\gamma}_{zx}$. The proof is left
for the readers.

Using the field-consistent $\bar{\gamma}$'s in Eq. (17b) is equivalent to
evaluating the shear strain energy from Eq. (17a) with the 2 x 2 x 3
integration rule for $\gamma_{xy}$, the 3 x 2 x 2 rule for $\gamma_{yz}$ and
so on. Again, stresses derived from the assumed displacements show rapid
quadratic oscillations within the element. Thus stresses should be calculated
from the field-consistent $\bar{\gamma}$'s at the nodes.

If the edge and surface nodes of the brick element (nodes 9 to 26,
Fig. 18.15:2) are not located at the middle of the edges or the center of the
sides, the isoparametric transformation of the coordinates involves
nonlinear mapping. The $\bar{\gamma}$'s or the integration rules derived in
the last paragraph are no longer field-consistent. Establishing proper
$\bar{\gamma}$'s becomes formidable. Using Eq. (17a) with the 2 x 2 x 2
integration rule for all $\gamma$'s is simple and practical, even at the risk
of introducing zero energy modes, which on some rare occasions may lead to
poor results. Stresses should be evaluated at the integration points and
extrapolated to nodes.

Locking is a result of interpolation error. Higher order elements usually
do not lock or lock to a lesser extent. Selectively reduced integration works
wonders in removing shear locking for rectangular elements but is ineffective
for general quadrilaterals, as isoparametric mapping introduces spurious
strain modes other than shear (MacNeal 1990). One may have to resort
to the field-consistent approach or incompatible elements to reduce locking.
The fact that the effectiveness of reduced integration is limited to
rectangles may not be serious in practice. Normally it is the spurious
out-of-plane shears that cause locking in bending of thin beams, plates and
shells. In practical applications, the sides normal to the midplane are
generally rectangular or nearly rectangular, for which shear locking can be
effectively eliminated by selectively reduced integration. We shall consider
the general distorted brick elements for thin shell applications in the next
chapter.

Problem 18.18. Consider a trapezoid as shown in the figure with the
displacements given in Eq. (18.20:1). The transformation of Cartesian
coordinates to the natural coordinates is

$$x = L\xi(1 - \alpha\eta), \qquad y = \eta, \qquad \text{where } 0 < \alpha < 1.$$

Fig. P18.17. Trapezoid. (The figure marks a side of length $2L(1+\alpha)$.)

(a) Determine the finite element representations of the displacement and strain
fields in terms of the natural coordinates. From the result, show that locking
cannot be eliminated by reduced integration. (b) Compute the ratio of the strain
energy of the finite element representation to the correct strain energy of the
element.


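Dilatation Locking (MacNeal 1994). The problem is commonly associated
with nearly incompressible materials, i.e. $\nu \to 1/2$. Consider the
in-plane bending of a rectangle with

(21)
$$
\begin{gathered}
u = xy, \qquad v = -\frac{x^2}{2} - \frac{\nu y^2}{2(1-\nu)}, \\
\epsilon_x = y, \qquad \epsilon_y = -\frac{\nu y}{1-\nu}, \qquad \gamma_{xy} = 0, \\
\sigma_x = \frac{Ey}{1-\nu^2}, \qquad \sigma_y = 0, \qquad \sigma_{xy} = 0, \\
W = \frac{1}{2}\int_{-1}^{1}\int_{-L}^{L} \boldsymbol{\sigma}^T\boldsymbol{\epsilon}\,dx\,dy = \frac{2EL}{3(1-\nu^2)}
\end{gathered}
$$

in plane strain. The dilatation is

$$e = \epsilon_x + \epsilon_y + \epsilon_z = \frac{(1-2\nu)\,y}{1-\nu},$$

which tends to zero as $\nu$ approaches 0.5.

For a 4-node rectangular element the deformation is represented by

$$
\begin{gathered}
u = xy, \qquad v = -\frac{L^2}{2} - \frac{\nu}{2(1-\nu)}, \\
\epsilon_x = y, \qquad \epsilon_y = 0, \qquad \gamma_{xy} = x, \\
\sigma_x = \frac{E(1-\nu)\,y}{(1+\nu)(1-2\nu)}, \qquad
\sigma_y = \frac{E\nu y}{(1+\nu)(1-2\nu)}, \qquad
\sigma_{xy} = \frac{Ex}{2(1+\nu)}, \\
e = \epsilon_x + \epsilon_y + \epsilon_z = y, \\
W_e = \frac{2EL}{3(1+\nu)}\left(\frac{1-\nu}{1-2\nu} + \frac{L^2}{2}\right),
\end{gathered}
$$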



in which the dilatation is independent of Poisson's ratio. This error in
dilatation magnifies the strain energy by a factor proportional to
$1/(1-2\nu)$, which is large as $\nu$ approaches 0.5. The error in the shear
strain also contributes to high stiffness if $L$ is large.
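
The two strain energies above can be compared directly. The following is a minimal Python sketch, assuming the rectangle $|x| \le L$, $|y| \le 1$ implied by the integration limits in Eq. (21). The function names are illustrative and do not come from the text.

```python
def exact_energy(E, nu, L):
    """Strain energy of Eq. (21): W = 2 E L / [3 (1 - nu^2)] (plane strain)."""
    return 2.0 * E * L / (3.0 * (1.0 - nu**2))


def element_energy(E, nu, L):
    """Strain energy of the 4-node rectangle representing the same bending."""
    return (2.0 * E * L / (3.0 * (1.0 + nu))
            * ((1.0 - nu) / (1.0 - 2.0 * nu) + L**2 / 2.0))


def locking_ratio(nu, L, E=1.0):
    """Ratio W_e / W, which reduces to (1 - nu) [(1 - nu)/(1 - 2 nu) + L^2/2]."""
    return element_energy(E, nu, L) / exact_energy(E, nu, L)


if __name__ == "__main__":
    for nu in (0.0, 0.3, 0.49, 0.499):
        print(f"nu = {nu:5.3f}   W_e / W = {locking_ratio(nu, L=1.0):10.3f}")
```

The ratio grows like $1/(1-2\nu)$ as $\nu \to 1/2$ (dilatation locking), while the $L^2/2$ term shows the additional stiffening from the spurious shear strain for long elements.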

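Nagtegaal et al. (1974) introduced a strain-projection scheme to treat
the nearly incompressible case. The strain-displacement matrix $\mathbf{B}$ as
defined in Eq. (18.16:12) can be written in the form

(22) $\mathbf{B} = [\,\mathbf{B}_1^{dev} + \mathbf{B}_1^{dil} \quad \mathbf{B}_2^{dev} + \mathbf{B}_2^{dil} \quad \cdots \quad \mathbf{B}_p^{dev} + \mathbf{B}_p^{dil}\,]$,

where $p$ is the number of nodes of the element and $\mathbf{B}_i^{dev}$,
$\mathbf{B}_i^{dil}$ denote, respectively, the deviatoric and dilatational
parts of $\mathbf{B}$ associated with node $i$, and are defined as

(23)
$$
\mathbf{B}_i^{dev} =
\begin{bmatrix}
\tfrac{2}{3}B_{xi} & -\tfrac{1}{3}B_{yi} & -\tfrac{1}{3}B_{zi} \\
-\tfrac{1}{3}B_{xi} & \tfrac{2}{3}B_{yi} & -\tfrac{1}{3}B_{zi} \\
-\tfrac{1}{3}B_{xi} & -\tfrac{1}{3}B_{yi} & \tfrac{2}{3}B_{zi} \\
B_{yi} & B_{xi} & 0 \\
0 & B_{zi} & B_{yi} \\
B_{zi} & 0 & B_{xi}
\end{bmatrix},
\qquad
\mathbf{B}_i^{dil} = \frac{1}{3}
\begin{bmatrix}
B_{xi} & B_{yi} & B_{zi} \\
B_{xi} & B_{yi} & B_{zi} \\
B_{xi} & B_{yi} & B_{zi} \\
0 & 0 & 0 \\
0 & 0 & 0 \\
0 & 0 & 0
\end{bmatrix},
$$

$$B_{\alpha i} = \frac{\partial h_i}{\partial \alpha}, \qquad \alpha = x, y, z,$$

with $h_i$ the shape function of node $i$.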


For nearly incompressible materials, one modifies the values of
$\mathbf{B}_i^{dil}$ by approximating it as a constant equal to its value at
the center of the element or its average value over the element. One can also
approximate $\mathbf{B}_i^{dil}$ by the function that interpolates the values
of $\mathbf{B}_i^{dil}$ at the Gauss points of a reduced integration rule.
Unfortunately, these techniques cannot eliminate dilatation locking
completely.
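
The split of Eq. (22) and the center-value projection described above can be sketched in a few lines of NumPy. This is an illustrative sketch only: the strain ordering $(\epsilon_x, \epsilon_y, \epsilon_z, \gamma_{xy}, \gamma_{yz}, \gamma_{zx})$ and the helper names `b_node`, `split_dev_dil` and `b_bar_node` are assumptions, not notation from the text.

```python
import numpy as np


def b_node(grad):
    """Standard 6 x 3 strain-displacement block of one node.

    grad = (dh/dx, dh/dy, dh/dz) of that node's shape function.
    """
    bx, by, bz = grad
    return np.array([[bx, 0.0, 0.0],
                     [0.0, by, 0.0],
                     [0.0, 0.0, bz],
                     [by, bx, 0.0],
                     [0.0, bz, by],
                     [bz, 0.0, bx]])


def split_dev_dil(grad):
    """Return (B_dev, B_dil) with B_dev + B_dil equal to b_node(grad)."""
    b_dil = np.zeros((6, 3))
    # Each normal strain row carries one third of the dilatation.
    b_dil[:3, :] = np.asarray(grad, dtype=float) / 3.0
    return b_node(grad) - b_dil, b_dil


def b_bar_node(grad, grad_center):
    """Deviatoric part at the current point, dilatational part at the center."""
    b_dev, _ = split_dev_dil(grad)
    _, b_dil_center = split_dev_dil(grad_center)
    return b_dev + b_dil_center
```

When `grad_center` equals `grad` the projected block reduces to the standard one, so the modification only acts where the dilatation varies within the element.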
An alternative to remove dilatation locking is to use the mixed or hybrid
formulation. Again we shall use isotropic materials to illustrate the idea.
Separating $\Pi_{de}$ of Eq. (4a) into two parts,

(24) $\Pi_{de} = \Pi_{ve} + \Pi_{\sigma e}$,

where $\Pi_{\sigma e}$, $\Pi_{ve}$ are the strain energies associated with the
deviatoric strains and the dilatation $e$, respectively, i.e.,

(24a) $\displaystyle \Pi_{\sigma e} = \frac{E}{2(1+\nu)}\int_{V_e}\left[\left(\epsilon_x - \tfrac{1}{3}e\right)^2 + \left(\epsilon_y - \tfrac{1}{3}e\right)^2 + \left(\epsilon_z - \tfrac{1}{3}e\right)^2\right] dx\,dy\,dz$,

(24b) $\displaystyle \Pi_{ve} = \frac{1}{2}\int_{V_e}\frac{E}{3(1-2\nu)}\,e^2\,dx\,dy\,dz$ (displacement model),

(24c) $\displaystyle \Pi_{ve} = \int_{V_e}\frac{E}{3(1-2\nu)}\left(\bar{e}\,e - \tfrac{1}{2}\bar{e}^2\right) dx\,dy\,dz$ (mixed model),

in which $e = u_{,x} + v_{,y} + w_{,z}$. In addition to $u$, $v$ and $w$,
$\bar{e}$ is now another independent field variable. From the stationary
condition of Eq. (24c) with respect to $\bar{e}$,

(25) $\displaystyle \int_{V_e}\frac{1}{3(1-2\nu)}\,\delta\bar{e}\,(e - \bar{e})\,dx\,dy\,dz = 0$,

we can determine $\bar{e}$.


8-Node Brick. For an 8-node element, from Eq. (18), we can calculate

$$e = \frac{a_2}{h} + \frac{b_3}{l} + \frac{c_4}{d} + \cdots + \frac{a_8}{h}\,\eta\zeta + \frac{b_8}{l}\,\zeta\xi + \frac{c_8}{d}\,\xi\eta.$$

As the material approaches incompressibility, all coefficients of the
equation above tend to zero. Following the field-consistent argument, we
conclude that, of all the constraints, only

$$\frac{a_2}{h} + \frac{b_3}{l} + \frac{c_4}{d} = 0$$

is genuine while all others are spurious. Assuming constant $\bar{e}$, from
Eq. (25), we find the field-consistent dilatation:

$$\bar{e} = \frac{a_2}{h} + \frac{b_3}{l} + \frac{c_4}{d},$$

which is in terms of the nodal values of $u$, $v$ and $w$. A substitution of
$\bar{e}$ into Eq. (24c) yields

$$\Pi_{ve} = \frac{1}{2}\,\frac{8Elhd}{3(1-2\nu)}\left(\frac{a_2}{h} + \frac{b_3}{l} + \frac{c_4}{d}\right)^2.$$

This is equivalent to evaluating the dilatation energy from Eq. (24b) by
one-point integration. Using $\Pi_{ve}$ above, together with
$\Pi_{\sigma e}$ of Eq. (24a) and $\Pi_{se}$ of Eq. (17a) or (17b), we can
establish the element stiffness matrix in terms of the nodal values of $u$,
$v$ and $w$.

This element is able to respond to nearly incompressible deformation. 
However, this simple dilatation is not sufficient to represent flexure action, 
which involves linear strain variation. A remedy is to introduce in the 
displacements incompatible bubble functions such as the last three terms 
in Eq. (20). The dilatation derived from the assumed displacements is field- 
consistent and can be used directly in Eq. (24b) to construct the element 
stiffness matrix. Such an element is free from dilatation locking and able 
to represent bending. 

27-Node Brick. The dilatation $e$ of a 27-node brick is a triquadratic
function of the natural coordinates. It can be shown that the constraints
associated with $\xi^2$, $\eta^2$ and $\zeta^2$ are spurious. Thus the
field-consistent dilatation is simply a trilinear function of $\xi$, $\eta$,
$\zeta$. Expanding $e$ in terms of the Legendre polynomials in $\xi$, $\eta$,
$\zeta$ and dropping all the terms proportional to $\xi^2 - 1/3$,
$\eta^2 - 1/3$ and $\zeta^2 - 1/3$ gives the field-consistent $\bar{e}$.

More on incompressible and nearly incompressible materials will be
discussed in Sec. 19.6. For distorted bricks, field-consistent dilatation is
yet to be determined.

Problem 18.19. Use the displacement field in Eq. (18.20:21) to examine the 
dilatation locking of a 4-node rectangular element in plane stress. 

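Problem 18.20. For 27-node $2h \times 2l \times 2d$ brick elements, the
displacements can be written in the form

$$u = \sum_{i,j,k=0}^{2} a_{ijk}\,\xi^i\eta^j\zeta^k, \qquad
v = \sum_{i,j,k=0}^{2} b_{ijk}\,\xi^i\eta^j\zeta^k, \qquad
w = \sum_{i,j,k=0}^{2} c_{ijk}\,\xi^i\eta^j\zeta^k,$$

where $a$'s, $b$'s, $c$'s are constants in terms of the nodal values of $u$,
$v$, $w$. Derive the field-consistent shear strains. Hint:

$$\sum_{i,j,k=0}^{2} a_{ijk}\,\xi^i\eta^j\zeta^k
= \sum_{j,k=0}^{2}\left[\left(a_{0jk} + \tfrac{1}{3}a_{2jk}\right) + a_{1jk}\,\xi + a_{2jk}\left(\xi^2 - \tfrac{1}{3}\right)\right]\eta^j\zeta^k = \cdots.$$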

18.21. SPURIOUS MODES IN REDUCED INTEGRATION 
(MacNeal 1994) 

MacNeal gave a working definition of reduced integration. If $p$ is the
degree of complete polynomials in the element shape functions, the Gauss
integration with $p + 1$ or more points in each direction is full and the
integration with $p$ points or less in a direction is reduced. Reduced order
of integration has many benefits. Obviously, it reduces computation time,
as the number of multiplications in the calculation of element matrices is
proportional to the number of integration points. In the last section, we
showed that reduced integration is a potential remedy for many locking
problems resulting from interpolation failure. Barlow (1968, 1976) noticed
that strains computed at the reduced-order Gauss points in rectangular
elements were more accurate than strains computed at other points.
Unfortunately, reduced integration has its shortcomings. For one, it is less
accurate and can increase the integration error. Also, more seriously, it
may introduce spurious zero energy deformation modes, which can destroy
the accuracy of the finite element analysis.

The spurious modes are the eigenvectors of the stiffness matrix associated
with zero eigenvalue but nonzero strains. They are distinguished from
a rigid body mode as the latter is a zero eigenvector associated with a zero
strain state. Since strains are evaluated at integration points, a non-zero
strain field will give zero strain energy if the strain is zero at those
points. Thus, the number of strain evaluations at all integration points
represents the number of constraints for the occurrence of spurious modes.
Let $S_e$ and $S_s$ be the number of strain evaluations and the number of
independent strain states provided by the nodal displacements, respectively.
Spurious modes occur if $S_e < S_s$. The difference between the two
quantities is the minimum number of spurious modes that exist for the
element; more can occur if the constraints are not all independent. $S_e$
equals the number of strain components times the total number of integration
points. In plane elasticity, there are 3 strains. We have

$$S_e = 3p^2,$$

for $p \times p$ integration and

$$S_e = 2(p+1)^2 + p^2,$$

for selectively reduced integration with $(p+1) \times (p+1)$ integration for
direct strains and $p \times p$ for shear. For 3-dimensional elasticity, the
number of strain components is 6 and $S_e = 6p^3$ for $p \times p \times p$
integration.

Meanwhile, $S_s$ equals the total number of degrees-of-freedom
minus the number of rigid body modes. The 8-node serendipity element in
plane elasticity has 16 degrees-of-freedom and 3 rigid body modes.
Therefore, $S_s = 13$. If we use the 2 × 2 reduced integration rule,
$S_e = 12$. Then, there will be at least 1 spurious zero energy mode.
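
The counting argument above is easy to automate. The sketch below is illustrative (the function names are not from the text); it returns the lower bound $S_s - S_e$ on the number of spurious modes for plane and 3-dimensional elements.

```python
def strain_evaluations(p, dim=2, selective=False):
    """S_e for Gauss integration with p points per direction."""
    if dim == 2:
        if selective:
            # (p+1) x (p+1) for the two direct strains, p x p for shear
            return 2 * (p + 1) ** 2 + p ** 2
        return 3 * p ** 2
    if dim == 3 and not selective:
        return 6 * p ** 3
    raise ValueError("unsupported combination of dim and selective")


def strain_states(n_nodes, dim=2):
    """S_s = degrees of freedom minus rigid body modes."""
    rigid_body_modes = 3 if dim == 2 else 6
    return dim * n_nodes - rigid_body_modes


def min_spurious_modes(n_nodes, p, dim=2, selective=False):
    """Lower bound S_s - S_e on the number of spurious modes (never negative)."""
    return max(0, strain_states(n_nodes, dim) - strain_evaluations(p, dim, selective))


if __name__ == "__main__":
    print(min_spurious_modes(8, 2))          # 8-node serendipity, 2 x 2 rule
    print(min_spurious_modes(4, 1))          # 4-node quadrilateral, one point
    print(min_spurious_modes(27, 2, dim=3))  # 27-node brick, 2 x 2 x 2 rule
```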

Spurious Modes in Quadrilateral Elements. For a 4-node plane
quadrilateral element, $S_s = 5\ (= 8 - 3)$. The 2 × 2 Gauss integration
provides 12 strain evaluations so that there will be no spurious modes in
the element. On the other hand, $S_e = 3$ for one-point integration. Thus,
the element will have at least two spurious modes, which are $u = \xi\eta$,
$v = 0$ and $u = 0$, $v = \xi\eta$. Similarly, one can show that, for an
8-node brick with one integration point, there are 12 spurious modes of which
one of the displacements $u$, $v$, or $w$ equals $\xi\eta$, $\eta\zeta$,
$\zeta\xi$, or $\xi\eta\zeta$ while others are zero.

We can find the spurious modes for higher order Lagrange elements under
the $p \times p$ Gauss integration rule. Consider a 2-dimensional
displacement field in the form

(1) $u = f_p(\xi)\,f_p(\eta), \qquad v = 0$

for a $(p+1)^2$-node quadrilateral element, where $f_p(\xi)$ is a Legendre
polynomial of degree $p$, which vanishes at the $p$ Gauss integration points.
The displacements $u$ and $v$ vanish at all the Gauss points for
$p \times p$ integration. So do the corresponding strains

$$
\begin{aligned}
\epsilon_x &= u_{,x} = f_p'(\xi)f_p(\eta)\,\xi_{,x} + f_p(\xi)f_p'(\eta)\,\eta_{,x}, \\
\epsilon_y &= v_{,y} = 0, \\
\gamma_{xy} &= u_{,y} + v_{,x} = f_p'(\xi)f_p(\eta)\,\xi_{,y} + f_p(\xi)f_p'(\eta)\,\eta_{,y}.
\end{aligned}
$$

The displacement field given in Eq. (1) is a spurious mode. The other
spurious mode is given by the displacements of Eq. (1) with $u$ and $v$
interchanged.
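
The vanishing of these strains at the Gauss points can be checked numerically. The sketch below is an illustration under stated assumptions: the element is taken as a $2a \times 2b$ rectangle, so that $\xi_{,x} = 1/a$, $\eta_{,y} = 1/b$ and $\xi_{,y} = \eta_{,x} = 0$, and the function name is hypothetical.

```python
import numpy as np
from numpy.polynomial import legendre


def type1_mode_strains(p, a=1.0, b=1.0):
    """Strains of u = P_p(xi) P_p(eta), v = 0 at the p x p Gauss points.

    Returns an array with one row (e_x, e_y, g_xy) per integration point.
    """
    pts, _ = legendre.leggauss(p)      # Gauss points are the roots of P_p
    f = legendre.Legendre.basis(p)     # Legendre polynomial of degree p
    df = f.deriv()
    rows = []
    for xi in pts:
        for eta in pts:
            e_x = df(xi) * f(eta) / a      # u_,x
            e_y = 0.0                      # v_,y with v = 0
            g_xy = f(xi) * df(eta) / b     # u_,y + v_,x
            rows.append((e_x, e_y, g_xy))
    return np.array(rows)


if __name__ == "__main__":
    for p in (1, 2, 3):
        print(p, np.max(np.abs(type1_mode_strains(p))))
```

Every strain entry contains a factor $f_p$ evaluated at a Gauss point, which is why the sampled strain energy is zero although the strain field itself is not.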

Another type of spurious mode is given by

(2)
$$
\begin{aligned}
u &= f_p'(\xi)f_p(\eta)\,\xi_{,x} - f_p(\xi)f_p'(\eta)\,\eta_{,x}, \\
v &= f_p'(\xi)f_p(\eta)\,\xi_{,y} - f_p(\xi)f_p'(\eta)\,\eta_{,y},
\end{aligned}
$$

provided that the isoparametric mapping is linear, i.e., $\xi_{,x}$,
$\xi_{,y}$, $\eta_{,x}$, $\eta_{,y}$ are constant. This is the case of a
rectangle or parallelogram. For example, for an 8-node ($p = 2$) rectangle,
$\xi_{,y} = \eta_{,x} = 0$ and $\xi_{,x}$, $\eta_{,y}$ are constant. The
spurious mode is

$$u = \xi\left(\eta^2 - \tfrac{1}{3}\right)\xi_{,x}, \qquad v = -\eta\left(\xi^2 - \tfrac{1}{3}\right)\eta_{,y},$$

where the Gauss integration points are $\xi, \eta = \pm 1/\sqrt{3}$.

Spurious Modes in Hexahedral Elements. For $(p+1)^3$-node Lagrange
hexahedral elements in 3-dimensional elasticity, one type of spurious modes
of $p \times p \times p$ reduced integration is in the form

$$
\begin{aligned}
u ={}& f_p(\xi)f_p(\eta)\left(a_1 + \cdots + a_{p+1}\zeta^{p}\right)
 + f_p(\eta)f_p(\zeta)\left(a_{p+2} + \cdots + a_{2p+1}\xi^{p-1}\right) \\
 &+ f_p(\zeta)f_p(\xi)\left(a_{2p+2} + \cdots + a_{3p+1}\eta^{p-1}\right), \\
v ={}& 0, \qquad w = 0,
\end{aligned}
$$

where the isoparametric mapping is linear and $a$'s are constants. The other
types of spurious mode are given by

$$
\begin{aligned}
u &= \left[f_p'(\xi)f_p(\eta)\,\xi_{,x} - f_p(\xi)f_p'(\eta)\,\eta_{,x}\right]\left(a_{3p+2} + \cdots + a_{4p+1}\zeta^{p-1}\right), \\
v &= \left[f_p'(\xi)f_p(\eta)\,\xi_{,y} - f_p(\xi)f_p'(\eta)\,\eta_{,y}\right]\left(a_{3p+2} + \cdots + a_{4p+1}\zeta^{p-1}\right), \\
w &= 0.
\end{aligned}
$$

One can obtain other spurious modes by interchanging $u$, $\xi$ with $v$,
$\eta$, respectively, or $u$, $\xi$ with $w$, $\zeta$. For $p = 2$, the
27-node brick element, there are 21 spurious modes of the first type and 6 of
the second type.

The determination of spurious zero energy modes resulting from reduced
integration for triangular elements and elements involving nonlinear
isoparametric mapping (the Jacobian of transformation is not constant) is
not trivial. One can always determine the spurious modes numerically from
the zero eigenvectors of the element stiffness matrix.
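
As an illustration of the numerical route, the sketch below builds the plane-stress stiffness matrix of a 4-node square with one-point and with 2 × 2 Gauss integration and extracts the zero-energy eigenvectors. It is a minimal example under stated assumptions: the element is the square $-1 \le x, y \le 1$ (so $x = \xi$, $y = \eta$), degrees of freedom are ordered $(u_1, v_1, \ldots, u_4, v_4)$, and the function names are hypothetical.

```python
import numpy as np
from numpy.polynomial.legendre import leggauss

XI_N = np.array([-1.0, 1.0, 1.0, -1.0])    # nodal xi coordinates
ETA_N = np.array([-1.0, -1.0, 1.0, 1.0])   # nodal eta coordinates


def q4_stiffness(n_gauss, E=1.0, nu=0.3):
    """Plane-stress stiffness of the square element, n_gauss points per direction."""
    D = E / (1.0 - nu**2) * np.array([[1.0, nu, 0.0],
                                      [nu, 1.0, 0.0],
                                      [0.0, 0.0, (1.0 - nu) / 2.0]])
    pts, wts = leggauss(n_gauss)
    K = np.zeros((8, 8))
    for xi, wx in zip(pts, wts):
        for eta, wy in zip(pts, wts):
            dNdx = XI_N * (1.0 + ETA_N * eta) / 4.0
            dNdy = ETA_N * (1.0 + XI_N * xi) / 4.0
            B = np.zeros((3, 8))
            B[0, 0::2] = dNdx          # e_x  = u_,x
            B[1, 1::2] = dNdy          # e_y  = v_,y
            B[2, 0::2] = dNdy          # g_xy = u_,y + v_,x
            B[2, 1::2] = dNdx
            K += wx * wy * (B.T @ D @ B)
    return K


def zero_energy_modes(K, rel_tol=1e-9):
    """Columns are eigenvectors whose eigenvalues are numerically zero."""
    lam, vec = np.linalg.eigh(K)
    return vec[:, np.abs(lam) < rel_tol * lam.max()]


if __name__ == "__main__":
    for n in (1, 2):
        n_zero = zero_energy_modes(q4_stiffness(n)).shape[1]
        print(f"{n} x {n} rule: {n_zero} zero-energy modes, {n_zero - 3} spurious")
```

Subtracting the three rigid body modes from the count of zero eigenvalues gives the number of spurious modes; the zero eigenvectors span both kinds, so the rigid body part must be projected out to isolate the spurious shapes.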

If a spurious mode of a single element exists also in the assembly of
elements, it is called a communicating spurious mode. If the same type of
spurious mode exists over the entire domain, then it is a global spurious
mode. On the other hand, if the same mode occurs over only part of the
domain, it is called a local mode. If the spurious mode exists only in the
single element, the mode is non-communicating. A communicating spurious
mode is a serious flaw in practical applications. If it exists after boundary
constraints are imposed, the global stiffness matrix is singular and the
finite element solution is not unique or is very sensitive to small changes
in loading or boundary condition. It can render the solution useless (Irons
and Hellen 1978, Bićanić and Hinton 1979). MacNeal (1994) showed that many of
the type 1 spurious modes above are communicating. Those in plane strain
or plane stress are global while those in three dimensions are local. On
the other hand, type 2 spurious modes are generally non-communicating.
There is at least one exception: a beam modeled by reduced integration
of 20-node brick elements with displacements normal to the beam. There
are as many spurious modes as the number of elements plus or minus one,
depending on end conditions.



Fig. 18.21:1. Propagation of spurious mode from reduced-integration elements.
(Surviving panel label: (b) 3 × 3 full integration.)


Spurious modes can also cause errors when there is a mixture of stiff
and soft materials even when, strictly speaking, a global spurious mode
does not exist. Consider the example of a stiff block on the top of a soft
foundation shown in Fig. 18.21:1 (Zienkiewicz and Taylor, volume 1, 1989).
The use of 2 × 2 quadrature throughout leads to the answer shown in
Fig. 18.21:1(a) while the result for 3 × 3 quadrature given in
Fig. 18.21:1(b) is an appropriate solution. Even though no zero-energy mode
exists for the assembled system, the low resistance of the material in the
foundation allows the penetration of spurious deformation into the soft
elements.

We have given a broad coverage of the displacement model of the finite
element method. The discussion focuses primarily on problems governed
by linear second order differential equations. In the remaining chapters,
we will examine mixed and hybrid finite element models, plates and shells
(problems involving fourth order differential equations), and nonlinear
problems.

18.22. PERSPECTIVE 

Since its introduction in the early 1960s, the finite element method
has become one of the most important numerical methods for analysis and
design. The method has been widely applied to all fields of engineering,
science and medicine. The mathematical foundation of the method has been
well established and understood. Three-node triangular and 4-node
quadrilateral membrane elements were first introduced because of their
simplicity. The problem of locking in applications to bending of thin plates
and shells and to incompressible or nearly incompressible materials was soon
discovered. Reduced integration was introduced to alleviate the problem.
Unfortunately this technique also brings with it zero energy deformation
mode(s), which often require specialized schemes to stabilize the stiffness
matrix. Mixed and hybrid methods, such as the field-consistent approach,
originally developed for circumventing the difficulty in meeting the $C^1$
continuity requirement for fourth order differential equations, are
alternatives for mitigating locking and require further development.

In spite of the advent of computer technology, computational efficiency 
is still a consideration in engineering applications. Adaptive methods for 
remeshing and for the analysis of strain localization are needed. Simple and 
efficient elements are still essential to computational efficiency and accuracy 
especially in thin plate and shell applications. The quest for such perfect 
elements continues. 

The greatest future of the finite element method lies in its applications.
In the areas of metal forming and bioengineering, nonlinear material
behaviors and large deformations are the essence. The material and thin-film
technologies in microelectronics necessitate computational modeling at
the atomic level. Materials can be tailored to optimize their strength,
stiffness, fracture toughness and acoustic characteristics. The meso- (atomic
level) and micro-mechanics of composite materials will make the finite
element method indispensable in material development. Applications of
smart materials in sensors and actuators increasingly require precise
knowledge of material performance and reliability as miniaturization of
these devices continues. Material modeling to examine slip planes and
effects of grain boundaries of multiple crystal systems would be required.
Material failure is often associated with local material instability or
strain localization, including material separation or void growth. The finite
element method is again indispensable in simulation and prediction of such
phenomena.
So far we have considered mostly the displacement type of finite element
formulation. There are other types of formulation that demand less stringent
admissibility requirements on continuity. Among them are the mixed and
hybrid formulations, which provide flexibility to deal with problems
involving singularities, infinite domain, and higher order differential
equations. The displacement formulation is based on Eq. (18.14:6) or
Eq. (18.16:10) and is an irreducible formulation, whereas the mixed and
hybrid finite element formulations are based on the mixed and hybrid
variational principles, which use multiple fields as dependent variables (see
Sec. 10.3). An irreducible formulation is one for which none of the field
variables can be eliminated without solving the differential equations or the
weak form of equations. The hybrid formulation allows some field variables
defined only along the element boundaries to be different from those within
the element. The hybrid method is therefore most versatile. R. H. Gallagher
proposed in 1981 to restrict the definition of the hybrid method to one which
is formulated by multi-variable variational principles, yet whose resulting
matrix equations contain only nodal values of displacements as unknowns. Pian
(1995, 1998) surveyed and assessed the advances in the hybrid/mixed finite
element method.

19.1. MIXED FORMULATIONS 

We can establish the finite element equations using the Reissner-Hellinger
principle as given in Eqs. (10.10:8) and (10.10:9), which employs
both stress and displacement as independent fields. We shall illustrate the
derivation by considering problems of plane state. For plane stress or plane
strain, the functional $\Pi_{Re}$ of an element can be written in the form:

(1) ▲ $\displaystyle \Pi_{Re} = \int_{A_e}\left(\boldsymbol{\sigma}^T\boldsymbol{\epsilon} - \tfrac{1}{2}\boldsymbol{\sigma}^T\mathbf{C}_e\boldsymbol{\sigma} - \mathbf{u}^T\mathbf{b}\right)dA - \int_{\partial A_{\sigma e}}\mathbf{u}^T\bar{\mathbf{T}}\,ds$,

in which $\boldsymbol{\epsilon}$ is the engineering strain matrix (not tensor,
see comments in Sec. 18.14) in terms of the displacement $\mathbf{u}$,
$\boldsymbol{\sigma}$ is the stress matrix (not tensor), $\mathbf{b}$ is the
body force per unit volume, $\bar{\mathbf{T}}$ is the prescribed boundary
traction (as in the last chapter, we use capital T to denote traction),
$\mathbf{C}_e$ is the elastic flexibility matrix, the inverse of the elastic
modulus matrix $\mathbf{D}_e$ defined in Eq. (18.14:2), $A_e$ is the area of
the element, and $\partial A_e$ is the boundary of $A_e$. For linearized
theory of elasticity, $\mathbf{u}$, $\boldsymbol{\sigma}$,
$\boldsymbol{\epsilon}$ are given in Eq. (18.14:1) for plane stress and
strain, and in Eqs. (18.16:1)-(18.16:3) for 3-dimensional problems.

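The functional for 3-dimensional elasticity is similar except that the
area (over $A_e$) and the line (over $\partial A_{\sigma e}$) integrals in
Eq. (1) are replaced by the volume (over $V_e$) and the surface (over
$\partial V_{\sigma e}$) integrals, respectively. The variational functional
for axisymmetric problems is similar. If the material is isotropic, then

$$
\mathbf{C}_e = \frac{1}{E}
\begin{bmatrix}
1 & -\nu & 0 \\
-\nu & 1 & 0 \\
0 & 0 & 2(1+\nu)
\end{bmatrix}
\quad \text{(for plane stress)},
$$

$$
\mathbf{C}_e = \frac{1+\nu}{E}
\begin{bmatrix}
1-\nu & -\nu & 0 \\
-\nu & 1-\nu & 0 \\
0 & 0 & 2
\end{bmatrix}
\quad \text{(for plane strain)},
$$

$$
\mathbf{C}_e = \frac{1}{E}
\begin{bmatrix}
1 & -\nu & -\nu & 0 & 0 & 0 \\
-\nu & 1 & -\nu & 0 & 0 & 0 \\
-\nu & -\nu & 1 & 0 & 0 & 0 \\
0 & 0 & 0 & 2(1+\nu) & 0 & 0 \\
0 & 0 & 0 & 0 & 2(1+\nu) & 0 \\
0 & 0 & 0 & 0 & 0 & 2(1+\nu)
\end{bmatrix}
\quad \text{(3-D elasticity)}.
$$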
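
The flexibility matrices can be cross-checked numerically against the familiar modulus matrices. The sketch below is illustrative: the function names are hypothetical, and the plane-strain modulus matrix used for comparison is the standard textbook form, assumed here to agree with Eq. (18.14:2).

```python
import numpy as np


def flexibility_plane_stress(E, nu):
    return np.array([[1.0, -nu, 0.0],
                     [-nu, 1.0, 0.0],
                     [0.0, 0.0, 2.0 * (1.0 + nu)]]) / E


def flexibility_plane_strain(E, nu):
    return (1.0 + nu) / E * np.array([[1.0 - nu, -nu, 0.0],
                                      [-nu, 1.0 - nu, 0.0],
                                      [0.0, 0.0, 2.0]])


def flexibility_3d(E, nu):
    C = np.zeros((6, 6))
    C[:3, :3] = -nu                            # Poisson coupling of normal strains
    C[np.arange(3), np.arange(3)] = 1.0
    C[3:, 3:] = 2.0 * (1.0 + nu) * np.eye(3)   # engineering shear strains
    return C / E


def modulus_plane_strain(E, nu):
    """Standard plane-strain modulus matrix, used only as a cross-check."""
    return E / ((1.0 + nu) * (1.0 - 2.0 * nu)) * np.array(
        [[1.0 - nu, nu, 0.0],
         [nu, 1.0 - nu, 0.0],
         [0.0, 0.0, (1.0 - 2.0 * nu) / 2.0]])


if __name__ == "__main__":
    E, nu = 200.0, 0.3
    print(flexibility_plane_strain(E, nu) @ modulus_plane_strain(E, nu))
```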


The displacement boundary condition given in Eq. (18.14:4) or (10.10:3)
is a rigid condition whereas the traction condition given in Eq. (18.14:5) is
natural. Using the stationary condition of the variational functional, one
can derive the equilibrium equations in terms of stresses, the constitutive
equations relating strains to stresses, and the traction conditions on
$\partial A_{\sigma e}$ as those given in Eqs. (10.10:5)-(10.10:7) in index
notation.

In finite element formulation, both the stresses and the displacements
are approximated in the usual manner by interpolating their nodal parameters
with appropriate shape functions. The admissibility requirements are
that either the stresses or the displacements are $C^0$ continuous and that
the displacements satisfy the rigid condition(s). The use of $C^0$ continuous
stresses may result in higher accuracy than possible with the displacement
formulation previously discussed (Zienkiewicz et al. 1985, 1985a). A poor
choice of shape functions may lead to poor results. Babuška (1971) and
Brezzi (1974) formulated a complex criterion for the choice of dependent
variables, known as the LBB condition. Tong and Pian (1969) argued in
favor of requiring that the assumed stresses and the strains derived from the
assumed displacements have the same level of approximation. Fraeijs de
Veubeke (1964, 1965) and Fraeijs de Veubeke and Zienkiewicz (1967) showed
that if the approximation for stresses is capable of reproducing precisely
the same type of variation as that determinable from the displacement model,
then the irreducible and mixed formulations will yield identical answers. In
many cases, the mixed method can provide superior results for stresses by
requiring continuity of traction between elements.

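In mixed formulation, we assume the displacement in the usual manner
by the matrix equation $\mathbf{u} = \mathbf{h}\mathbf{q}$ and derive the
matrix equation for the strain $\boldsymbol{\epsilon} = \mathbf{B}\mathbf{q}$.
We also independently assume the stresses in the form

$$\boldsymbol{\sigma} = \mathbf{h}_\sigma\,\boldsymbol{\sigma},$$

where $\boldsymbol{\sigma}$ on the right-hand side is the unknown parameter
column matrix for the stress and $\mathbf{h}_\sigma$ is the corresponding
shape function matrix, which is in general different from $\mathbf{h}$.
Discontinuous $\mathbf{u}$ between elements is allowed if the assumed
$\boldsymbol{\sigma}$ is $C^0$ continuous. In terms of $\mathbf{q}$ and
$\boldsymbol{\sigma}$, $\Pi_{Re}$ becomes

(2) ▲ $\displaystyle \Pi_{Re} = \boldsymbol{\sigma}^T\mathbf{k}_{\sigma u}\mathbf{q} - \tfrac{1}{2}\boldsymbol{\sigma}^T\mathbf{k}_\sigma\boldsymbol{\sigma} - \mathbf{q}^T\mathbf{f}$,

where

(3) ▲ $\displaystyle \mathbf{k}_\sigma = \int_{A_e}\mathbf{h}_\sigma^T\mathbf{C}_e\mathbf{h}_\sigma\,dA, \qquad \mathbf{k}_{\sigma u} = \int_{A_e}\mathbf{h}_\sigma^T\mathbf{B}\,dA, \qquad \mathbf{f} = \int_{A_e}\mathbf{h}^T\mathbf{b}\,dA + \int_{\partial A_{\sigma e}}\mathbf{h}^T\bar{\mathbf{T}}\,ds.$

In these formulas, all products are matrix products. Note that
$\mathbf{k}_\sigma$ is symmetric and positive definite and that $\mathbf{f}$
is in the same form as Eq. (18.14:14). One can assemble $\mathbf{k}_\sigma$,
$\mathbf{k}_{\sigma u}$, $\mathbf{f}$ and enforce the displacement
constraint(s) at the boundaries to derive the matrices of the entire system
for finite element solution.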

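The system of algebraic equations for all unknown parameters is derived
from the stationary condition of $\Pi_R$ defined in Eq. (10.10:8) with
appropriate constraints and symbolically can be written as

$$
\begin{bmatrix}
-\sum\mathbf{k}_\sigma & \sum\mathbf{k}_{\sigma u} \\
\sum\mathbf{k}_{\sigma u}^T & \mathbf{0}
\end{bmatrix}
\begin{Bmatrix}\boldsymbol{\sigma} \\ \mathbf{q}\end{Bmatrix}
=
\begin{Bmatrix}\mathbf{0} \\ \sum\mathbf{f}\end{Bmatrix},
$$

where the summation denotes assembling of all elements. We borrow the
terminology of the displacement model and call the matrix

$$
\begin{bmatrix}
-\sum\mathbf{k}_\sigma & \sum\mathbf{k}_{\sigma u} \\
\sum\mathbf{k}_{\sigma u}^T & \mathbf{0}
\end{bmatrix}
$$

the assembled or constrained stiffness matrix.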

With the added independent field $\boldsymbol{\sigma}$, generally there will
be more algebraic equations. Though the assembled matrix is still symmetric,
unlike the displacement model, it is no longer positive definite as all
diagonal terms associated with displacement degrees-of-freedom are zero.
These terms will become nonzero in the process of factorization or Gauss
elimination if the unknowns are properly ordered and if the constrained
stiffness matrix is nonsingular.

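If $\boldsymbol{\sigma}$ is chosen independently for each element with the
displacement continuous, $\boldsymbol{\sigma}$ can be eliminated from the
element by the stationary condition of $\Pi_{Re}$ with respect to
$\boldsymbol{\sigma}$,

$$\mathbf{k}_\sigma\boldsymbol{\sigma} - \mathbf{k}_{\sigma u}\mathbf{q} = \mathbf{0}, \qquad \text{or} \qquad \boldsymbol{\sigma} = \mathbf{k}_\sigma^{-1}\mathbf{k}_{\sigma u}\mathbf{q}.$$

Note that the matrix $\mathbf{k}_\sigma$ is symmetric and positive definite.
Substituting $\boldsymbol{\sigma}$ above into Eq. (2) leads to the typical
element functional,

(4) $\Pi_{Re} = \tfrac{1}{2}\mathbf{q}^T\mathbf{k}\mathbf{q} - \mathbf{q}^T\mathbf{f}$,

where

(5) $\mathbf{k} = \mathbf{k}_{\sigma u}^T\,\mathbf{k}_\sigma^{-1}\,\mathbf{k}_{\sigma u}$

is the deduced element stiffness matrix. One can then assemble the element
matrices in the usual manner.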
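
The equivalence between the full two-field system and the condensed form of Eqs. (4) and (5) can be illustrated with small matrices. The numbers below are arbitrary illustrative data, not taken from any particular element, and the function names are hypothetical.

```python
import numpy as np

# Illustrative data: k_sigma symmetric positive definite, k_sigma_u of full column rank
K_SIGMA = np.array([[4.0, 1.0, 0.0],
                    [1.0, 3.0, 1.0],
                    [0.0, 1.0, 2.0]])
K_SIGMA_U = np.array([[1.0, 0.0],
                      [2.0, 1.0],
                      [0.0, 3.0]])
F = np.array([1.0, -2.0])


def solve_full(k_s, k_su, f):
    """Solve [[-k_s, k_su], [k_su^T, 0]] {sigma, q} = {0, f}."""
    n_s, n_q = k_su.shape
    K = np.block([[-k_s, k_su], [k_su.T, np.zeros((n_q, n_q))]])
    rhs = np.concatenate([np.zeros(n_s), f])
    sol = np.linalg.solve(K, rhs)
    return K, sol[:n_s], sol[n_s:]


def solve_condensed(k_s, k_su, f):
    """Eliminate sigma first: k = k_su^T k_s^{-1} k_su, then k q = f."""
    k = k_su.T @ np.linalg.solve(k_s, k_su)
    q = np.linalg.solve(k, f)
    sigma = np.linalg.solve(k_s, k_su @ q)
    return k, sigma, q


if __name__ == "__main__":
    K, sigma_full, q_full = solve_full(K_SIGMA, K_SIGMA_U, F)
    k, sigma_c, q_c = solve_condensed(K_SIGMA, K_SIGMA_U, F)
    print("eigenvalues of the full matrix:", np.linalg.eigvalsh(K))
    print("q (full)     :", q_full)
    print("q (condensed):", q_c)
```

The full matrix has eigenvalues of both signs, which is the indefiniteness noted in the text, while the condensed matrix is symmetric and, for this data, positive definite.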

Obviously the deduced $\mathbf{k}$ is symmetric. Its rank depends on the form
and number of assumed stress and displacement modes. Let $n_\sigma$ and $n_d$
be the number of unknown stress and displacement parameters, and $r$ the
number of rigid body modes associated with the shape functions $\mathbf{h}$.
Then

(6) rank of $\mathbf{k} \le \min(n_\sigma,\ n_d - r)$.

The rank of $\mathbf{k}$ will be deficient and there will be spurious
deformation mode(s) if $n_\sigma < n_d - r$.
19.2. HYBRID FORMULATIONS

A special mixed formulation is the hybrid finite element method, which
uses different field variables in different parts of the domain. The hybrid
concept was initiated by Pian (1964). Tong and Pian (1969) rationalized
the approach using the variational formulation with the functional defined
in Eq. (10.10:10) or (10.10:15). The concept was later generalized into
different finite element models (Tong 1970, Pian and Tong 1969, 1972) and
used in numerous applications (Tong and Fung 1971, Tong et al. 1973, Chen
and Mei 1974, Pian and Chen 1982, Atluri 1980, 1980a, 1995).

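Again, using plane elasticity as an example, we can write the hybrid
element functional of Eq. (10.10:10) in the form of a matrix product
equation:

(1) $\displaystyle \Pi_{He} = \int_{\partial A_e}\left(\tilde{\mathbf{u}}^T - \mathbf{u}^T\right)\mathbf{T}\,ds + \int_{A_e}\left(\boldsymbol{\sigma}^T\boldsymbol{\epsilon} - \tfrac{1}{2}\boldsymbol{\sigma}^T\mathbf{C}_e\boldsymbol{\sigma} - \mathbf{u}^T\mathbf{b}\right)dA - \int_{\partial A_{\sigma e}}\tilde{\mathbf{u}}^T\bar{\mathbf{T}}\,ds$,

where $\mathbf{T}$ is the boundary traction for each element and
$\tilde{\mathbf{u}}$ is the inter-element boundary displacement. In other
words, the $\mathbf{T}$'s of different elements are independent of each other
while $\tilde{\mathbf{u}}$ is the same for adjacent elements along their
common boundaries and satisfies the condition

(2) $\tilde{\mathbf{u}} = \bar{\mathbf{u}}$ on $\partial A_u$,

where the displacement is prescribed.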

The hybrid functional $\Pi_H$ given in Eq. (10.10:10) for the entire domain
involves multiple fields, namely $\mathbf{u}$ and $\boldsymbol{\sigma}$
within each element (the strain is in terms of $\mathbf{u}$) as well as
$\mathbf{T}$ and $\tilde{\mathbf{u}}$ on the element boundaries. The
admissibility requirements are: piecewise continuity of $\tilde{\mathbf{u}}$
along inter-element boundaries, $\tilde{\mathbf{u}} = \bar{\mathbf{u}}$,
$\delta\tilde{\mathbf{u}} = \mathbf{0}$ on $\partial A_u$, and smooth
functions for the other field variables within the element.

The stationary condition of $\Pi_H$ with respect to the admissible fields
gives the Euler equations

(3) $\dfrac{\partial\sigma_{xx}}{\partial x} + \dfrac{\partial\sigma_{xy}}{\partial y} + b_x = 0, \quad \dfrac{\partial\sigma_{xy}}{\partial x} + \dfrac{\partial\sigma_{yy}}{\partial y} + b_y = 0$ (equilibrium equations in $A_e$),

(4) $\boldsymbol{\epsilon} = \mathbf{C}_e\boldsymbol{\sigma}$ (constitutive law in $A_e$),

(5) $\mathbf{u} = \tilde{\mathbf{u}}$ (element displacement condition on $\partial A_e$),

(6) $\mathbf{T} = \begin{Bmatrix} n_x\sigma_{xx} + n_y\sigma_{xy} \\ n_x\sigma_{xy} + n_y\sigma_{yy} \end{Bmatrix}$ (element traction condition on $\partial A_e$),

(7) $\mathbf{T} = \bar{\mathbf{T}}$ (traction boundary condition on $\partial A_{\sigma e}$),

(8) $\displaystyle \sum_{\text{all elements}}\int_{\partial A_e - \partial A_{\sigma e}}\delta\tilde{\mathbf{u}}^T\mathbf{T}\,ds = 0$ (inter-element equilibrium),

where $\mathbf{n}$ is the unit normal to $\partial A_e$. Equations (3)-(7) do
not involve summation over elements because $\delta\mathbf{T}$,
$\delta\mathbf{u}$ of different elements are independent of each other. On
the other hand, $\delta\tilde{\mathbf{u}}$ are common to adjacent elements
and Eq. (8) implies inter-element equilibrium between two adjacent elements

(9) $\mathbf{T}_I + \mathbf{T}_{II} = \mathbf{0}$,

where the subscripts I and II signify that the quantities are associated with
elements I and II, respectively. In the formulation above, inter-element
compatibility for $\mathbf{u}$ is satisfied in an average sense through
Eq. (5) and inter-element equilibrium for $\boldsymbol{\sigma}$ is achieved
in an average sense through Eqs. (6) and (8). The derivation of the element
matrices based on Eq. (1) by independently interpolating all field variables
in terms of their unknown parameters is straightforward. Since the
inter-element admissibility requirement involves only $\tilde{\mathbf{u}}$ on
the element boundaries and imposes no restriction on $\mathbf{u}$ and
$\boldsymbol{\sigma}$, one has great flexibility to select the shape
functions to represent these variables. This alleviates many difficulties
encountered in problems governed by fourth order differential equations and
provides easy means to treat problems with singularities (Tong and Pian
1972). Also, because stress is an independent field, it permits a better
stress description to improve the stress approximation.

Through selectively enforcing Eqs. (3)-(9) a priori, the number of
independent field variables of Eq. (1) can be reduced, which leads to the
different hybrid models to be discussed below.

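Hybrid Stress Model. This is a widely used hybrid model originally
proposed by Pian. Choosing an equilibrium stress $\boldsymbol{\sigma}$ that
satisfies also the element traction condition Eq. (6) a priori, we can
simplify $\Pi_{He}$. Denoting it as $\Pi_{HSe}$, we obtain

(10) ▲ $\displaystyle \Pi_{HSe} = \int_{\partial A_e}\tilde{\mathbf{u}}^T\mathbf{T}\,ds - \int_{A_e}\tfrac{1}{2}\boldsymbol{\sigma}^T\mathbf{C}_e\boldsymbol{\sigma}\,dA - \int_{\partial A_{\sigma e}}\tilde{\mathbf{u}}^T\bar{\mathbf{T}}\,ds$,

for the hybrid stress principle Eq. (10.10:15). To derive the element
matrices, we shall consider only the case of zero body force, i.e.,
$\mathbf{b} = \mathbf{0}$. We assume $\boldsymbol{\sigma}$ in the form

(11) $\boldsymbol{\sigma} = \mathbf{h}_\beta\boldsymbol{\beta}$,

where

(12)
$$
\underset{3\times p}{\mathbf{h}_\beta} =
\begin{bmatrix}
1 & 0 & 0 & y & 0 & x & 0 & \cdots \\
0 & 1 & 0 & 0 & x & 0 & y & \cdots \\
0 & 0 & 1 & 0 & 0 & -y & -x & \cdots
\end{bmatrix},
\qquad
\boldsymbol{\beta}^T = [\beta_1\ \beta_2\ \cdots\ \beta_p].
$$

The matrix $\mathbf{h}_\beta$ is so chosen that it satisfies the equilibrium
equations for arbitrary $\boldsymbol{\beta}$. We then assume
$\tilde{\mathbf{u}}$ in terms of its nodal values along the element
boundaries. An example of a quadrilateral element is shown in Fig. 19.2:1
with $(q_1, q_3, q_5, q_7)$ and $(q_2, q_4, q_6, q_8)$ being the nodal values
of $\tilde{u}$, $\tilde{v}$, respectively.

Fig. 19.2:1. Quadrilateral element.

Then $\tilde{\mathbf{u}}$ can be written as

(13) $\tilde{\mathbf{u}} = \begin{Bmatrix}\tilde{u} \\ \tilde{v}\end{Bmatrix} = \mathbf{h}\mathbf{q}$,

where

$$
\mathbf{h} =
\begin{cases}
\begin{bmatrix}
1 - s/s_{1,2} & 0 & s/s_{1,2} & 0 & 0 & 0 & 0 & 0 \\
0 & 1 - s/s_{1,2} & 0 & s/s_{1,2} & 0 & 0 & 0 & 0
\end{bmatrix} & \text{(between nodes 1 and 2)}, \\[2ex]
\begin{bmatrix}
0 & 0 & 1 - s/s_{2,3} & 0 & s/s_{2,3} & 0 & 0 & 0 \\
0 & 0 & 0 & 1 - s/s_{2,3} & 0 & s/s_{2,3} & 0 & 0
\end{bmatrix} & \text{(between nodes 2 and 3)}, \\[1ex]
\qquad\vdots \\[1ex]
\begin{bmatrix}
s/s_{4,1} & 0 & 0 & 0 & 0 & 0 & 1 - s/s_{4,1} & 0 \\
0 & s/s_{4,1} & 0 & 0 & 0 & 0 & 0 & 1 - s/s_{4,1}
\end{bmatrix} & \text{(between nodes 4 and 1)}.
\end{cases}
$$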





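Here $s$ is the distance along the element boundary and $s_{i,j}$ is the
distance between nodes $i$ and $j$. Note that $\tilde{\mathbf{u}}$ has the
same displacement along the element boundaries as that of a 4-node
quadrilateral displacement element. Also, at the element boundaries, one has

$$\mathbf{T} = \mathbf{n}'^T\boldsymbol{\sigma} = \mathbf{n}'^T\mathbf{h}_\beta\boldsymbol{\beta}, \qquad \mathbf{T}^T = \boldsymbol{\beta}^T\mathbf{h}_\beta^T\mathbf{n}',$$

where

$$
\mathbf{n}'^T =
\begin{bmatrix}
n_x & 0 & n_y \\
0 & n_y & n_x
\end{bmatrix},
$$

in which $n$'s are the components of a unit normal to the boundaries. Recall
that in Chapters 16 and 17, we defined traction as
$\mathbf{T} = \mathbf{n}\cdot\boldsymbol{\sigma}$ in tensor notation, where
$\boldsymbol{\sigma}$ is a second rank tensor and $\mathbf{n}$ a unit vector.
Presently $\boldsymbol{\sigma}$ is defined in Eq. (18.14:1) as a column
matrix. In matrix operation, we have to define $\mathbf{n}'^T$ as above to
obtain the correct traction

$$
\mathbf{T} =
\begin{Bmatrix}
n_x\sigma_{xx} + n_y\sigma_{xy} \\
n_x\sigma_{xy} + n_y\sigma_{yy}
\end{Bmatrix}.
$$

A substitution of Eqs. (11) and (13) into Eq. (10) yields

(14) $\Pi_{HSe} = \boldsymbol{\beta}^T\mathbf{k}_{\beta q}\mathbf{q} - \tfrac{1}{2}\boldsymbol{\beta}^T\mathbf{k}_{\beta\beta}\boldsymbol{\beta} - \mathbf{q}^T\mathbf{f}$,

where

(15) ▲ $\displaystyle \mathbf{k}_{\beta\beta} = \int_{A_e}\mathbf{h}_\beta^T\mathbf{C}_e\mathbf{h}_\beta\,dA, \qquad \mathbf{k}_{\beta q} = \int_{\partial A_e}\mathbf{h}_\beta^T\mathbf{n}'\mathbf{h}\,ds, \qquad \mathbf{f} = \int_{\partial A_{\sigma e}}\mathbf{h}^T\bar{\mathbf{T}}\,ds.$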

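The element force matrix $\mathbf f$ is the same as that of the displacement model. For a constant body force $\mathbf b^T = [b_x \ b_y]$, one may take

$\boldsymbol\sigma = \mathbf h_\beta \boldsymbol\beta + \boldsymbol\sigma_p = \mathbf h_\beta \boldsymbol\beta - [b_x x \ \ b_y y \ \ 0]^T ,$

where $\boldsymbol\sigma_p$ is a particular solution of the equilibrium equations with body force. There will be additional terms in $\mathbf f$ through $\boldsymbol\sigma_p$ (Tong and Rossettos 1977). Then, the element force matrices of the two approaches can be different.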

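In Eq. (14), the $\boldsymbol\beta$'s of different elements are independent of each other and can be eliminated using the stationary condition of $\Pi_{HSe}$ with respect to $\boldsymbol\beta$:

(16)  $\mathbf k_{\beta\beta} \boldsymbol\beta = \mathbf k_{\beta q} \mathbf q \quad \text{or} \quad \boldsymbol\beta = \mathbf k_{\beta\beta}^{-1} \mathbf k_{\beta q} \mathbf q .$

Substituting Eq. (16) into Eq. (14), we obtain again the quadratic form

(17)  $\Pi_{HSe} = \tfrac{1}{2} \mathbf q^T \mathbf k \mathbf q - \mathbf q^T \mathbf f ,$

for the element, where

(18)  $\mathbf k = \mathbf k_{\beta q}^T \mathbf k_{\beta\beta}^{-1} \mathbf k_{\beta q} .$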

The element matrices can be assembled and constrained in the usual manner 
to obtain the finite element equations for the entire system. 
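The elimination of the stress parameters in Eqs. (16)-(18) is a static condensation, and it can be illustrated with small matrices. The sketch below is only an illustration: the numbers in `K_BB` and `K_BQ` are made up (they do not come from an actual element), the load term is omitted, and the helper names are hypothetical.

```python
def solve(a, b):
    """Solve a x = b by Gauss-Jordan elimination with partial pivoting (b is a matrix)."""
    n = len(a)
    m = [list(a[i]) + list(b[i]) for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        p = m[col][col]
        m[col] = [v / p for v in m[col]]
        for r in range(n):
            if r != col:
                factor = m[r][col]
                m[r] = [vr - factor * vc for vr, vc in zip(m[r], m[col])]
    return [row[n:] for row in m]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def transpose(a):
    return [list(col) for col in zip(*a)]

def quad(u, m, v):
    """Return u^T m v for lists u, v and a nested-list matrix m."""
    return sum(u[i] * m[i][j] * v[j] for i in range(len(u)) for j in range(len(v)))

def condense(k_bb, k_bq):
    """Return (k, x) with x = k_bb^{-1} k_bq (Eq. 16) and k = k_bq^T x (Eq. 18)."""
    x = solve(k_bb, k_bq)
    return matmul(transpose(k_bq), x), x

# Hypothetical matrices for 2 stress parameters and 3 nodal displacements.
K_BB = [[4.0, 1.0], [1.0, 3.0]]
K_BQ = [[1.0, 0.0, -1.0], [2.0, 1.0, -3.0]]
K, X = condense(K_BB, K_BQ)

def pi_beta(beta, q):
    """Functional of Eq. (14) without the load term."""
    return quad(beta, K_BQ, q) - 0.5 * quad(beta, K_BB, beta)

def pi_condensed(q):
    """Functional of Eq. (17) without the load term."""
    return 0.5 * quad(q, K, q)
```

For any `q`, evaluating `pi_beta` at `beta = X q` should reproduce `pi_condensed(q)`, and `K` should be symmetric.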

Similar to Eq. (19.1:6), we have

$\text{rank of } \mathbf k \le \min(n_\beta , \, n_q - n_r) ,$

in which $n_\beta$, $n_q$, $n_r$ are, respectively, the numbers of $\beta$'s, $q$'s and rigid body modes. In other words, the rank of $\mathbf k$ is no larger than the number of $\beta$'s. For a quadrilateral element, there are 8 $q$'s and 3 rigid body motions. If the number of $\beta$'s is less than 5, the rank of $\mathbf k$ is deficient and spurious deformation mode(s), which contribute no energy to the element, will exist. The five-$\beta$ hybrid stress element is commonly used. In this case, Eq. (12) becomes

$\underset{3 \times 5}{\mathbf h_\beta} = \begin{bmatrix} 1 & 0 & 0 & y & 0 \\ 0 & 1 & 0 & 0 & x \\ 0 & 0 & 1 & 0 & 0 \end{bmatrix} , \qquad \boldsymbol\beta^T = [\beta_1 \ \beta_2 \ \beta_3 \ \beta_4 \ \beta_5] .$
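Two statements above lend themselves to a quick numerical check: that the assumed stress satisfies equilibrium for arbitrary $\boldsymbol\beta$, and that the rank bound requires at least five stress parameters for a quadrilateral. The following sketch assumes the seven-column form of $\mathbf h_\beta$ with rows ordered as $\sigma_{xx}, \sigma_{yy}, \sigma_{xy}$ (its first five columns are the five-$\beta$ element); the function names are illustrative.

```python
def h_beta(x, y):
    """Seven-column stress interpolation; rows are s_xx, s_yy, s_xy."""
    return [
        [1.0, 0.0, 0.0, y, 0.0, x, 0.0],
        [0.0, 1.0, 0.0, 0.0, x, 0.0, y],
        [0.0, 0.0, 1.0, 0.0, 0.0, -y, -x],
    ]

def stress(beta, x, y):
    return [sum(h * b for h, b in zip(row, beta)) for row in h_beta(x, y)]

def equilibrium_residual(beta, x, y, d=1e-3):
    """Central-difference residuals of s_xx,x + s_xy,y and s_xy,x + s_yy,y (zero body force)."""
    sxp, sxm = stress(beta, x + d, y), stress(beta, x - d, y)
    syp, sym = stress(beta, x, y + d), stress(beta, x, y - d)
    rx = (sxp[0] - sxm[0]) / (2 * d) + (syp[2] - sym[2]) / (2 * d)
    ry = (sxp[2] - sxm[2]) / (2 * d) + (syp[1] - sym[1]) / (2 * d)
    return rx, ry

def rank_bound(n_beta, n_q, n_r):
    """Upper bound on the rank of the element stiffness matrix."""
    return min(n_beta, n_q - n_r)
```

Since the assumed stresses are linear in $x$ and $y$, the central differences are exact up to round-off, so both residuals should vanish for any choice of `beta`. With `n_q = 8` and `n_r = 3`, `rank_bound` returns 5 only when `n_beta` is at least 5.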


For elements of distorted geometry, instead of using $\mathbf h_\beta$ as a function of the Cartesian coordinates as in Eq. (12), which makes the assumed stresses satisfy the equilibrium condition exactly, Pian recommended expressing stresses in terms of the natural coordinates $\xi$, $\eta$ of the element. In this case the assumed stresses satisfy the equilibrium condition only in an integral or average sense (Pian and Sumihara 1984).


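Hybrid Displacement Model (Tong 1970). If the constitutive laws are satisfied, and if the stress is expressed in terms of strains, the functional Eq. (1) becomes the hybrid displacement principle Eq. (10.10:17) and can be written as

(19)  $\Pi_{HDe} = \int_{\partial A_e} (\tilde{\mathbf u}^T - \mathbf u^T) \tilde{\mathbf T} \, ds + \int_{A_e} \left( \tfrac{1}{2} \mathbf e^T \mathbf D_e \mathbf e - \mathbf u^T \mathbf b \right) dA - \int_{\partial A_{\sigma e}} \tilde{\mathbf u}^T \bar{\mathbf T} \, ds ,$

where $\mathbf D_e$ is the elastic modulus matrix relating strain $\mathbf e$ to stress $\boldsymbol\sigma$, and $\tilde{\mathbf T}$ is the boundary traction treated as an independent field. Along the traction prescribed boundaries, there is no adjacent element and hence no restriction on the selection of $\tilde{\mathbf u}$; one can simply set $\tilde{\mathbf u} = \mathbf u$ on $\partial A_{\sigma e}$. In this case, Eq. (19) becomes

(20)  $\Pi_{HDe} = \int_{\partial A_e - \partial A_{\sigma e}} (\tilde{\mathbf u}^T - \mathbf u^T) \tilde{\mathbf T} \, ds + \int_{A_e} \left( \tfrac{1}{2} \mathbf e^T \mathbf D_e \mathbf e - \mathbf u^T \mathbf b \right) dA - \int_{\partial A_{\sigma e}} \mathbf u^T \bar{\mathbf T} \, ds ,$

in which the integration over $\partial A_{\sigma e}$ is removed from the first integral and $\tilde{\mathbf u}$ is replaced by $\mathbf u$ in the last integral.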

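One can reduce further the number of field variables in the functional by requiring the traction stress relation, the equilibrium equation and the stress displacement relation

(21)  $\tilde{\mathbf T} = \mathbf T = \mathbf n'^T \boldsymbol\sigma , \qquad \frac{\partial \boldsymbol\sigma}{\partial \mathbf x} + \mathbf b = 0 , \qquad \boldsymbol\sigma = \mathbf D_e \mathbf e = \mathbf D_e \mathbf d_e \mathbf u$

to be satisfied a priori, where $\mathbf d_e$ is defined in Eq. (18.11:6) and $\mathbf d_e \mathbf u$ is the strain matrix. Then Eq. (19) is reduced to [Eq. (10.10:20)]

(22)  $\Pi_{HDe} = \int_{\partial A_e - \partial A_{\sigma e}} \tilde{\mathbf u}^T \mathbf T \, ds - \tfrac{1}{2} \int_{A_e} \mathbf e^T \mathbf D_e \mathbf e \, dA + \int_{\partial A_{\sigma e}} \mathbf u^T (\mathbf T - \bar{\mathbf T}) \, ds .$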

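If the displacement $\mathbf u$ is divided into the homogeneous and particular parts

(23)  $\mathbf u = \mathbf u_h + \mathbf u_p$

such that their corresponding stresses $\boldsymbol\sigma_h$ and $\boldsymbol\sigma_p$ satisfy

(24)  $\frac{\partial \boldsymbol\sigma_h}{\partial \mathbf x} = 0 , \qquad \frac{\partial \boldsymbol\sigma_p}{\partial \mathbf x} + \mathbf b = 0 ,$

then $\Pi_{HDe}$ becomes

(25)  $\Pi_{HDe} = \int_{\partial A_e - \partial A_{\sigma e}} \tilde{\mathbf u}^T (\mathbf T_h + \mathbf T_p) \, ds - \int_{\partial A_e} \left( \tfrac{1}{2} \mathbf u_h^T + \mathbf u_p^T \right) \mathbf T_h \, ds + \int_{\partial A_{\sigma e}} \mathbf u_h^T (\mathbf T_h + \mathbf T_p - \bar{\mathbf T}) \, ds + \text{const} ,$

which involves integrals along element boundaries only. Note that $\mathbf u_p$, $\mathbf T_p$, $\bar{\mathbf T}$ are known functions. The functional above involves only two independent field variables $\mathbf u_h$ and $\tilde{\mathbf u}$.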

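We further separate $\mathbf u_h$ into two parts

(26)  $\mathbf u_h = \mathbf u_{hh} + \mathbf u_{th}$

in the way that the tractions $\mathbf T_{hh}$, $\mathbf T_{th}$, derived from $\mathbf u_{hh}$, $\mathbf u_{th}$ respectively, satisfy the condition

(27)  $\mathbf T_{hh} = 0 , \qquad \mathbf T_{th} = \bar{\mathbf T} - \mathbf T_p \quad \text{on } \partial A_{\sigma e} .$

Let $\mathbf T_h$ be the traction derived from $\mathbf u_h$, then

(28)  $\mathbf T_h = \mathbf T_{hh} + \mathbf T_{th} .$

A substitution of Eqs. (26)-(28) into Eq. (25) yields

(29)  $\Pi_{HDe} = \int_{\partial A_e - \partial A_{\sigma e}} \left( \tilde{\mathbf u}^T \mathbf T_{hh} - \tfrac{1}{2} \mathbf u_{hh}^T \mathbf T_{hh} \right) ds + \int_{\partial A_e - \partial A_{\sigma e}} \tilde{\mathbf u}^T (\mathbf T_{th} + \mathbf T_p) \, ds$
$\qquad\qquad - \int_{\partial A_e - \partial A_{\sigma e}} \left[ \tfrac{1}{2} \mathbf u_{hh}^T \mathbf T_{th} + \left( \tfrac{1}{2} \mathbf u_{th}^T + \mathbf u_p^T \right) \mathbf T_{hh} \right] ds + \int_{\partial A_{\sigma e}} \tfrac{1}{2} \mathbf u_{hh}^T (\mathbf T_p - \bar{\mathbf T}) \, ds + \text{constant} .$

Note that $\mathbf T_{hh}$ and $\mathbf T_{th}$ are the tractions associated with $\mathbf u_{hh}$ and $\mathbf u_{th}$, respectively, and that the displacement $\mathbf u_{hh}$ satisfies both the homogeneous equilibrium equations and the zero traction condition on the traction prescribed surfaces. Only the first integral involves quadratic terms of the field variables $\mathbf u_{hh}$ since $\mathbf T_{hh}$ is a function of $\mathbf u_{hh}$, whereas the rest of the integrals involve only linear terms of $\mathbf u_{hh}$. If there is no body force, i.e., $\mathbf b = 0$, we have $\mathbf u_p = \mathbf T_p = 0$. If the traction on the traction prescribed surface is also zero, then $\mathbf u_{th} = \mathbf T_{th} = 0$ and all of the last three integrals in Eq. (29) vanish. For 3-dimensional problems, one simply replaces $A$'s by $V$'s and $ds$ by $dS$.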

Problem 19.1. Show that the element stiffness matrix of the hybrid stress 
model is independent of the body force. 

Problem 19.2. Derive the element force matrix of the hybrid stress model 
for plane elasticity with a constant body force. 

Problem 19.3. Show that the hybrid stress stiffness matrix for a triangular element is the same as that of the constant strain triangle of the displacement model regardless of the number of $\beta$'s in the stress mode.

Problem 19.4. Derive the hybrid stress element stiffness matrix for an 8-node hexahedron. Determine the rank of $\mathbf k$ for different numbers of stress modes.

Problem 19.5. Derive the element stiffness matrix of the hybrid displacement model:

(a) Use all three variables $\mathbf u$, $\tilde{\mathbf u}$, $\tilde{\mathbf T}$ as independent fields.

(b) Enforce $\tilde{\mathbf T} = \mathbf T$ a priori and just use $\mathbf u$, $\tilde{\mathbf u}$ as the independent field variables.

(c) Discuss how the rank of $\mathbf k$ is affected by the number of unknown parameters of $\mathbf u$ and $\tilde{\mathbf T}$.

Problem 19.6. Derive the element force matrix of the hybrid displacement 
model corresponding to the field variable u. 

Problem 19.7. If the conditions given in Eq. (19.2:21) are satisfied a priori, 
show that Eq. (19.2:19) can be reduced to Eq. (19.2:22). 

Problem 19.8. Consider the case that $\mathbf T \ (= n_j \sigma_{ji})$ and that $\sigma_{ji}$ is in terms of displacements through the stress strain relation (i.e., $\boldsymbol\sigma = \mathbf D_e \mathbf e = \mathbf D_e \mathbf d_e \mathbf u$). Show that the first variation of $\Pi_{HDe}$ in Eq. (19.2:19) with respect to $\mathbf u$ gives the matching conditions

$\tilde{\mathbf u} = \mathbf u \quad \text{on } \partial A_e ,$

$\mathbf T = \bar{\mathbf T} \quad \text{on } \partial A_{\sigma e} ,$

and the equilibrium equation in terms of $\mathbf u$ in $A_e$.

Problem 19.9. In Problem 19.7, if the displacement is separated into the homogeneous and particular solutions $\mathbf u_h$ and $\mathbf u_p$ such that Eq. (19.2:24) is satisfied, show that $\Pi_{HDe}$ can be reduced to Eq. (19.2:25).

Problem 19.10. In Problem 19.9, suppose one further separates $\mathbf u_h = \mathbf u_{hh} + \mathbf u_{th}$ as in Eq. (19.2:26) with the corresponding $\mathbf T_{hh}$, $\mathbf T_{th}$ satisfying Eq. (19.2:27). Show that Eq. (19.2:25) reduces to Eq. (19.2:29).

19.3. HYBRID SINGULAR ELEMENTS (SUPER-ELEMENTS) 

For problems whose solution is smooth, the order of the finite element 
approximation is proportional to the order of the complete polynomial used 
for the shape functions. When a problem involves singularity, the order of 
approximation is dominated by the order of the singularity (Key 1966, 
Tong and Pian 1972). For example, in fracture mechanics, the asymptotic 
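solution near a crack tip is (Sih and Liebowitz 1968)

(1)  $\begin{Bmatrix} \sigma_{xx} \\ \sigma_{yy} \\ \sigma_{xy} \end{Bmatrix} = \frac{K_I}{\sqrt{2\pi r}} \cos\frac{\theta}{2} \begin{Bmatrix} 1 - \sin\frac{\theta}{2} \sin\frac{3\theta}{2} \\ 1 + \sin\frac{\theta}{2} \sin\frac{3\theta}{2} \\ \sin\frac{\theta}{2} \cos\frac{3\theta}{2} \end{Bmatrix} + \frac{K_{II}}{\sqrt{2\pi r}} \begin{Bmatrix} -\sin\frac{\theta}{2} \left( 2 + \cos\frac{\theta}{2} \cos\frac{3\theta}{2} \right) \\ \cos\frac{\theta}{2} \sin\frac{\theta}{2} \cos\frac{3\theta}{2} \\ \cos\frac{\theta}{2} \left( 1 - \sin\frac{\theta}{2} \sin\frac{3\theta}{2} \right) \end{Bmatrix} ,$

(2)  $\begin{Bmatrix} u \\ v \end{Bmatrix} = \frac{K_I}{2G} \sqrt{\frac{r}{2\pi}} \begin{Bmatrix} \cos\frac{\theta}{2} \, (\kappa - \cos\theta) \\ \sin\frac{\theta}{2} \, (\kappa - \cos\theta) \end{Bmatrix} + \frac{K_{II}}{2G} \sqrt{\frac{r}{2\pi}} \begin{Bmatrix} \sin\frac{\theta}{2} \, (2 + \kappa + \cos\theta) \\ \cos\frac{\theta}{2} \, (2 - \kappa - \cos\theta) \end{Bmatrix} ,$

where $G$ is the shear modulus, $r$ is the distance from the crack tip, $\theta$ is the angle measured from the crack plane, and $\kappa = 3 - 4\nu$ (for plane strain) and $\kappa = (3 - \nu)/(1 + \nu)$ (for plane stress), in which $\nu$ is the Poisson's ratio. $K_I$ and $K_{II}$ are constants commonly referred to as the stress intensity factors associated with symmetric (Mode I) and antisymmetric (Mode II) deformations with respect to the crack plane. The stresses and strains are proportional to $1/\sqrt{r}$ and the displacements are proportional to $\sqrt{r}$.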
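The asymptotic fields of Eqs. (1) and (2) are easy to evaluate directly, which helps in checking the stated $1/\sqrt{r}$ and $\sqrt{r}$ behavior and the traction free crack faces at $\theta = \pm\pi$. The following is a minimal sketch; the function names and argument order are illustrative choices, not part of the text.

```python
import math

def crack_tip_stress(k1, k2, r, theta):
    """Return (s_xx, s_yy, s_xy) of Eq. (1) for given K_I, K_II, r, theta."""
    s, c = math.sin(theta / 2), math.cos(theta / 2)
    s3, c3 = math.sin(1.5 * theta), math.cos(1.5 * theta)
    f = 1.0 / math.sqrt(2 * math.pi * r)
    s_xx = f * (k1 * c * (1 - s * s3) - k2 * s * (2 + c * c3))
    s_yy = f * (k1 * c * (1 + s * s3) + k2 * s * c * c3)
    s_xy = f * (k1 * s * c * c3 + k2 * c * (1 - s * s3))
    return s_xx, s_yy, s_xy

def crack_tip_displacement(k1, k2, r, theta, g, kappa):
    """Return (u, v) of Eq. (2); g is the shear modulus, kappa as defined in the text."""
    s, c = math.sin(theta / 2), math.cos(theta / 2)
    f = math.sqrt(r / (2 * math.pi)) / (2 * g)
    u = f * (k1 * c * (kappa - math.cos(theta)) + k2 * s * (2 + kappa + math.cos(theta)))
    v = f * (k1 * s * (kappa - math.cos(theta)) + k2 * c * (2 - kappa - math.cos(theta)))
    return u, v
```

Reducing $r$ by a factor of four should double every stress component and halve every displacement component, and $\sigma_{yy}$, $\sigma_{xy}$ should vanish on the crack faces.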

Near the singularity, the polynomial-based finite element method performs badly. Henshell and Shaw (1975) and Barsoum (1976, 1977) shifted the midside node to the quarter point along a side in quadratic isoparametric elements. The isoparametric transformation has a singularity at the corner near the shifted node. This generates a singular strain of $1/\sqrt{r}$ along the edge with the shifted nodes. To illustrate this approach, let us consider a 6-node rectangle (Fig. 19.3:1) with the 5th and 6th (identified as node 8) nodes being at the quarter point from node 1 along the sides between nodes 1 and 2 and between nodes 1 and 4, respectively. Isoparametric transformation is used to map the two quarter point nodes to the corresponding midsides in the $\xi$, $\eta$ plane.

Fig. 19.3:1. A quarter point rectangular element.

The transformation is

$x = \sum_{i=1}^{4} x_i h_i + x_5 h_5 = a\xi + x_5 h_5 = a \left[ \xi - \tfrac{1}{4} (1 - \xi^2)(1 - \eta) \right] ,$

$y = \sum_{i=1}^{4} y_i h_i + y_8 h_8 = b\eta + y_8 h_8 = b \left[ \eta - \tfrac{1}{4} (1 - \eta^2)(1 - \xi) \right] ,$

which maps the midside points $(0, -1)$ and $(-1, 0)$ in the $\xi$, $\eta$ plane to the quarter points $(-a/2, -b)$ and $(-a, -b/2)$ in the $x$, $y$ plane. The inverse of the Jacobian

$\mathbf J^{-1} = \begin{bmatrix} \dfrac{\partial \xi}{\partial x} & \dfrac{\partial \eta}{\partial x} \\ \dfrac{\partial \xi}{\partial y} & \dfrac{\partial \eta}{\partial y} \end{bmatrix} = \frac{8}{(3\xi\eta - \xi - \eta - 5)(\xi\eta - \xi - \eta - 3)} \begin{bmatrix} \dfrac{2 + \eta - \xi\eta}{a} & \dfrac{\eta^2 - 1}{2a} \\ \dfrac{\xi^2 - 1}{2b} & \dfrac{2 + \xi - \xi\eta}{b} \end{bmatrix}$

is singular at $\xi = \eta = -1$. It can be shown that $\mathbf J^{-1}$ has a $1/\sqrt{r}$ singularity at node 1 at $x = -a$ and $y = -b$, where $r$ is the distance from the node. The strains $e_{xx}$, $e_{yy}$ and $e_{xy}$ are proportional to $\mathbf J^{-1}$ and thus have the same singularity as $\mathbf J^{-1}$.
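The quarter point mapping and its inverse Jacobian can be verified numerically. The sketch below assumes the mapping $x = a[\xi - \tfrac14(1-\xi^2)(1-\eta)]$, $y = b[\eta - \tfrac14(1-\eta^2)(1-\xi)]$ and compares the closed-form $\mathbf J^{-1}$ with a finite-difference evaluation; all function names are illustrative.

```python
import math

def quarter_point_map(xi, eta, a, b):
    """Map (xi, eta) to (x, y) for the quarter point rectangle."""
    x = a * (xi - 0.25 * (1 - xi ** 2) * (1 - eta))
    y = b * (eta - 0.25 * (1 - eta ** 2) * (1 - xi))
    return x, y

def inverse_jacobian(xi, eta, a, b):
    """Closed form; rows are [dxi/dx, deta/dx] and [dxi/dy, deta/dy]."""
    den = (3 * xi * eta - xi - eta - 5) * (xi * eta - xi - eta - 3)
    c = 8.0 / den
    return [[c * (2 + eta - xi * eta) / a, c * (eta ** 2 - 1) / (2 * a)],
            [c * (xi ** 2 - 1) / (2 * b), c * (2 + xi - xi * eta) / b]]

def inverse_jacobian_fd(xi, eta, a, b, d=1e-6):
    """Same matrix from central differences of the map, inverted by hand."""
    xp, yp = quarter_point_map(xi + d, eta, a, b)
    xm, ym = quarter_point_map(xi - d, eta, a, b)
    x_xi, y_xi = (xp - xm) / (2 * d), (yp - ym) / (2 * d)
    xp, yp = quarter_point_map(xi, eta + d, a, b)
    xm, ym = quarter_point_map(xi, eta - d, a, b)
    x_eta, y_eta = (xp - xm) / (2 * d), (yp - ym) / (2 * d)
    det = x_xi * y_eta - x_eta * y_xi
    return [[y_eta / det, -y_xi / det], [-x_eta / det, x_xi / det]]
```

Along the edge $\eta = -1$ the map gives $r = x + a = \tfrac{a}{2}(1 + \xi)^2$, so $\partial\xi/\partial x = 1/\sqrt{2ar}$ there, which is the $1/\sqrt{r}$ behavior claimed for $\mathbf J^{-1}$.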
The hybrid approach allows the incorporation of appropriate singularity
in the shape functions of field variables (Tong et al., 1973, Tong 1977, 
1984, Lin and Tong 1980, Piltner 1985, Jirousek 1987, Jeong et al. 1996). 
Consider a domain with a crack as shown in Fig. 19.3:2. One can use 
one element to surround the crack tip and incorporate the appropriate 
singularity in the shape functions for the element. There is no singularity 
in the region outside the crack element. Regular polynomial-based elements 
discussed before can be used. The assumed functions for the crack element 
and those for the surrounding elements are, in general, not continuous along 
their common boundaries. The hybrid model enforces continuity along the 
inter-element boundaries in an average sense in the final solution. Such an 
element is also called a super-element. 

Fig. 19.3:2. Crack element (super-element) connected to: (a) and (c) 4-node quadrilaterals and (b) 8-node quadrilaterals. Panel labels: (b) 10-node crack element; (c) 5-node crack element.

Fig. 19.3:3. Crack growth.

The crack tip can be anywhere within the element. Thus, one can use the same element with the crack tip at different locations (Fig. 19.3:3). This feature makes the crack element ideal for crack growth simulation. As the crack advances within the element, one simply changes the crack tip location to reflect the crack extension, calculates the new crack element stiffness matrix and analyzes the new configuration. No remeshing is required.
When the crack tip gets too close to the element boundary, the solution 
will deteriorate because the adjacent element is normally polynomial based. 
One may then include the elements close to the crack tip to form a new and 
enlarged crack element. At the same time the part of the crack element far 
from the crack tip can be converted into regular elements. In this process, 
one often can retain the same global nodal labels for the whole domain with 
changes in the assembly list only, which defines the relation between the 
local and the global labels of each element, for the altered elements. Even 
if some minor adjustment of nodal locations may be needed for the new 
elements to avoid having elements with bad aspect ratios or severely acute
angles, there will be no complicated remeshing involved.

The shape of crack element, generally a polygon, and the number of 
nodes along the element boundaries can be arbitrary. The spacing of nodes 
can be varied to coincide with the nodes of adjacent elements. A crack 
element with 10 nodes is shown in Fig. 19.3:2(b). The origin of the coordinates is at the crack tip and the crack surfaces are along the negative x-axis (i.e., $y = 0^{\pm}$, $x < 0$).

The procedure of deriving the element matrices based on the hybrid displacement functional Eq. (19.2:19) is as follows: the asymptotic solution of Eq. (1) is used to guide the selection of the assumed functions for the field variables within the element. The crack element connects to the adjacent elements along the boundaries by nodes from 1 through 9. There is no singularity along these boundaries. We simply assume the element boundary displacements $\tilde{\mathbf u}$ in the same form as those of the adjacent elements to assure interelement compatibility. The crack surfaces are between nodes 9 and 10 and between nodes 10 and 1, where traction is prescribed. Since no adjacent element is involved along the crack surfaces, there is no restriction on the selection of $\tilde{\mathbf u}$. We simply assume $\tilde{\mathbf u} = \mathbf u$ along these surfaces. Then, Eq. (19.2:20) is used to evaluate the element matrices.

We shall first discuss the selection of $\tilde{\mathbf u}$. Figure 19.3:2(a) depicts a 7-node crack element connecting to a 4-node quadrilateral element while Fig. 19.3:2(b) illustrates a 10-node crack element adjacent to an 8-node quadrilateral. Thus, $\tilde{\mathbf u}$ is needed from nodes 1 to 6 for Fig. 19.3:2(a) and from nodes 1 to 9 for Fig. 19.3:2(b). In the former case, $\tilde{\mathbf u}$ should vary linearly between nodes in the form of Eq. (19.2:13) in which $q_{2j-1}$, $q_{2j}$ are nodal values of $\tilde u$, $\tilde v$ at node $j$. For instance, where the crack element is adjacent to a 4-node quadrilateral between nodes 3 and 4, all components of $\mathbf h$ of Eq. (19.2:13) for the edge between these two nodes are zero except

(3)  $h_{1,5} = h_{2,6} = 1 - s/s_{3,4} , \qquad h_{1,7} = h_{2,8} = s/s_{3,4} ,$

where $s$ is the distance from the first node (node 3 in this case) and $s_{i,j}$ denotes the distance between nodes $i$ and $j$. In Fig. 19.3:2(b), the element adjacent to nodes 4 through 6 is an 8-node quadrilateral and $\tilde{\mathbf u}$ should be quadratic between these nodes. Then, all components of $\mathbf h$ for this edge are zero except

(4)  $h_{1,7} = h_{2,8} = \left( 1 - \frac{s}{s_{4,5}} \right) \left( 1 - \frac{s}{s_{4,6}} \right) ,$
$\qquad h_{1,9} = h_{2,10} = \frac{s}{s_{4,5}} \left( \frac{s_{4,6} - s}{s_{4,6} - s_{4,5}} \right) ,$
$\qquad h_{1,11} = h_{2,12} = \left( \frac{s_{4,5} - s}{s_{4,5} - s_{4,6}} \right) \frac{s}{s_{4,6}} .$

We can similarly define the $h$'s of $\tilde{\mathbf u} = \mathbf h \mathbf q$ for other edges.
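The quadratic edge interpolation between nodes 4, 5 and 6 is a Lagrange interpolation in the arc length $s$. The following sketch evaluates the three edge functions and can be used to confirm the nodal values and the partition of unity. The expression for the node-4 function is taken here as the standard Lagrange form, which is an assumption; the function name is illustrative.

```python
def quadratic_edge_functions(s, s45, s46):
    """Lagrange functions through s = 0 (node 4), s = s45 (node 5), s = s46 (node 6)."""
    n4 = (1 - s / s45) * (1 - s / s46)          # assumed standard Lagrange form for node 4
    n5 = (s / s45) * (s46 - s) / (s46 - s45)    # multiplies the nodal values at node 5
    n6 = (s / s46) * (s45 - s) / (s45 - s46)    # multiplies the nodal values at node 6
    return n4, n5, n6

def interpolate(values, s, s45, s46):
    """Interpolate nodal values (at nodes 4, 5, 6) at arc length s."""
    return sum(n * v for n, v in zip(quadratic_edge_functions(s, s45, s46), values))
```

Each function equals one at its own node and zero at the other two, and the three functions sum to one anywhere on the edge, so a displacement that is linear in $s$ is reproduced exactly even when node 5 is not at the midpoint.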

To simplify subsequent discussion, we introduce a complex function $\tilde u_c$ and a complex interpolation function matrix (row vector) $\mathbf h_c$ such that

(5)  $\tilde u_c = \tilde u + i \tilde v = \mathbf h_c \mathbf q ,$

where $\tilde u$, $\tilde v$ are the $x$, $y$ components of $\tilde{\mathbf u}$. The components of $\mathbf h_c$ are

(6)  $(h_c)_{2j-1} = h_{1,2j-1} , \qquad (h_c)_{2j} = i \, h_{2,2j} ,$

in which $i = \sqrt{-1}$ is the imaginary unit and the $h$'s are real functions. Note that $h_{1,2j-1} = h_{2,2j}$. We will use $i$ as $\sqrt{-1}$ throughout the remainder of the chapter.


Hybrid Displacement Element. We shall discuss a method of assuming $\boldsymbol\sigma$, $\mathbf u$, $\tilde{\mathbf u}$, $\mathbf T$, $\tilde{\mathbf T}$ within the element. Consider the choice of field variables with

(7)  $\tilde{\mathbf T} = \mathbf T = \mathbf n'^T \boldsymbol\sigma , \qquad \boldsymbol\sigma = \mathbf D_e \mathbf e = \mathbf D_e \mathbf d_e \mathbf u .$

In this case, we only need to assume $\mathbf u$ inside the element and $\tilde{\mathbf u}$ along the element boundaries, and derive the stresses and the traction from the equation above. Since there is no compatibility constraint on $\mathbf u$, one just picks functions that contain the appropriate singularity for $\mathbf u$. We introduce the following coordinate transformation:

(8)  $x = \xi^2 - \eta^2 , \qquad y = 2\xi\eta , \qquad z = x + iy = (\xi + i\eta)^2 ,$

(9)  $\xi = \sqrt{(r + x)/2} , \qquad \eta = \mathrm{sgn}(y) \sqrt{(r - x)/2} ,$

where $r = \sqrt{x^2 + y^2}$ and $\mathrm{sgn}(y)$ equals 1 if $y > 0$ and equals $-1$ if $y < 0$. Note that $\xi$ and $\eta$ are not the natural coordinates defined in Chapter 18. We assume $\mathbf u$ in the form


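$\mathbf u = \mathbf p \boldsymbol\alpha = \begin{bmatrix} 1 & \xi & \eta & \xi^2 & \cdots & 0 & 0 & 0 & 0 & \cdots \\ 0 & 0 & 0 & 0 & \cdots & 1 & \xi & \eta & \xi^2 & \cdots \end{bmatrix} \boldsymbol\alpha ,$

where

$\boldsymbol\alpha^T = [\alpha_1 \ \alpha_2 \ \cdots \ \alpha_p \ \alpha_{p+1} \ \cdots \ \alpha_{2p}] .$

In the equations above, $p$ equals 3 for linear and equals 6 for complete quadratic polynomials.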
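The coordinate transformation of Eqs. (8) and (9) can be exercised numerically to see that $\xi^2 + \eta^2 = r$, so that $\xi$ and $\eta$ scale with $\sqrt{r}$. The following is a minimal sketch; the function names are illustrative, and taking $\mathrm{sgn}(0) = +1$ is an assumption made here only so that the function is defined on $y = 0$.

```python
import math

def to_xi_eta(x, y):
    """Eq. (9): map (x, y) to (xi, eta) in the right half plane xi >= 0."""
    r = math.hypot(x, y)
    sign = 1.0 if y >= 0 else -1.0   # assumption: sgn(0) taken as +1
    xi = math.sqrt(max((r + x) / 2, 0.0))
    eta = sign * math.sqrt(max((r - x) / 2, 0.0))
    return xi, eta

def to_xy(xi, eta):
    """Eq. (8): x = xi^2 - eta^2, y = 2 xi eta."""
    return xi ** 2 - eta ** 2, 2 * xi * eta

def dxi_dx_fd(x, y, d=1e-6):
    """Central-difference estimate of d(xi)/dx, for comparison with xi / (2 r)."""
    return (to_xi_eta(x + d, y)[0] - to_xi_eta(x - d, y)[0]) / (2 * d)
```

A round trip through `to_xi_eta` and `to_xy` should return the starting point, and the finite-difference derivative should agree with $\xi/(2r)$, which grows like $1/\sqrt{r}$ as the crack tip is approached.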

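The derivatives with respect to $x$, $y$ are calculated as follows:

$\frac{\partial}{\partial x} = \frac{\partial \xi}{\partial x} \frac{\partial}{\partial \xi} + \frac{\partial \eta}{\partial x} \frac{\partial}{\partial \eta} = \frac{1}{2r} \left( \xi \frac{\partial}{\partial \xi} - \eta \frac{\partial}{\partial \eta} \right) ,$

$\frac{\partial}{\partial y} = \frac{\partial \xi}{\partial y} \frac{\partial}{\partial \xi} + \frac{\partial \eta}{\partial y} \frac{\partial}{\partial \eta} = \frac{1}{2r} \left( \eta \frac{\partial}{\partial \xi} + \xi \frac{\partial}{\partial \eta} \right) ,$

which give a $1/\sqrt{r}$ singularity. Any polynomial of $\xi$, $\eta$ is a polynomial of $\sqrt{r}$ near the crack tip. If $\mathbf u$ is assumed to be a polynomial of $\xi$, $\eta$, then $\mathbf u$ is a function of $\sqrt{r}$ and its gradients in the $z$-plane are proportional to $1/\sqrt{r}$. The singularity of strains and stresses is $1/\sqrt{r}$ at the crack tip. Thus, the singularity of the strain energy density is $1/r$. Substituting the assumed functions

$\tilde{\mathbf u} = \mathbf h \mathbf q , \qquad \mathbf u = \mathbf p \boldsymbol\alpha , \qquad \mathbf T = \mathbf n'^T \mathbf D_e \mathbf d_e \mathbf p \boldsymbol\alpha$

into the hybrid displacement functional Eq. (19.2:20), we obtain

(10)  $\Pi_{HDe} = \boldsymbol\alpha^T \mathbf k_{uq} \mathbf q - \tfrac{1}{2} \boldsymbol\alpha^T \mathbf k_{uu} \boldsymbol\alpha - \boldsymbol\alpha^T \mathbf f_u ,$

where

(11)  $\boldsymbol\alpha^T \mathbf k_{uq} \mathbf q = \int_{\partial A_e - \partial A_{\sigma e}} \tilde{\mathbf u}^T \mathbf T \, ds = \boldsymbol\alpha^T \left[ \int_{\partial A_e - \partial A_{\sigma e}} (\mathbf d_e \mathbf p)^T \mathbf D_e \mathbf n' \mathbf h \, ds \right] \mathbf q ,$

$\boldsymbol\alpha^T \mathbf k_{uu} \boldsymbol\alpha = 2 \int_{\partial A_e - \partial A_{\sigma e}} \mathbf u^T \mathbf T \, ds - \int_{A_e} \mathbf e^T \mathbf D_e \mathbf e \, dA = \boldsymbol\alpha^T \left[ 2 \int_{\partial A_e - \partial A_{\sigma e}} \mathbf p^T \mathbf n'^T \mathbf D_e \mathbf d_e \mathbf p \, ds - \int_{A_e} (\mathbf d_e \mathbf p)^T \mathbf D_e \mathbf d_e \mathbf p \, dA \right] \boldsymbol\alpha ,$

$\boldsymbol\alpha^T \mathbf f_u = \int_{A_e} \mathbf u^T \mathbf b \, dA + \int_{\partial A_{\sigma e}} \mathbf u^T \bar{\mathbf T} \, ds = \boldsymbol\alpha^T \left( \int_{A_e} \mathbf p^T \mathbf b \, dA + \int_{\partial A_{\sigma e}} \mathbf p^T \bar{\mathbf T} \, ds \right) .$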
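Note that $\mathbf d_e \mathbf p$ is equivalent to $\mathbf B$ of the displacement model. Since the assumed $\mathbf u$ of different elements is independent of each other, we can eliminate $\boldsymbol\alpha$ using the stationary condition of $\Pi_{HDe}$ with respect to $\boldsymbol\alpha$:

(12)  $\mathbf k_{uu} \boldsymbol\alpha = \mathbf k_{uq} \mathbf q - \mathbf f_u , \quad \text{or} \quad \boldsymbol\alpha = \mathbf k_{uu}^{-1} \mathbf k_{uq} \mathbf q - \mathbf k_{uu}^{-1} \mathbf f_u .$

A substitution of the result above into Eq. (10) yields

(13)  $\Pi_{HDe} = \tfrac{1}{2} \mathbf q^T \mathbf k \mathbf q - \mathbf q^T \mathbf f ,$

where

(14)  $\mathbf k = \mathbf k_{uq}^T \mathbf k_{uu}^{-1} \mathbf k_{uq} , \qquad \mathbf f = \mathbf k_{uq}^T \mathbf k_{uu}^{-1} \mathbf f_u .$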

Care must be taken in carrying out the integration near the crack tip in Eq. (11). The integration along $\partial A_{\sigma e}$ involves terms of order $\sqrt{r}$ and the integrand over the element involves terms of order $1/r$. Using Gauss quadrature based on the $x$, $y$ coordinates for numerical integration will result in a loss of accuracy (convergence rate) because numerical integration simply approximates the singular integrand as a polynomial. Therefore, we ought to carry out the integration analytically near the crack tip or over the $\xi$, $\eta$-plane. Unfortunately the element shape in the transformed plane is no longer a simple rectangle, which makes the area integration nontrivial. Either analytical integration or integration over the $\xi$, $\eta$-plane makes the implementation of the hybrid approach more complicated.
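The loss of accuracy described above can be seen in a one-dimensional analogue. The sketch below, an illustration only and not the element integration itself, applies a 3-point Gauss-Legendre rule to $\int_0^1 r^{-1/2} \, dr = 2$, first directly in $r$ and then after the substitution $r = t^2$, which plays the role of integrating in the transformed plane. The names `gauss3`, `direct` and `mapped` are illustrative.

```python
import math

# 3-point Gauss-Legendre nodes and weights on [-1, 1].
GAUSS3 = [(-math.sqrt(0.6), 5 / 9), (0.0, 8 / 9), (math.sqrt(0.6), 5 / 9)]

def gauss3(f, lo, hi):
    """Integrate f over [lo, hi] with the 3-point Gauss-Legendre rule."""
    half, mid = 0.5 * (hi - lo), 0.5 * (hi + lo)
    return half * sum(w * f(mid + half * t) for t, w in GAUSS3)

# Direct quadrature of the singular integrand in the physical coordinate r.
direct = gauss3(lambda r: 1 / math.sqrt(r), 0.0, 1.0)

# Substitution r = t**2, dr = 2 t dt: the integrand (1/t) * 2 t is regular.
mapped = gauss3(lambda t: (1 / math.sqrt(t * t)) * 2 * t, 0.0, 1.0)
```

The direct rule underestimates the exact value 2 by roughly 12 percent, while the mapped integrand is a constant and is integrated to round-off accuracy with the same three points.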

Alternative Hybrid Formulation. To circumvent the difficulty discussed above, we choose $\mathbf u$ in the form of Eq. (19.2:23) such that the corresponding stresses $\boldsymbol\sigma_h$ and $\boldsymbol\sigma_p$ are the homogeneous and particular solutions of the equilibrium equation defined in Eq. (19.2:24). We can then evaluate the element matrices using the functional given in Eq. (19.2:25), which involves line integrals only. Using this type of assumed stresses and displacements, the hybrid displacement and hybrid stress models become the same.

For simplicity, we shall consider only the case of zero body force in the subsequent discussion. Then $\mathbf u_p = \mathbf T_p = 0$ and Eq. (19.2:25) reduces to

(15)  $\Pi_{HDe} = \int_{\partial A_e - \partial A_{\sigma e}} \tilde{\mathbf u}^T \mathbf T_h \, ds - \tfrac{1}{2} \int_{\partial A_e} \mathbf u_h^T \mathbf T_h \, ds + \int_{\partial A_{\sigma e}} \mathbf u_h^T (\mathbf T_h - \bar{\mathbf T}) \, ds + \text{constant} .$

For isotropic materials, the homogeneous solution $\mathbf u_h$ and $\boldsymbol\sigma_h$ can be derived from the complex potentials $\psi$, $\chi$ as given in Eqs. (9.6:3)-(9.6:5), which are repeated here for convenience

(16)  $\sigma_{xx} + \sigma_{yy} = 2 \left[ \psi'(z) + \overline{\psi'(z)} \right] ,$
$\qquad \sigma_{yy} - \sigma_{xx} + 2i\sigma_{xy} = 2 \left[ \bar z \psi''(z) + \chi''(z) \right] ,$
$\qquad 2G(u + iv) = \kappa \psi(z) - z \overline{\psi'(z)} - \overline{\chi'(z)} ,$

where $z = x + iy$ is a complex variable, the over bar denotes the complex conjugate of the relevant function, one and two primes denote first and second derivatives with respect to $z$, and

$\kappa = 3 - 4\nu$ for plane strain,

$\kappa = (3 - \nu)/(1 + \nu)$ for plane stress.

The complex potentials $\psi$, $\chi$ are two arbitrary analytic functions of $z$.

To construct the element matrices, we assume the complex potentials $\psi$, $\chi'$ in terms of unknown parameters and derive the stresses and the displacements according to Eq. (16). (Note that $\chi$ does not appear explicitly. Thus, we may assume $\chi'$ directly.) The derived stresses and displacements automatically satisfy both the compatibility and equilibrium equations within each element. We shall choose $\psi$, $\chi'$ containing the appropriate singularity at the crack tip. This can be done easily by introducing the transformation

$z = \zeta^2 = (\xi + i\eta)^2 ,$

which is known as a conformal mapping (Muskhelishvili 1953) that maps the $x$, $y$ plane to the right half $\xi$, $\eta$ plane. It can be shown that $\xi$ and $\eta$ are related to $x$ and $y$ by Eqs. (8) and (9). The upper ($y = 0^+$) and lower ($y = 0^-$) crack surfaces on the negative x-axis are mapped, respectively, to the positive and negative parts of the $\eta$-axis. If we assume $\psi$, $\chi'$ as polynomials of $\zeta$, the derived stresses and strains will automatically contain the appropriate square root singularity at the crack tip.

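The derivatives with respect to $z$ and $\zeta$ are related by the equation $\dfrac{d}{dz} = \dfrac{1}{2\zeta} \dfrac{d}{d\zeta}$. From Eq. (16), one can write the stresses and displacements as functions of $\xi$, $\eta$

(17)  $\sigma_{xx} + \sigma_{yy} = \frac{1}{\zeta} \frac{d\psi(\zeta)}{d\zeta} + \frac{1}{\bar\zeta} \overline{\frac{d\psi(\zeta)}{d\zeta}} ,$
$\qquad \sigma_{yy} - \sigma_{xx} + 2i\sigma_{xy} = \frac{\bar\zeta^2}{2\zeta} \frac{d}{d\zeta} \left[ \frac{1}{\zeta} \frac{d\psi(\zeta)}{d\zeta} \right] + \frac{1}{\zeta} \frac{d\chi'(\zeta)}{d\zeta} ,$
$\qquad 2G(u + iv) = \kappa \psi(\zeta) - \frac{\zeta^2}{2\bar\zeta} \overline{\frac{d\psi(\zeta)}{d\zeta}} - \overline{\chi'(\zeta)} .$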



Clearly, the displacements and stresses are proportional to $\zeta$, $1/\zeta$ (i.e., $\sqrt{r}$, $1/\sqrt{r}$), respectively, at the crack tip.

We can enforce the traction condition a priori to further simplify the finite element formulation. Let us assume $\psi$, $\chi'$ in the form

$\psi = \psi_{th} + \psi_{hh} , \qquad \chi' = \chi'_{th} + \chi'_{hh} ,$

where $\psi_{th}$, $\chi'_{th}$ are particular solutions satisfying the non-zero prescribed boundary traction, while $\psi_{hh}$, $\chi'_{hh}$ denote the homogeneous solution satisfying the traction free condition on the crack surfaces. For a prescribed traction of the form

$\bar{\mathbf T} = \begin{bmatrix} (\sigma_{xy})_1 + (\sigma_{xy})_2 x \\ (\sigma_{yy})_1 + (\sigma_{yy})_2 x \end{bmatrix}$  on the lower crack surface ($y = 0^-$, $x < 0$),

$\bar{\mathbf T} = - \begin{bmatrix} (\sigma_{xy})_1 + (\sigma_{xy})_2 x \\ (\sigma_{yy})_1 + (\sigma_{yy})_2 x \end{bmatrix}$  on the upper crack surface ($y = 0^+$, $x < 0$),

where the $(\cdot)$'s are known constants, one can show that

(18)  $\psi_{th} = c_2 z^2 , \qquad \chi'_{th} = c_1 z ,$

where

$c_1 = (\sigma_{yy})_1 + i(\sigma_{xy})_1 , \qquad c_2 = (\sigma_{yy})_2/6 + i(\sigma_{xy})_2/2 .$

The particular solution for $\psi_{th}$, $\chi'_{th}$ is not unique. It can be shown that the solution for $\psi$, $\chi'$ is independent of the specific form of the particular solution if the assumed homogeneous solution is sufficiently general. For a more general prescribed traction, one can approximate it as a polynomial near the crack tip and derive the particular solution accordingly.
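The particular solution of Eq. (18) can be checked by evaluating $\sigma_{yy} + i\sigma_{xy}$ on the crack line. Adding the first two relations of Eq. (16) gives $\sigma_{yy} + i\sigma_{xy} = \psi' + \overline{\psi'} + \bar z \psi'' + \chi''$, which the sketch below evaluates for $\psi = c_2 z^2$, $\chi' = c_1 z$. The function names are illustrative.

```python
def particular_constants(syy1, sxy1, syy2, sxy2):
    """Constants c1, c2 of Eq. (18) from the prescribed traction coefficients."""
    c1 = complex(syy1, sxy1)
    c2 = complex(syy2 / 6, sxy2 / 2)
    return c1, c2

def syy_plus_i_sxy(z, c1, c2):
    """sigma_yy + i sigma_xy from Eq. (16) for psi = c2 z^2 and chi' = c1 z."""
    dpsi = 2 * c2 * z      # psi'(z)
    d2psi = 2 * c2         # psi''(z)
    d2chi = c1             # chi''(z)
    return dpsi + dpsi.conjugate() + z.conjugate() * d2psi + d2chi
```

On $y = 0$ the result should equal $(\sigma_{yy})_1 + (\sigma_{yy})_2 x + i[(\sigma_{xy})_1 + (\sigma_{xy})_2 x]$, which reproduces the prescribed stresses on both crack faces.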

We can derive the homogeneous solution using the analytic continuation technique (Muskhelishvili 1953, Tong 1984). From Eq. (9.6:7), the traction free condition can be written as

$\psi_{hh}(z) + z \overline{\psi'_{hh}(z)} + \overline{\chi'_{hh}(z)} = 0$

on the crack surfaces, $z = x \ (< 0)$, in the $z$-plane, or

(19)  $\psi_{hh}(\zeta) + \frac{\zeta^2}{2\bar\zeta} \overline{\frac{d\psi_{hh}(\zeta)}{d\zeta}} + \overline{\chi'_{hh}(\zeta)} = 0$

on the imaginary axis, $\zeta = i\eta$, in the $\zeta$-plane. The analytic continuation in the present case simply replaces $\bar\zeta$ by $-\zeta$ in Eq. (19) because $\bar\zeta = -\zeta$ on the imaginary axis. This makes Eq. (19) an analytic equation of $\zeta$. Thus $\chi'_{hh}$ can be written as an analytic function in terms of $\psi_{hh}$:

(20)  $\chi'_{hh}(\zeta) = -\bar\psi_{hh}(-\zeta) - \frac{\zeta}{2} \frac{d\psi_{hh}(\zeta)}{d\zeta} ,$

where the short bar over $\psi_{hh}$ denotes the complex conjugate of the function while the independent variable itself remains in the form as given in the argument. For instance, if

(21)  $\psi_{hh}(\zeta) = a_1 \zeta + a_2 \zeta^2 + a_3 \zeta^3 + \cdots ,$

then

$\bar\psi_{hh}(-\zeta) = \bar a_1 (-\zeta) + \bar a_2 (-\zeta)^2 + \bar a_3 (-\zeta)^3 + \cdots .$

With $\chi'_{hh}$ in terms of $\psi_{hh}$ as in Eq. (20), the traction free condition Eq. (19) is automatically satisfied, since $\bar\zeta = -\zeta$ on the imaginary axis in the $\zeta$-plane.

We assume the complex potential $\psi_{hh}$ as a simple polynomial of $\zeta$ as in Eq. (21) and derive $\chi'_{hh}$ according to Eq. (20)

(22)  $\chi'_{hh}(\zeta) = -\bar\psi_{hh}(-\zeta) - \frac{\zeta}{2} \frac{d\psi_{hh}(\zeta)}{d\zeta} = \left( \bar a_1 - \tfrac{1}{2} a_1 \right) \zeta - (a_2 + \bar a_2) \zeta^2 + \left( \bar a_3 - \tfrac{3}{2} a_3 \right) \zeta^3 - \cdots ,$

where the $a$'s are complex constants. The stresses derived from the assumed $\psi_{hh}$, $\chi'_{hh}$ in Eqs. (21) and (22) satisfy zero traction on the crack surfaces. One can show that the imaginary part of $a_2$ gives no contribution to stresses and, therefore, has no contribution to the element matrices. It actually represents a rigid body motion and can be set to zero without any loss of generality.
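The claim that Eq. (20) enforces the traction free condition can be tested numerically. The sketch below builds the coefficients of $\chi'_{hh}$ from those of $\psi_{hh}$, namely $b_j = -(-1)^j \bar a_j - \tfrac{j}{2} a_j$, and evaluates the left side of Eq. (19) at points $\zeta = i\eta$. The function names are illustrative, and the coefficients must be passed as Python `complex` values.

```python
def chi_hh_coefficients(a):
    """a[j-1] multiplies zeta**j in psi_hh; return the coefficients of chi'_hh per Eq. (20)."""
    return [-((-1) ** j) * aj.conjugate() - 0.5 * j * aj
            for j, aj in enumerate(a, start=1)]

def poly(coeffs, zeta):
    """Evaluate sum_j coeffs[j-1] * zeta**j."""
    return sum(c * zeta ** j for j, c in enumerate(coeffs, start=1))

def dpoly(coeffs, zeta):
    """Evaluate the derivative of poly with respect to zeta."""
    return sum(j * c * zeta ** (j - 1) for j, c in enumerate(coeffs, start=1))

def traction_free_residual(a, zeta):
    """Left side of Eq. (19): psi + zeta^2 / (2 conj(zeta)) conj(dpsi/dzeta) + conj(chi')."""
    b = chi_hh_coefficients(a)
    return (poly(a, zeta)
            + zeta ** 2 / (2 * zeta.conjugate()) * dpoly(a, zeta).conjugate()
            + poly(b, zeta).conjugate())
```

For any complex coefficients the residual should vanish, up to round-off, on the imaginary axis of the $\zeta$-plane, which corresponds to the two crack faces.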

As mentioned before, the assumed complex potentials of Eqs. (21) and (22) give a $1/\sqrt{r}$ stress singularity at the crack tip. The coefficient of the singular term at the crack tip is a measure of the intensity of the singularity. Thus, one defines [Eq. (9.6:35) with the coordinates at the center of the crack]

(23)  $K_I - iK_{II} = 2\sqrt{2\pi} \lim_{z \to 0} \sqrt{z} \, \frac{d\psi}{dz} = \sqrt{2\pi} \lim_{\zeta \to 0} \frac{d\psi}{d\zeta} = \sqrt{2\pi} \, a_1 ,$

where $K_I$, $K_{II}$ are real constants often referred to as the stress intensity factors (Sih and Liebowitz 1968). They are associated with the symmetric (mode I) and antisymmetric (mode II) deformations about the x-axis, respectively. Mode I is a crack opening deformation and mode II corresponds to shear along the crack plane at the crack tip. Using the hybrid approach, we can evaluate the stress intensity factors directly from $a_1$. Stress intensity factors are important parameters in fracture mechanics for structural integrity assessment (Tong 1984).
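Once the element solution provides the leading coefficient $a_1$, Eq. (23) gives both stress intensity factors at once. The sketch below extracts them and, as a consistency check, evaluates the stress ahead of the crack tip. The expression used in `stress_ahead` is derived here (not quoted from the text) by evaluating Eq. (17) with Eqs. (21) and (22) on $\theta = 0$, where $\zeta = \sqrt{r}$, which gives $\sigma_{yy} + i\sigma_{xy} = \bar a_1/\sqrt{r}$ for the leading term. The function names are illustrative.

```python
import math

def stress_intensity_factors(a1):
    """Return (K_I, K_II) from K_I - i K_II = sqrt(2 pi) a1, Eq. (23)."""
    k = math.sqrt(2 * math.pi) * complex(a1)
    return k.real, -k.imag

def stress_ahead(a1, r):
    """Leading-term (sigma_yy, sigma_xy) at distance r ahead of the tip (theta = 0)."""
    s = complex(a1).conjugate() / math.sqrt(r)
    return s.real, s.imag
```

The values returned by `stress_ahead` should agree with $K_I/\sqrt{2\pi r}$ and $K_{II}/\sqrt{2\pi r}$, the $\theta = 0$ values of the asymptotic crack tip field.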

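For problems with non-zero prescribed surface traction, let $\mathbf u_{th}$, $\mathbf T_{th}$ and $\mathbf u_{hh}$, $\mathbf T_{hh}$ be the quantities derived from $\psi_{th}$, $\chi'_{th}$ and $\psi_{hh}$, $\chi'_{hh}$, respectively. Taking into account that $\mathbf T_{th} = \bar{\mathbf T}$ and $\mathbf T_{hh} = 0$ on the crack surface $\partial A_{\sigma e}$, we can show that Eq. (15) reduces to Eq. (19.2:29) with $\mathbf u_p = \mathbf T_p = 0$ in the form

(24)  $\Pi_{HDe} = \int_{\partial A_e - \partial A_{\sigma e}} \left( \tilde{\mathbf u}^T \mathbf T_{hh} - \tfrac{1}{2} \mathbf u_{hh}^T \mathbf T_{hh} \right) ds + \int_{\partial A_e - \partial A_{\sigma e}} \tilde{\mathbf u}^T \mathbf T_{th} \, ds$
$\qquad\qquad - \int_{\partial A_e - \partial A_{\sigma e}} \tfrac{1}{2} \left( \mathbf u_{hh}^T \mathbf T_{th} + \mathbf u_{th}^T \mathbf T_{hh} \right) ds - \int_{\partial A_{\sigma e}} \tfrac{1}{2} \mathbf u_{hh}^T \bar{\mathbf T} \, ds ,$

which involves line integration only. All integrals except the last one do not involve the crack tip. One can integrate them using regular numerical integration. The following identity is used for the integration in Eq. (24)

(25)  $T_c = T_x - iT_y = \frac{\sigma_{xx} + \sigma_{yy}}{2} e^{-i\theta} - \frac{\sigma_{yy} - \sigma_{xx} + 2i\sigma_{xy}}{2} e^{i\theta}$
$\qquad\quad = \frac{1}{2} \left[ \frac{1}{\zeta} \frac{d\psi(\zeta)}{d\zeta} + \frac{1}{\bar\zeta} \overline{\frac{d\psi(\zeta)}{d\zeta}} \right] e^{-i\theta} - \frac{1}{2} \left\{ \frac{\bar\zeta^2}{2\zeta} \frac{d}{d\zeta} \left[ \frac{1}{\zeta} \frac{d\psi(\zeta)}{d\zeta} \right] + \frac{1}{\zeta} \frac{d\chi'(\zeta)}{d\zeta} \right\} e^{i\theta} ,$

where $\theta$ is the angle between the x-axis and the unit normal to the boundary. For elements with straight edges, $\theta$ is a constant along each side. Substituting $\psi_{th}$, $\chi'_{th}$ from Eq. (18) and $\psi_{hh}$, $\chi'_{hh}$ from Eqs. (21) and (22) into Eq. (25), we obtain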


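(26)  $T_{hh} + T_{th} = T_c = T_x - iT_y ,$

(27)  $u_{hh} + u_{th} = u + iv ,$

where the subscripts $hh$ and $th$ denote quantities derived from $\psi_{hh}$, $\chi'_{hh}$ and $\psi_{th}$, $\chi'_{th}$, respectively. Note that bold $\mathbf u$ and $\mathbf T$ denote matrices and subscripted italic $u$ and $T$ denote the corresponding complex quantities, e.g.,

$u_{hh} = \mathrm{Re}(u_{hh}) + i \, \mathrm{Im}(u_{hh}) , \qquad T_{hh} = \mathrm{Re}(T_{hh}) + i \, \mathrm{Im}(T_{hh}) ,$

$\mathbf u_{hh} = \begin{bmatrix} u \\ v \end{bmatrix}_{hh} = \begin{bmatrix} \mathrm{Re}(u_{hh}) \\ \mathrm{Im}(u_{hh}) \end{bmatrix} , \qquad \mathbf T_{hh} = \begin{bmatrix} T_x \\ T_y \end{bmatrix}_{hh} = \begin{bmatrix} \mathrm{Re}(T_{hh}) \\ -\mathrm{Im}(T_{hh}) \end{bmatrix} ,$

where $\mathrm{Re}(\cdot)$ and $\mathrm{Im}(\cdot)$ are, respectively, the real and the imaginary parts of the corresponding function. Thus $\mathrm{Re}(u_{hh})$ and $\mathrm{Im}(u_{hh})$ are, respectively, the x- and y-components of $\mathbf u_{hh}$, and similarly for the traction $\mathbf T_{hh}$, with the sign of the imaginary part reversed. All $u$'s and $T$'s are complex functions of $\zeta$, $\bar\zeta$ and can be written in the form

(27a)  $T_{hh} = \mathbf G_c \boldsymbol\alpha , \qquad u_{hh} = \mathbf h_c \boldsymbol\alpha$

in which

$\mathbf G_c = [G_1 \ G_2 \ \cdots] , \qquad \mathbf h_c = [h_{c1} \ h_{c2} \ \cdots] , \qquad \boldsymbol\alpha^T = [\alpha_1 \ \alpha_2 \ \cdots] ,$

$a_1 = \alpha_1 + i\alpha_2 , \qquad a_2 = \alpha_3 , \qquad a_p = \alpha_{2p-2} + i\alpha_{2p-1} \quad \text{for } p \ge 3 ,$

with the $\alpha$'s being real unknowns and the $a$'s being the coefficients of Eq. (21). Using Eqs. (18), (21), (22), (25) and (26), after some algebraic manipulations, we find

$G_1 = \frac{1}{2} \left( \frac{1}{\zeta} + \frac{1}{\bar\zeta} \right) \exp(-i\theta) + \frac{\bar\zeta^2 - \zeta^2}{4\zeta^3} \exp(i\theta) ,$

$G_2 = \frac{i}{2} \left( \frac{1}{\zeta} - \frac{1}{\bar\zeta} \right) \exp(-i\theta) + \frac{i(\bar\zeta^2 + 3\zeta^2)}{4\zeta^3} \exp(i\theta) ,$

$G_3 = 4\cos\theta ,$

$G_4 = \frac{3}{2} (\zeta + \bar\zeta) \exp(-i\theta) - \frac{3(\bar\zeta^2 - \zeta^2)}{4\zeta} \exp(i\theta) ,$

$G_5 = \frac{3i}{2} (\zeta - \bar\zeta) \exp(-i\theta) - \frac{3i(\bar\zeta^2 - 5\zeta^2)}{4\zeta} \exp(i\theta) ;$

$h_{c1} = (2\kappa\zeta - \bar\zeta - \zeta^2/\bar\zeta)/(4G) , \qquad h_{c2} = i(2\kappa\zeta - 3\bar\zeta + \zeta^2/\bar\zeta)/(4G) ,$

$h_{c3} = (\kappa\zeta^2 + 2\bar\zeta^2 - \zeta^2)/(2G) , \qquad h_{c4} = (2\kappa\zeta^3 + \bar\zeta^3 - 3\zeta^2\bar\zeta)/(4G) ,$

$h_{c5} = i(2\kappa\zeta^3 - 5\bar\zeta^3 + 3\zeta^2\bar\zeta)/(4G) ;$

$T_{th} = 2(c_2\zeta^2 + \bar c_2\bar\zeta^2) \exp(-i\theta) - (2c_2\bar\zeta^2 + c_1) \exp(i\theta) ,$

$u_{th} = (\kappa c_2 \zeta^4 - \bar c_1 \bar\zeta^2 - 2\bar c_2 \zeta^2 \bar\zeta^2)/(2G) .$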
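The closed forms for the traction and displacement functions can be cross-checked against a direct evaluation of Eqs. (17), (22) and (25). The sketch below is such a check under stated assumptions: the closed forms coded in `closed_forms` are the ones obtained by carrying out that substitution term by term (for example, the third traction function evaluates to $4\cos\theta$), and all function names are illustrative.

```python
import cmath

def coefficients(alpha):
    """Map five real parameters to a1 = alpha1 + i alpha2, a2 = alpha3, a3 = alpha4 + i alpha5."""
    return [complex(alpha[0], alpha[1]), complex(alpha[2], 0.0), complex(alpha[3], alpha[4])]

def fields_from_coefficients(a, zeta, theta, g, kappa):
    """Return (u + iv, T_x - iT_y) for psi_hh = sum a_j zeta^j via Eqs. (17), (22), (25)."""
    zb = zeta.conjugate()
    psi = dpsi = d_inner = chi = dchi = 0j
    for j, aj in enumerate(a, start=1):
        bj = -((-1) ** j) * aj.conjugate() - 0.5 * j * aj   # coefficient of chi'_hh, Eq. (22)
        psi += aj * zeta ** j
        dpsi += j * aj * zeta ** (j - 1)
        d_inner += j * (j - 2) * aj * zeta ** (j - 3)        # d/dzeta of (1/zeta) dpsi/dzeta
        chi += bj * zeta ** j
        dchi += j * bj * zeta ** (j - 1)
    trace = dpsi / zeta + (dpsi / zeta).conjugate()          # sigma_xx + sigma_yy
    dev = zb ** 2 / (2 * zeta) * d_inner + dchi / zeta       # sigma_yy - sigma_xx + 2i sigma_xy
    u = (kappa * psi - zeta ** 2 / (2 * zb) * dpsi.conjugate() - chi.conjugate()) / (2 * g)
    t = 0.5 * trace * cmath.exp(-1j * theta) - 0.5 * dev * cmath.exp(1j * theta)
    return u, t

def closed_forms(zeta, theta, g, kappa):
    """Closed-form h_c1..h_c5 and G_1..G_5 for unit values of alpha_1..alpha_5."""
    zb = zeta.conjugate()
    em, ep = cmath.exp(-1j * theta), cmath.exp(1j * theta)
    h = [
        (2 * kappa * zeta - zb - zeta ** 2 / zb) / (4 * g),
        1j * (2 * kappa * zeta - 3 * zb + zeta ** 2 / zb) / (4 * g),
        (kappa * zeta ** 2 + 2 * zb ** 2 - zeta ** 2) / (2 * g),
        (2 * kappa * zeta ** 3 + zb ** 3 - 3 * zeta ** 2 * zb) / (4 * g),
        1j * (2 * kappa * zeta ** 3 - 5 * zb ** 3 + 3 * zeta ** 2 * zb) / (4 * g),
    ]
    gc = [
        0.5 * (1 / zeta + 1 / zb) * em + (zb ** 2 - zeta ** 2) / (4 * zeta ** 3) * ep,
        0.5j * (1 / zeta - 1 / zb) * em + 1j * (zb ** 2 + 3 * zeta ** 2) / (4 * zeta ** 3) * ep,
        4 * cmath.cos(theta),
        1.5 * (zeta + zb) * em - 0.75 * (zb ** 2 - zeta ** 2) / zeta * ep,
        1.5j * (zeta - zb) * em - 0.75j * (zb ** 2 - 5 * zeta ** 2) / zeta * ep,
    ]
    return h, gc
```

Setting one $\alpha$ to unity and the others to zero, the two routes should agree at any point $\zeta \ne 0$ and for any boundary normal angle $\theta$.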
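Note that $u_{th}$ and $T_{th}$ are known functions of the prescribed quantities such as the constants $c_1$ and $c_2$. Using Eqs. (5), (26), (27) and (27a), we can express the integrands of Eq. (24) in terms of the complex function matrices $\mathbf h_c$ and $\mathbf G_c$ as

$\tilde{\mathbf u}^T \mathbf T_{hh} = \tilde u \, \mathrm{Re}(T_{hh}) - \tilde v \, \mathrm{Im}(T_{hh}) = \mathrm{Re}(\tilde u_c T_{hh}) = \mathbf q^T \mathrm{Re}(\mathbf h_c^T \mathbf G_c) \boldsymbol\alpha ,$

$\tilde{\mathbf u}^T \mathbf T_{th} = \mathrm{Re}(\tilde u_c T_{th}) = \mathbf q^T \mathrm{Re}(\mathbf h_c^T T_{th}) ,$

$\mathbf u_{hh}^T \mathbf T_{hh} = \mathrm{Re}(u_{hh} T_{hh}) = \boldsymbol\alpha^T \mathrm{Re}(\mathbf h_c^T \mathbf G_c) \boldsymbol\alpha ,$

$\mathbf u_{hh}^T \mathbf T_{th} + \mathbf u_{th}^T \mathbf T_{hh} = \mathrm{Re}(u_{hh} T_{th} + u_{th} T_{hh}) = \boldsymbol\alpha^T \mathrm{Re}(\mathbf h_c^T T_{th} + \mathbf G_c^T u_{th})$

on $\partial A_e - \partial A_{\sigma e}$, and

$\mathbf u_{hh}^T \bar{\mathbf T} = \mathrm{Re}(u_{hh} T_{th}) = \boldsymbol\alpha^T \mathrm{Re}(\mathbf h_c^T T_{th})$

on $\partial A_{\sigma e}$. Here $\mathbf h_c$ multiplying $\mathbf q$ is the boundary interpolation row vector of Eq. (5), whereas $\mathbf h_c$ multiplying $\boldsymbol\alpha$ is the row vector of Eq. (27a). A substitution of these complex functions into Eq. (24) yields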

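(28)  $\Pi_{HDe} = \boldsymbol\alpha^T \mathbf k_{uq} \mathbf q - \tfrac{1}{2} \boldsymbol\alpha^T \mathbf k_{uu} \boldsymbol\alpha - \boldsymbol\alpha^T \mathbf f_u - \mathbf q^T \mathbf f_{\tilde u} ,$

where

$\boldsymbol\alpha^T \mathbf k_{uq} \mathbf q = \int_{\partial A_e - \partial A_{\sigma e}} \tilde{\mathbf u}^T \mathbf T_{hh} \, ds = \boldsymbol\alpha^T \mathrm{Re} \left( \int_{\partial A_e - \partial A_{\sigma e}} \mathbf G_c^T \mathbf h_c \, ds \right) \mathbf q ,$

$\boldsymbol\alpha^T \mathbf k_{uu} \boldsymbol\alpha = \int_{\partial A_e - \partial A_{\sigma e}} \mathbf u_{hh}^T \mathbf T_{hh} \, ds = \boldsymbol\alpha^T \mathrm{Re} \left( \int_{\partial A_e - \partial A_{\sigma e}} \mathbf h_c^T \mathbf G_c \, ds \right) \boldsymbol\alpha ,$

$\boldsymbol\alpha^T \mathbf f_u = \int_{\partial A_e - \partial A_{\sigma e}} \frac{\mathbf u_{hh}^T \mathbf T_{th} + \mathbf u_{th}^T \mathbf T_{hh}}{2} \, ds + \int_{\partial A_{\sigma e}} \frac{\mathbf u_{hh}^T \mathbf T_{th}}{2} \, ds$
$\qquad\quad = \frac{\boldsymbol\alpha^T}{2} \mathrm{Re} \left[ \int_{\partial A_e - \partial A_{\sigma e}} (\mathbf h_c^T T_{th} + \mathbf G_c^T u_{th}) \, ds + \int_{\partial A_{\sigma e}} \mathbf h_c^T T_{th} \, ds \right] ,$

$\mathbf q^T \mathbf f_{\tilde u} = - \int_{\partial A_e - \partial A_{\sigma e}} \tilde{\mathbf u}^T \mathbf T_{th} \, ds = - \mathbf q^T \mathrm{Re} \left( \int_{\partial A_e - \partial A_{\sigma e}} \mathbf h_c^T T_{th} \, ds \right) .$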

It can be shown that $\mathbf k_{uu}$ is symmetric and positive definite. If we had allowed $\mathrm{Im}(a_2)$ to be an unknown parameter in Eq. (21), $\mathbf k_{uu}$ would be positive semi-definite, with the rigid body mode associated with $\mathrm{Im}(a_2)$ being the zero eigenvector. Note that $T_{th} = \bar T_x - i\bar T_y$ on the crack surfaces $\partial A_{\sigma e}$, where $\bar T_x$, $\bar T_y$ are the prescribed tractions, and that $u_{th}$ is a known complex function related to $\bar T_x$, $\bar T_y$ in terms of the prescribed constants $c_1$ and $c_2$.

The integration along $\partial A_e - \partial A_{\sigma e}$ (from nodes 1 to 9, Fig. 19.3:2b) can be performed numerically in the $z$-plane where $\xi$, $\eta$ are related to $x$, $y$ by Eq. (9). The integrals over $\partial A_{\sigma e}$ can be evaluated analytically in the $\zeta$-plane if the prescribed traction is a polynomial. Note that, along $\partial A_{\sigma e}$, $\zeta = i\eta$ and $ds = -2\eta \, d\eta$.

As before, using the stationary condition of $\Pi_{HDe}$ with respect to $\boldsymbol\alpha$ leads to

$\mathbf k_{uq} \mathbf q - \mathbf k_{uu} \boldsymbol\alpha - \mathbf f_u = 0 .$

Solving for $\boldsymbol\alpha$ and substituting into Eq. (28) yields

$\Pi_{HDe} = \tfrac{1}{2} \mathbf q^T \mathbf k \mathbf q - \mathbf q^T \mathbf f ,$

where

$\mathbf k = \mathbf k_{uq}^T \mathbf k_{uu}^{-1} \mathbf k_{uq} , \qquad \mathbf f = \mathbf f_{\tilde u} + \mathbf k_{uq}^T \mathbf k_{uu}^{-1} \mathbf f_u ,$

are the equivalent element stiffness matrix and force matrix, respectively. These element matrices are to be assembled with those from the other part of the domain to solve the crack problem.

When the crack element is too small or when the crack tip is too close to an element boundary, the crack element will lose its effectiveness (Orringer et al. 1977) because the element boundary displacement $\tilde{\mathbf u}$ and the displacements of the adjacent elements are usually polynomials. To improve the accuracy of crack analysis, instead of reducing the size of the crack element, one increases the number of boundary nodes of the crack element and the order of the polynomial for the complex potential $\psi$. One should place the element boundaries a distance away from the crack tip, if possible.


Problem 19.11. Derive the following expression from Eq. (19.3:17) 

20 1 - 1 d 4 - 1 d 

2(a vv + ta xy )- + r- 5r 

ri #(f) i. i dx'(c) 

2c dc [ c dc J c dc 


Find a particular solution $\psi_p$, $\chi'_p$ that satisfies the traction condition

$$\mathbf{T} = \pm\begin{Bmatrix} (\sigma_{xy})_n x^n \\ (\sigma_{yy})_n x^n \end{Bmatrix}$$

on the crack surfaces, $y = 0^{\pm}$, $x < 0$, where $n$ is an integer and
$(\sigma_{xy})_n$, $(\sigma_{yy})_n$ are known constants.
Problem 19.12. Show that Eq. (19.3:25) is true.

19.4. ELEMENTS FOR HETEROGENEOUS MATERIALS 

The super-element concept can also be applied to heterogeneous materials.
Tong and Mei (1992) treated composite materials made of periodic
fibers embedded in a matrix material. They used the method of
homogenization and modeled an individual fiber and its surrounding material as
a basic cell of the structure. The cell was modeled as a super-element of
polygons. Accorsi (1988), Ghosh and his associates (1991, 1994), and Zhang
and Katsube (1994) modeled the inclusions of a heterogeneous material as
two-dimensional elliptical particles of various sizes, aspect ratios, and
orientations scattered in the parent material. They used polygonal elements,
which they called Voronoi cell elements, each of which contains one or more
inclusions. The hybrid approach is used to determine the stiffness matrices
of such elements for inclusions of various shapes and sizes. Such an approach
can be applied to elastic-plastic and thermo-elastic problems (Ghosh and
Moorthy 1995, Ghosh and Liu 1995).

19.5. ELEMENTS FOR INFINITE DOMAIN 

The hybrid technique provides a means to treat problems involving an 
infinite domain. For a problem in which the solution decays rapidly toward 
infinity, one may approximate an infinite domain with finite boundaries far 
from the region of interest. One then solves the problem using elements in 
the finite domain. However, for certain problems (such as those involving 
wave propagation), the disturbance is often transmitted to infinity. Any 
truncation to a finite domain will produce a false solution. 

To treat the problem using a hybrid approach, we divide the infinite
domain into two regions $R_1$ and $R_2$ separated by a curve $C$ (Fig. 19.5:1).
The region $R_1$ is sufficiently large to enclose the area of interest including the
body $B$, and $R_2$ is the rest of the domain extending to infinity. The region
$R_1$ is divided into a finite number of elements in which the finite element
approximation discussed earlier is used. The region $R_2$ will be considered
as a single infinite- or super-element. Within $R_2$ the asymptotic solution is
used as the approximation. The hybrid technique is then used to match the
solutions of the two regions. We shall consider two examples to illustrate
the idea.

(a) Plane Elasticity: Consider a plane elasticity problem with zero body
force and zero deformation gradient at infinity. If either condition does not
hold, we can introduce a particular solution so that the general problem
is reduced to one having zero body force and a decaying deformation
gradient in the far field. A variational functional for the entire domain is
written as

$$\Pi = \Pi_{R_1} + \Pi_{R_2},$$

where $\Pi_{R_1}$ is the functional for the finite domain $R_1$, which can be the
sum of the functionals for all the elements within the region, and $\Pi_{R_2}$ is the
hybrid functional for $R_2$, which is in the same form as Eq. (19.2:19):

(1) $\Pi_{R_2} = \int_C (\tilde{\mathbf{u}}^T - \mathbf{u}^T)\mathbf{T}\,ds + \int_{R_2} \tfrac{1}{2}\mathbf{e}^T\mathbf{D}_e\mathbf{e}\,dA,$

where $C$ is the curve that separates $R_1$ and $R_2$. We have made an implicit
assumption here that the integral over the infinite domain is bounded. We
assume $\tilde{\mathbf{u}}$ to be the same as that over the region $R_1$ along the inter-element
boundaries. The form of $\tilde{\mathbf{u}}$ is similar to that for the crack element as illustrated in
Eqs. (19.3:3) and (19.3:4). Thus, interelement compatibility in the sense
of Eq. (19.2:5) is ensured. By assuming $\mathbf{T}$ in terms of $\mathbf{u}$ as in Eq. (19.2:21)
and choosing $\mathbf{u}$ in terms of asymptotic solutions (satisfying the equilibrium
equations and decaying to zero at infinity), we can avoid the integration
over the infinite domain. Equation (1) becomes



Fig. 19.5:1. An infinite-domain element. 
(2) $\Pi_{R_2} = \int_C (\tilde{\mathbf{u}}^T - \mathbf{u}^T)\mathbf{T}\,ds + \tfrac{1}{2}\int_C \mathbf{u}^T\mathbf{T}\,ds + \tfrac{1}{2}\int_{C_{R\to\infty}} \mathbf{u}^T\mathbf{T}\,ds = \int_C \left(\tilde{\mathbf{u}}^T - \tfrac{1}{2}\mathbf{u}^T\right)\mathbf{T}\,ds.$

This functional is the same as the homogeneous part of Eq. (19.2:29),
i.e., the part that excludes $\mathbf{u}_{th}$, $\mathbf{u}_p$, $\mathbf{T}_{th}$, $\mathbf{T}_p$, $\bar{\mathbf{T}}$. The integral over the curve with
the radius $R \to \infty$ vanishes due to the decaying nature of the asymptotic
solution.

An asymptotic solution of $\mathbf{u}$ can be expressed in terms of complex
potentials of the following form:

(3) $\psi(z) = (\alpha_1 + i\alpha_2)/z + (\alpha_3 + i\alpha_4)/z^2 + \cdots + (\alpha_{2p-1} + i\alpha_{2p})/z^p,$

(4) $\chi'(z) = (\alpha_{2p+1} + i\alpha_{2p+2})/z + (\alpha_{2p+3} + i\alpha_{2p+4})/z^2 + \cdots + (\alpha_{4p-1} + i\alpha_{4p})/z^p,$

where $p$ is an integer and the $\alpha$'s are real constants. The total number of $\alpha$'s
is $4p$. The derived displacements and stresses satisfy the
equilibrium equations and decay to zero at infinity. Using Eqs. (19.3:16)
and (19.3:25), after some algebraic manipulation, we find

(5) $T_c = T_x - iT_y = \mathbf{G}_c\boldsymbol{\alpha}, \qquad u_c = u + iv = \mathbf{h}_c\boldsymbol{\alpha},$

where the components of $\mathbf{G}_c$ and $\mathbf{h}_c$ are

G 2 j -1 = -j(l/z j+1 + 1 /z j+1 )e~ 10 - j(j + 1 )e l0 z/z i+2 , 

G 2j = i[j(l/z j+1 - l/z 3+1 )e~ iS — j(j + l)e ie z/z 3+2 \, 

G 2p + 2 j-i = G 2p+2 j/i - j/z j+l , 

{h c ) 2 j-i = (k/z 3 + jz/z 3+1 )/(2G ), 

(h c ) 2 j = i{k/z 3 - jz/z 3+1 )/{2G ), 

(fic) 2 p+ 2 j — 1 = {hc) 2 p+ 2 j /i — 1/(2 z 3 G), 

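for $j = 1, 2, \ldots, p$. Using $\tilde{\mathbf{u}}$ in the form in Eq. (19.3:5) on $C$, and $u_c$ and $T_c$
of Eq. (5), one can write Eq. (2) in the form

(6) $\Pi_{R_2} = \boldsymbol{\alpha}^T\mathbf{k}_{u\tilde{u}}\mathbf{q} - \tfrac{1}{2}\boldsymbol{\alpha}^T\mathbf{k}_{uu}\boldsymbol{\alpha},$

where

$$\boldsymbol{\alpha}^T\mathbf{k}_{u\tilde{u}}\mathbf{q} = \int_C \tilde{\mathbf{u}}^T\mathbf{T}\,ds = \boldsymbol{\alpha}^T\,\mathrm{Re}\left(\int_C \mathbf{G}_c^T\tilde{\mathbf{h}}_c\,ds\right)\mathbf{q},$$

$$\boldsymbol{\alpha}^T\mathbf{k}_{uu}\boldsymbol{\alpha} = \int_C \mathbf{u}^T\mathbf{T}\,ds = \boldsymbol{\alpha}^T\,\mathrm{Re}\left(\int_C \mathbf{G}_c^T\mathbf{h}_c\,ds\right)\boldsymbol{\alpha},$$

which are similar to the element matrices defined in Eq. (19.3:28). From
the condition that the first variation of $\Pi_{R_2}$ with respect to $\boldsymbol{\alpha}$ vanishes,

$$\mathbf{k}_{u\tilde{u}}\mathbf{q} - \mathbf{k}_{uu}\boldsymbol{\alpha} = 0,$$

one can eliminate $\boldsymbol{\alpha}$ from Eq. (6) to obtain

$$\Pi_{R_2} = \tfrac{1}{2}\mathbf{q}^T\mathbf{k}\mathbf{q},$$

which is a function of $\mathbf{q}$ only, where

$$\mathbf{k} = \mathbf{k}_{u\tilde{u}}^T\mathbf{k}_{uu}^{-1}\mathbf{k}_{u\tilde{u}}$$

is the equivalent element stiffness matrix of the infinite element. The
element stiffness matrix can be assembled together with the matrices from $R_1$
in the usual fashion to form the finite element equation.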


(b) Long Wave in Potential Flow: Figure 19.5:1 shows an incident wave
being scattered by a solid body $B$ in the $x$, $y$-plane. Under the long wave
theory for potential flow (Stoker 1957, Chen and Mei 1974), the velocity
potential $\phi\exp(-i\omega t)$ is governed by

(7) $\nabla\cdot(h\nabla\phi) + \omega^2\phi/g = 0$ in $R_1 + R_2$,

where $h(x, y)$ is the depth of the fluid and $g$ is the gravitational
acceleration. The boundary condition on $\partial B$ is zero normal flow velocity, i.e.,
$\partial\phi/\partial n = 0$. For the case in which the water depth is constant in the far
field, an incident wave can be written in the form

(8) $\phi^I = -i g c_0 \exp[ikr\cos(\theta - \alpha)]/\omega,$

where $r$, $\theta$ are the cylindrical coordinates, $k = \omega/\sqrt{gh}$, $c_0$ is a constant, and
$\alpha$ is the angle of the incident wave direction. Interested readers can prove
that $\phi^I\exp(-i\omega t)$ is a plane wave satisfying Eq. (7). The scattered wave
potential, defined as $\phi^S = \phi - \phi^I$, must satisfy the far field condition


(9) $\lim_{r\to\infty} \sqrt{r}\,\phi^S$ is bounded,

which represents the behavior of an outgoing wave and is called the
radiation condition. As before, the hybrid functional can be written as

(10) $\Pi(\phi, \tilde{\phi}) = \Pi_{R_1}(\phi) + \Pi_{R_2}(\phi, \tilde{\phi}),$

where now

$$\Pi_{R_1} = \frac{1}{2}\int_{R_1}\left[h(\nabla\phi)^2 - \frac{\omega^2}{g}\phi^2\right]dA,$$

J - 4>)~ ^g n ds ~\ j ikh(<f) - <f> r ) 2 ds 

+ \j R |MV(<A-^)] 2 -y(^-/) 2 }^, 

in which $\partial\phi/\partial n$ is the outward normal derivative along $C$. In the
equations above, $\Pi_{R_1}$ is a regular functional over a finite domain
surrounding the body $B$, and $\Pi_{R_2}$ involves the infinite domain. It can be shown that the
vanishing of the first variation of $\Pi$ with respect to $\phi$, $\tilde{\phi}$ gives Eq. (7) as the
Euler equations for the two field variables in $R_1$ and $R_2$, Eq. (9) as the rigid
boundary condition for $\phi^S\ (= \phi - \phi^I)$, and the matching conditions



dn 


on curve $C$. If one makes $\phi$ satisfy Eq. (7) in $R_2$ and $\phi - \phi^I$ satisfy Eq. (9)
a priori, one can use the Gauss theorem to eliminate the integration over
the infinite domain in the functional $\Pi_{R_2}$ such that


( 11 ) 




— f hiq 
Jc \ 




which involves only line integrals along curve C in a finite domain. 

To construct the finite element equations, we shall again divide $R_1$ into
elements as shown in Fig. 19.5:1 and regard $R_2$ as a superelement. The
hybrid element matrices for the infinite domain $R_2$ can easily be evaluated
using Eq. (11). Consider the case where $h$ is constant in $R_2$. We select the
scattered-wave potential $\phi^S$, which satisfies the Helmholtz equation Eq. (7)
and the radiation condition Eq. (9), in the form

(12) $\phi^S = \phi - \phi^I = \alpha_0 H_0(kr) + \sum_{j=1}^{N} (\alpha_j\cos j\theta + \alpha_{j+N}\sin j\theta)H_j(kr) = \mathbf{G}\boldsymbol{\alpha},$

in which

$$\mathbf{G} = [H_0(kr)\quad H_1(kr)\cos\theta\quad H_1(kr)\sin\theta\quad H_2(kr)\cos 2\theta\quad \cdots],$$

where the $H$'s are the Hankel functions of the first kind and the $\alpha$'s are unknown
parameters. We assume $\tilde{\phi}$ of $R_2$ to be the same as $\phi$ of $R_1$ along $C$. The
interpolation functions $h$'s for $\tilde{\phi}$ are similar to those of $\tilde{\mathbf{u}}$ in the form of
Eq. (19.3:3) or (19.3:4), depending on whether $\tilde{\phi}$ is linear or quadratic along
the inter-element boundaries on $C$. We have

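(13) $\tilde{\phi} = \mathbf{h}\mathbf{q},$

where all components of $\mathbf{h}$ are zero between nodes $j$ and $j + 1$, except

$$h_j = 1 - \frac{s}{s_{j,j+1}}, \qquad h_{j+1} = \frac{s}{s_{j,j+1}}$$

for linear $\mathbf{h}$ between nodes $j$ and $j + 1$, or

$$h_j = \left(1 - \frac{s}{s_{j,j+1}}\right)\left(1 - \frac{s}{s_{j,j+2}}\right), \qquad h_{j+1} = \frac{s}{s_{j,j+1}}\,\frac{s_{j,j+2} - s}{s_{j,j+2} - s_{j,j+1}}, \qquad h_{j+2} = \frac{s}{s_{j,j+2}}\,\frac{s - s_{j,j+1}}{s_{j,j+2} - s_{j,j+1}}$$

for quadratic $\mathbf{h}$ between nodes $j$ and $j + 2$. Substituting Eqs. (12) and (13)
into Eq. (11) yields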

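(14) $\Pi_{R_2} = \boldsymbol{\alpha}^T\mathbf{k}_{\phi\tilde{\phi}}\mathbf{q} - \tfrac{1}{2}\boldsymbol{\alpha}^T\mathbf{k}_{\phi\phi}\boldsymbol{\alpha} - \boldsymbol{\alpha}^T\mathbf{f}_{\phi} - \mathbf{q}^T\mathbf{f}_{\tilde{\phi}},$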

where 


« T k,iq 


ot T U 


- J <t>hds , a r k^a = J <t> s ~^hds, 

r Ft a r . 


an 


qT{ *=~Ic 

In principle, one can eliminate $\boldsymbol{\alpha}$ using the stationary condition of $\Pi_{R_2}$ and
obtain

$$\Pi_{R_2} = \tfrac{1}{2}\mathbf{q}^T\mathbf{k}\mathbf{q} - \mathbf{q}^T\mathbf{f} + \text{constant},$$
where

$$\mathbf{k} = \mathbf{k}_{\phi\tilde{\phi}}^T\mathbf{k}_{\phi\phi}^{-1}\mathbf{k}_{\phi\tilde{\phi}}, \qquad \mathbf{f} = \mathbf{k}_{\phi\tilde{\phi}}^T\mathbf{k}_{\phi\phi}^{-1}\mathbf{f}_{\phi} + \mathbf{f}_{\tilde{\phi}}.$$

These element matrices can be assembled like regular element matrices.
The size of $\mathbf{k}$ can be quite large since it involves the nodes along the entire
curve $C$.

One may also simply assemble the matrices of the infinite element in
Eq. (14) with the element matrices of $R_1$ without first eliminating $\boldsymbol{\alpha}$ and
use the assembled equations to solve for $\mathbf{q}$ and $\boldsymbol{\alpha}$ simultaneously.

If $C$ is a circle of radius $r_0$, $\mathbf{k}_{\phi\phi}$ can be inverted analytically. Note that
$\partial/\partial n = -\partial/\partial r$ at $r = r_0$ in Eq. (11). In this case, we have


(15) $\boldsymbol{\alpha}^T\mathbf{k}_{\phi\phi}\boldsymbol{\alpha} = -\pi r_0 k h\left[2\alpha_0^2 H_0 H_0' + \sum_{j=1}^{N}(\alpha_j^2 + \alpha_{j+N}^2)H_j H_j'\right],$

where

$$H_j = H_j(kr_0), \qquad H_j' = H_j'(kr_0).$$

The matrix $\mathbf{k}_{\phi\phi}$ is a diagonal matrix and can be inverted easily. A more
general discussion of the water-wave problem and corresponding finite-element
solutions can be found in Chen and Mei (1974) and Mei and Chen (1975).


19.6. INCOMPRESSIBLE OR NEARLY INCOMPRESSIBLE 
ELASTICITY 

Many physical problems involve deformation that essentially preserves
volume locally. Rubber is often modeled as such a material. In the linearized
theory of elasticity of isotropic materials obeying Hooke's law, the
incompressible or nearly incompressible condition is expressed by Poisson's ratio
approaching 1/2. This is seen from the ratio of bulk modulus $K$ to shear
modulus $G$ shown in Eq. (6.2:9),

$$\frac{K}{G} = \frac{2(1 + \nu)}{3(1 - 2\nu)}.$$

The limit creates problems in elasticity for isotropic materials since the
constitutive equations Eq. (6.2:7) are

$$\sigma_{ij} = \lambda u_{k,k}\delta_{ij} + G(u_{i,j} + u_{j,i})$$

(recall that $x_1 = x$, $x_2 = y$, $x_3 = z$, $u_1 = u$, $u_2 = v$, $u_3 = w$ in index
notation). The Lamé constant $\lambda\,[= 2\nu G/(1 - 2\nu)]$ becomes infinite at
$\nu = 1/2$.
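The growth of both ratios as Poisson's ratio approaches 1/2 can be tabulated with a few lines of Python. This is only an illustrative sketch; the function name `modulus_ratios` and the sample values of Poisson's ratio are assumptions made here, not part of the text.

```python
def modulus_ratios(nu):
    """Return (K/G, lambda/G) for an isotropic Hookean solid.

    K/G      = 2(1 + nu) / (3(1 - 2 nu))
    lambda/G = 2 nu / (1 - 2 nu)
    """
    if not -1.0 < nu < 0.5:
        raise ValueError("Poisson's ratio must lie in (-1, 1/2)")
    k_over_g = 2.0 * (1.0 + nu) / (3.0 * (1.0 - 2.0 * nu))
    lam_over_g = 2.0 * nu / (1.0 - 2.0 * nu)
    return k_over_g, lam_over_g


if __name__ == "__main__":
    # Hypothetical sample values approaching the incompressible limit.
    for nu in (0.3, 0.49, 0.499, 0.4999):
        k_g, lam_g = modulus_ratios(nu)
        print(f"nu = {nu}: K/G = {k_g:.1f}, lambda/G = {lam_g:.1f}")
```

Both ratios grow without bound as the limit is approached, and they always differ by 2/3, since $K = \lambda + 2G/3$.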
An alternative formulation is to include the hydrostatic pressure $p$ as an
independent field, where

$$p/K = -e = -(u_{,x} + v_{,y} + w_{,z}) = -u_{j,j},$$

in which $e$ is the dilatation as defined in Eq. (7.1:11). A penalty type of
variational functional (see Sec. 10.10) for an element can be written as

(1) $\Pi_e = \int_{V_e}\left[\frac{1}{2}\mathbf{e}^T\mathbf{D}_f\mathbf{e} - \alpha\left(\frac{p^2}{2K} + p\,u_{k,k}\right) - \mathbf{u}^T\mathbf{b}\right]dV - \int_{\partial V_{\sigma e}}\mathbf{u}^T\bar{\mathbf{T}}\,dS,$

with the penalty function $f = K(u_{k,k})^2 + 2p\,u_{k,k} + p^2/K$ and

(2) $\mathbf{D}_f = \mathbf{D}_e - \alpha K\hat{\mathbf{D}}, \qquad \hat{\mathbf{D}} = \begin{bmatrix} \begin{matrix} 1 & 1 & 1 \\ 1 & 1 & 1 \\ 1 & 1 & 1 \end{matrix} & \mathbf{0}_{3\times 3} \\ \mathbf{0}_{3\times 3} & \mathbf{0}_{3\times 3} \end{bmatrix}.$

$\mathbf{D}_e$ is the elastic modulus matrix of the material (see Sec. 18.14 for plane
elasticity and Sec. 18.16 for 3-dimensional elasticity), $\mathbf{0}$ is a zero matrix
with all entries being zero, $K$ is the bulk modulus, and $\alpha$ is a constant of
order 1 chosen to ensure that $\mathbf{D}_f$ is bounded and positive definite. The
choice of $\alpha$ is not unique except for incompressible materials.

In the case of an isotropic material, we choose $\alpha$ in the form

$$\alpha = \frac{3[\nu - \beta(1 - 2\nu)]}{1 + \nu},$$

where $\beta$ is a selected constant of order 1. Then

(3) $\mathbf{D}_f = G(\mathbf{D}' + 2\beta\hat{\mathbf{D}}), \qquad \mathbf{D}' = G\,\mathrm{diag}(2\ 2\ 2\ 1\ 1\ 1),$

i.e., $\mathbf{D}'$ is a diagonal matrix. When the material is incompressible, $\alpha$
becomes $3\nu/(1 + \nu)\ (= 1)$. For any non-negative $\beta$, $\mathbf{D}_f$ is positive definite.
If $\beta = 0$, Eq. (2) reduces to the conventional mixed functional (Herrmann
1965, Tong 1969) involving the two fields $\mathbf{u}$, $p$.
where $\mathbf{e}'$ is the deviatoric engineering strain matrix and $\mathbf{D}'$ is always
bounded and given by Eq. (3) for isotropic materials. In Eq. (1), $p$ plays
the role of the Lagrange multiplier to enforce the pressure-dilatation strain
relation. In the limiting case of incompressible materials, it enforces the
incompressibility condition by requiring the dilatation to be zero.

The integrand of the functional above involves both strains and
hydrostatic pressure, and therefore the functional is a mixed functional. Admissibility simply
requires that $\mathbf{u}$ be $C^0$ continuous. Discontinuous $p$ is permitted. The
mathematical convergence theory for mixed finite element methods can be found
in Tong (1969), Babuska (1971), Brezzi and Pitkaranta (1974), Oden and
Carey (1984), and Brezzi (1984).

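In the finite element formulation, we express

(5) $\mathbf{u} = \mathbf{h}\mathbf{q}, \qquad p = \mathbf{g}\mathbf{p}, \qquad e = \mathrm{div}(\mathbf{u}) = \mathbf{d}_d\mathbf{h}\mathbf{q} = \mathbf{B}_d\mathbf{q},$

where $\mathbf{q}$, $\mathbf{p}$ are nodal value matrices of $\mathbf{u}$, $p$ with $\mathbf{h}$ and $\mathbf{g}$ being their
interpolation matrices, $\mathbf{B}_d$ relates the nodal displacements to the volumetric
strain, and $\mathbf{d}_d$ is a differential operator such that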


d d 


d_ 

dx 

0 


0 

d_ 

dy 


0 0 


0 

0 

d_ 

dz. 


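The interpolation matrix $\mathbf{h}$ is in the form of Eq. (18.14:10) for plane or
axisymmetric problems and in the form of Eq. (18.16:13) for three-dimensional
elasticity. A substitution of Eq. (5) into (1) yields

(6) $\Pi_e = \tfrac{1}{2}\mathbf{q}^T\mathbf{k}_{qq}\mathbf{q} - \mathbf{q}^T\mathbf{k}_{qp}\mathbf{p} - \tfrac{1}{2}\mathbf{p}^T\mathbf{k}_{pp}\mathbf{p} - \mathbf{q}^T\mathbf{f},$

where

(7) $\mathbf{k}_{qq} = \int_{V_e}\mathbf{B}^T\mathbf{D}_f\mathbf{B}\,J\,d\xi\,d\eta\,d\zeta, \qquad \mathbf{k}_{qp} = \alpha\int_{V_e}\mathbf{B}_d^T\mathbf{g}\,J\,d\xi\,d\eta\,d\zeta,$

$$\mathbf{k}_{pp} = \alpha\int_{V_e}\frac{1}{K}\mathbf{g}^T\mathbf{g}\,J\,d\xi\,d\eta\,d\zeta, \qquad \mathbf{f} = \int_{V_e}\mathbf{h}^T\mathbf{b}\,dV + \int_{\partial V_{\sigma e}}\mathbf{h}^T\bar{\mathbf{T}}\,dS.$$

Note that $\mathbf{f}$ is the same as that defined in Eq. (18.16:16).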

In general, we use lower order shape functions for the pressure because
the continuity requirement for $p$ is lower than that for $\mathbf{u}$. For
linear $\mathbf{u}$, we use constant $p\ (= p_1)$ in the element; for quadratic $\mathbf{u}$, we
use linear $p\ (= p_1 + p_2 x + p_3 y + p_4 z)$, and so on. If one assumes discontinuous
pressure between elements, in the compressible case, one can eliminate the
pressure degrees-of-freedom at the element level, i.e.,

$$\mathbf{k}_{pp}\mathbf{p} + \mathbf{k}_{qp}^T\mathbf{q} = 0, \quad \text{or} \quad \mathbf{p} = -\mathbf{k}_{pp}^{-1}\mathbf{k}_{qp}^T\mathbf{q}.$$

A substitution of $\mathbf{p}$ into Eq. (6) reduces $\Pi_e$ to the standard form

$$\Pi_e = \tfrac{1}{2}\mathbf{q}^T\mathbf{k}\mathbf{q} - \mathbf{q}^T\mathbf{f},$$

where

$$\mathbf{k} = \mathbf{k}_{qq} + \mathbf{k}_{qp}\mathbf{k}_{pp}^{-1}\mathbf{k}_{qp}^T.$$

We can then derive the global stiffness and force matrices by assembling
the element matrices in the usual manner. The procedure of first
eliminating the pressure degrees-of-freedom fails for incompressible materials
because $\mathbf{k}_{pp}$ becomes singular ($K$ becomes infinite). The procedure does
not offer much advantage for nearly incompressible materials, as the terms
involving $\mathbf{k}_{pp}^{-1}$ can dominate $\mathbf{k}_{qq}$ when $K$ is large. It is better to solve the
pressure and nodal-displacement equations simultaneously. However, some
precautions should be taken. For example, an equation corresponding to a
zero or nearly zero diagonal must not appear as the first equation of the
global system. When this happens, the direct method of solution cannot
proceed.
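The element-level elimination of the pressure can be sketched for the simplest case of a single constant-pressure degree-of-freedom, for which $\mathbf{k}_{pp}$ is a scalar and $\mathbf{k}_{qp}$ is a column. The function name `condense_pressure` and all numerical values below are hypothetical, chosen only to show how the condensed stiffness grows as $\mathbf{k}_{pp}$ shrinks (i.e., as $K$ increases).

```python
def condense_pressure(k_qq, k_qp, k_pp):
    """Condensed stiffness k = k_qq + k_qp * k_pp^{-1} * k_qp^T.

    k_qq : n x n list of lists (displacement stiffness)
    k_qp : list of length n (coupling column for one pressure dof)
    k_pp : scalar, proportional to 1/K, so it tends to zero as K grows
    """
    if k_pp == 0.0:
        # Incompressible limit: the pressure cannot be eliminated.
        raise ZeroDivisionError("k_pp is singular; solve for q and p together")
    n = len(k_qp)
    return [[k_qq[i][j] + k_qp[i] * k_qp[j] / k_pp for j in range(n)]
            for i in range(n)]


if __name__ == "__main__":
    k_qq = [[2.0, -1.0], [-1.0, 2.0]]   # hypothetical values
    k_qp = [1.0, -1.0]                  # hypothetical values
    for k_pp in (0.5, 1.0e-3, 1.0e-6):  # decreasing k_pp mimics increasing K
        print(k_pp, condense_pressure(k_qq, k_qp, k_pp))
```

As `k_pp` decreases, the rank-one term swamps `k_qq`, which is the loss of advantage for nearly incompressible materials noted above.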

Displacement and pressure are two independent fields. However,
arbitrary combinations of displacement and pressure interpolations may not be
effective. Each pressure degree-of-freedom is equivalent to a constraint on
the deformation mode. Too few pressure degrees-of-freedom can lead to
poor approximation. It can also lead to spurious deformation with zero
strain energy for the functional in Eq. (4), which can make the equation
unsolvable. For example, pure dilatation with zero pressure ($\gamma_{xy} = \gamma_{yz} =
\gamma_{zx} = p = 0$, $e_{xx} = e_{yy} = e_{zz}$) gives zero strain energy in Eq. (4).

Too many pressure modes will overly constrain the deformation and lead
to locking. Consider the constant strain triangle with one pressure
degree-of-freedom. For the element mesh shown in Fig. 19.6:1, the constant volume
(area) condition for element 2 precludes the horizontal motion of node 1.
Meanwhile, the same condition for element 1 restrains node 1 to
move only horizontally. As a result, the displacements at node 1 can only
be zero. With the displacements at node 1 being zero, it follows that nodes 2
and 3 have zero displacements, and so on. We conclude that every node in
the entire mesh must have zero displacement. The result holds no matter
how many elements are in the mesh.

Fig. 19.6:1. Mesh for which incompressibility dictates zero displacement.

Hughes (1987) introduced a heuristic approach for determining the
ability of an element to perform well in incompressible and nearly
incompressible applications. Consider a finite element mesh. Let $n_{eq}$ represent the
total number of displacement equations after boundary conditions have
been imposed and $n_c$ be the total number of incompressibility constraints.
For linear problems, $n_c$ is the total number of pressure degrees of freedom.
The constraint ratio $r$ is defined as

$$r = n_{eq}/n_c.$$

Hughes conjectured that $r$ should mimic the behavior of the number of
equilibrium equations divided by the number of incompressibility conditions
for the system of the governing partial differential equations, i.e., $r = n$,
where $n$ is the number of space dimensions. So in two dimensions, the ideal
value of $r$ should be 2. A value of $r$ less than 2 would indicate a tendency
to lock. If $r < 1$, there are more constraints on $\mathbf{q}$ than the number of
$q$'s available, and severe locking will result. If $r$ is much greater than 2,
it indicates a lack of incompressibility constraints, which can lead to a poor
solution. The results (Hughes 1987) are illustrated in Figs. 19.6:2 and 19.6:3.
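As a rough illustration of the counting, the sketch below evaluates $r$ for a square mesh of $n \times n$ cells whose displacements are fixed along two adjacent edges, an assumed test configuration. The function name `constraint_ratio` and the element labels `"Q4/P0"` (four-node quadrilateral, one constant pressure) and `"T3/P0"` (each cell split into two linear triangles, one constant pressure each) are hypothetical names introduced here.

```python
def constraint_ratio(n, element):
    """Constraint ratio r = n_eq / n_c for an n x n cell mesh in two dimensions.

    Assumed boundary conditions: all nodes on two adjacent edges are fixed,
    leaving (n + 1)**2 - (2 * n + 1) = n**2 free nodes with 2 dofs each.
    """
    free_nodes = (n + 1) ** 2 - (2 * n + 1)
    n_eq = 2 * free_nodes
    if element == "Q4/P0":
        n_c = n * n          # one pressure dof per quadrilateral
    elif element == "T3/P0":
        n_c = 2 * n * n      # two triangles per cell, one pressure dof each
    else:
        raise ValueError(f"unknown element type: {element}")
    return n_eq / n_c


if __name__ == "__main__":
    for name in ("Q4/P0", "T3/P0"):
        print(name, constraint_ratio(8, name))
```

Under these assumptions the quadrilateral attains the ideal value of 2, while the constant-strain triangle gives 1, consistent with its tendency to lock.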

Another approach to handling the nearly incompressible issue is to split
the elastic coefficient matrix $\mathbf{D}_e$ into a $G$-part (shear) and a $\lambda$-part (Lamé
constant) (Hughes 1987),

$$\mathbf{D}_e = \bar{\mathbf{D}} + \lambda\hat{\mathbf{D}},$$

where $\bar{\mathbf{D}} = \mathbf{D}'$ is given in Eq. (3) and $\hat{\mathbf{D}}$ is given in Eq. (2) for isotropic
materials. For plane strain,

$$\bar{\mathbf{D}} = G\,\mathrm{diag}(2\ 2\ 1), \qquad \hat{\mathbf{D}}_{3\times 3} = \begin{bmatrix} 1 & 1 & 0 \\ 1 & 1 & 0 \\ 0 & 0 & 0 \end{bmatrix}.$$

Fig. 19.6:2. Triangular elements with discontinuous pressure: (a) r = 1, (b) r = 2, (c) r = 4.

Fig. 19.6:3. Rectangular elements with discontinuous pressure.
Using the displacement formulation, from Eq. (18.8:23), the element
stiffness matrix can be written as

$$\mathbf{k} = \bar{\mathbf{k}} + \hat{\mathbf{k}},$$

where

$$\bar{\mathbf{k}} = \int_{V_e}\mathbf{B}^T\bar{\mathbf{D}}\mathbf{B}\,dV, \qquad \hat{\mathbf{k}} = \lambda\int_{V_e}\mathbf{B}^T\hat{\mathbf{D}}\mathbf{B}\,dV, \qquad \mathbf{B} = \mathbf{d}_e\mathbf{h},$$

in which $\mathbf{d}_e$ is given in Eq. (18.6:6).

If $\lambda/G$ is very large, the numerical values of $\hat{\mathbf{k}}$ can be much larger than
those of $\bar{\mathbf{k}}$. One simple and important practical way to lower the stiffness of an
element is to reduce the order of numerical quadrature by 1 below that
'normally' used in evaluating $\hat{\mathbf{k}}$. For instance, for Gauss integration,
if we use $p + 1$ integration points to evaluate $\bar{\mathbf{k}}$, we use only $p$ points
to evaluate $\hat{\mathbf{k}}$. This is a form of selectively reduced integration, which is also the
remedy used to eliminate shear locking, as discussed before. Malkus and Hughes
(1978) showed that many elements of mixed formulation are equivalent to
elements of selectively reduced integration.
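The effect of dropping one Gauss point can be seen in one dimension. In the sketch below, the integrand $x^2$ stands in, purely as an assumed illustration, for the energy of a volumetric strain mode that varies linearly across the element; the rule table `GAUSS_RULES` and the function `gauss_integrate` are hypothetical names introduced here.

```python
import math

# Gauss-Legendre points and weights on [-1, 1] for 1, 2, and 3 points.
GAUSS_RULES = {
    1: ([0.0], [2.0]),
    2: ([-1.0 / math.sqrt(3.0), 1.0 / math.sqrt(3.0)], [1.0, 1.0]),
    3: ([-math.sqrt(0.6), 0.0, math.sqrt(0.6)], [5.0 / 9.0, 8.0 / 9.0, 5.0 / 9.0]),
}


def gauss_integrate(f, npts):
    """Integrate f over [-1, 1] with an npts-point Gauss-Legendre rule."""
    points, weights = GAUSS_RULES[npts]
    return sum(w * f(x) for x, w in zip(points, weights))


if __name__ == "__main__":
    energy = lambda x: x * x   # assumed stand-in for a linearly varying strain mode
    print("2-point (full):   ", gauss_integrate(energy, 2))
    print("1-point (reduced):", gauss_integrate(energy, 1))
```

The two-point rule integrates the mode exactly, whereas the one-point rule assigns it zero energy, so the mode is no longer penalized by the large factor $\lambda$. This is the sense in which reduced quadrature lowers the element stiffness.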
FINITE ELEMENT METHODS FOR
PLATES AND SHELLS


Plates and shells are particular forms of a three-dimensional solid in
which the thickness is small compared with the other dimensions.
Conceptually, they do not present difficulties for the finite element method. However,
the 3-dimensional elements for such structures would generally have a very
large aspect ratio (in-plane dimension to thickness), which makes the
elements overly stiff in the out-of-plane direction. This 'overly stiff' phenomenon
is known as 'locking', as defined in Sec. 18.20. To obtain a solution of
acceptable accuracy in the 3-dimensional model, one often needs elements
of a size of the order of the thickness or less. This would be costly. Approximate
theories such as the Kirchhoff theory for thin plates and shells and the
Reissner-Mindlin theory for moderately thick plates and shells have been
developed to reduce the plate and shell problems to two dimensions. In this
chapter, we shall discuss the finite element method to model the Kirchhoff
(Secs. 16.15 and 16.16) and Reissner-Mindlin theories of plates and shells.
Other approaches, including reduced integration and mixed
formulations to avoid locking, will also be discussed. Throughout this chapter, we
shall use $t$ as the plate or shell thickness.

20.1. LINEARIZED BENDING THEORY OF THIN PLATES* 

Let the midplane of the plate be on the $x$, $y$ plane in a Cartesian system
of coordinates. The Kirchhoff theory assumes that (1) the deformation in
the midplane is like that of the plane stress state discussed in Sec. 9.1,
(2) strains vary linearly along lines normal to the midplane, and (3) lines
normal to the midplane remain normal to the deformed midsurface after
deformation. Under the Kirchhoff assumptions, the state of deformation off
the midplane is determined only by the lateral deflection, $w$, normal to the
midplane. The displacements at any point of the plate are given by

$$u = u_0 - z\frac{\partial w}{\partial x}, \qquad v = v_0 - z\frac{\partial w}{\partial y}, \qquad w = w,$$

where $z$ is the distance from the midplane and $u_0$, $v_0$ are the in-plane
displacements determined by plane-stress analysis based on the assumption
(1) mentioned above. Thus, $u_0$, $v_0$ are associated with midplane stretching and
shear, $w$ relates to bending, and $\partial w/\partial y$ and $-\partial w/\partial x$ represent the
rotations about the $x$-, $y$-axes of the midplane, respectively. In the linearized
theory of plates, stretching and bending are decoupled. Therefore, in Secs. 1
through 4, since only plate bending is considered, we shall drop terms
involving $u_0$, $v_0$. It is only after Sec. 4, when shells are considered, that we will
include the midsurface in-plane displacements to account for the coupling of
stretching and bending.

*With focus on the finite element approach, the linearized small deflection theory of
plates and shells is discussed in this chapter. The linearized small deflection theory of
plates is presented in Sec. 16.15. To solve the nonlinear problem, a fundamental method
of incremental approach is discussed in Chapter 17. Using the finite element method for
the incremental approach, we need the linearized solution at every step. Hence the material
discussed herein is important.

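The strains associated with the off-midplane deflections are proportional
to the changes of the curvature of the midplane and are called the bending
strains. These strains form a column matrix $\boldsymbol{\kappa}$ with components consisting
of the second derivatives of $w$,

(1) $\boldsymbol{\kappa} = [\kappa_x\ \ \kappa_y\ \ \kappa_{xy}]^T = \mathbf{d}_e\mathbf{d}w,$

which represents the change of curvature as discussed in Sec. 16.15, and
$\mathbf{d}_e$ and $\mathbf{d}$ are matrix differential operators defined in Eqs. (18.11:5) and
(18.11:6):

$$\mathbf{d}_e = \begin{bmatrix} \dfrac{\partial}{\partial x} & 0 & \dfrac{\partial}{\partial y} \\ 0 & \dfrac{\partial}{\partial y} & \dfrac{\partial}{\partial x} \end{bmatrix}^T, \qquad \mathbf{d} = \begin{bmatrix} \dfrac{\partial}{\partial x} \\ \dfrac{\partial}{\partial y} \end{bmatrix}.$$

Thus $\mathbf{d}_e\mathbf{d}$ is a second order matrix differential operator such that

(2) $(\mathbf{d}_e\mathbf{d})^T = \left[\dfrac{\partial^2}{\partial x^2}\ \ \dfrac{\partial^2}{\partial y^2}\ \ 2\dfrac{\partial^2}{\partial x\,\partial y}\right].$

The stress measures are bending moments, $\mathbf{M}$, and transverse shear forces,
$\mathbf{Q}$, per unit length along lines on the midsurface of the plate. They are, in
matrix form,

(3) $\mathbf{M} = [M_x\ \ M_y\ \ M_{xy}]^T, \qquad \mathbf{Q} = [Q_x\ \ Q_y]^T.$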





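As discussed in Sec. 16.15, $\mathbf{Q}$ is related to $\mathbf{M}$ by Eq. (16.15:13), and
$\mathbf{M}$ is related to the lateral displacement $w$ through the constitutive law
Eq. (16.15:19), which, in matrix form, is

(4) $\mathbf{M} = -\mathbf{D}_b\boldsymbol{\kappa} = -\mathbf{D}_b\mathbf{d}_e\mathbf{d}w,$

where $\mathbf{D}_b$ is the bending rigidity matrix. For isotropic elastic material, $\mathbf{D}_b$
is (Sec. 16.15)

(5) $\mathbf{D}_b = \dfrac{Et^3}{12(1 - \nu^2)}\begin{bmatrix} 1 & \nu & 0 \\ \nu & 1 & 0 \\ 0 & 0 & (1 - \nu)/2 \end{bmatrix},$

where $E$ represents Young's modulus, $\nu$ is Poisson's ratio (see Sec. 6.2), and
$t$ is the thickness of the plate. The shear $\mathbf{Q}$ is related to $\mathbf{M}$ through the
moment equilibrium equation (16.15:13).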
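The constitutive relation between moments and curvature changes can be evaluated directly. The following is a minimal sketch in plain Python; the function names `bending_rigidity` and `moments` and the numerical inputs are assumptions made for illustration, and the curvature column is ordered as $[\kappa_x,\ \kappa_y,\ \kappa_{xy}]$ with $\kappa_{xy} = 2\,\partial^2 w/\partial x\,\partial y$.

```python
def bending_rigidity(E, nu, t):
    """Isotropic bending rigidity matrix D_b (3 x 3) of a thin plate."""
    D = E * t ** 3 / (12.0 * (1.0 - nu ** 2))
    return [[D, nu * D, 0.0],
            [nu * D, D, 0.0],
            [0.0, 0.0, 0.5 * (1.0 - nu) * D]]


def moments(D_b, kappa):
    """Bending moments M = -D_b * kappa for kappa = [k_x, k_y, k_xy]."""
    return [-sum(D_b[i][j] * kappa[j] for j in range(3)) for i in range(3)]


if __name__ == "__main__":
    # Hypothetical steel-like plate: E in Pa, t in m.
    D_b = bending_rigidity(E=200.0e9, nu=0.3, t=0.01)
    print(moments(D_b, [1.0e-3, 0.0, 0.0]))
```

A pure curvature change $\kappa_x$ produces both $M_x$ and $M_y = \nu M_x$ through the off-diagonal Poisson coupling, while the twisting moment stays zero.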

Consider a domain A with the following two sets of boundary conditions: 

(6) $w = \bar{w}$ on $\partial A_w$,


HxQx ~b n yQy 


on 8Aq , 


(9) $n_x^2 M_x + 2n_x n_y M_{xy} + n_y^2 M_y = \bar{M}_{nn}$ on $\partial A_M$,


where

$$M_{ns} = n_x n_y(M_y - M_x) + (n_x^2 - n_y^2)M_{xy}.$$

The overbar denotes prescribed quantities, with $\bar{M}_{nn}$, $\bar{M}_{ns}$ being the
prescribed normal and twist moments; $n_x$, $n_y$ are the direction cosines of the
unit outward normal to the boundary; the $M$'s and $Q$'s are expressed in terms of $w$ through
the constitutive law; $\partial/\partial n$ denotes the outward normal derivative along the
boundary; and the subscripted $\partial A$'s are parts of the domain boundary, with
the subscripts identifying the prescribed quantities. At any given boundary
location, one and only one condition from each of the two sets above can be
specified. Equations (6) and (8) are rigid and Eqs. (7) and (9) are natural
boundary conditions.

The functional for the principle of minimum potential energy is

$$\Pi = \int_A\left[\frac{1}{2}(\mathbf{d}_e\mathbf{d}w)^T\mathbf{D}_b\mathbf{d}_e\mathbf{d}w - pw\right]dx\,dy + \int_{\partial A_M}\bar{M}_{nn}\frac{\partial w}{\partial n}\,ds - \int_{\partial A_Q}\bar{Q}w\,ds,$$

where $p = p(x, y)$ is the distributed lateral pressure on the plate. The
functional can be written as the sum of the functionals of all elements, provided
that $w$ is $C^1$ continuous:

$$\Pi = \sum_{\text{all elements}}\Pi_{Be},$$

where $\Pi_{Be}$ is the functional of an element in the same form as $\Pi$, except
that the domains of integration are replaced with the corresponding
domains of the element. We further divide the element functional into two
parts, namely,

(10) $\Pi_{Be} = \Pi_{be} - \Pi_{bfe},$

where

$$\Pi_{be} = \int_{A_e}\frac{1}{2}(\mathbf{d}_e\mathbf{d}w)^T\mathbf{D}_b\mathbf{d}_e\mathbf{d}w\,dx\,dy,$$

$$\Pi_{bfe} = \int_{A_e}pw\,dx\,dy - \int_{\partial A_{Me}}\bar{M}_{nn}\frac{\partial w}{\partial n}\,ds + \int_{\partial A_{Qe}}\bar{Q}w\,ds,$$

in which $A_e$ is the area of the element, and $\partial A_{Me}$ and $\partial A_{Qe}$ are the portions
of the element boundary coinciding with $\partial A_M$ and $\partial A_Q$, respectively. We have
introduced additional subscripts for the $\Pi$'s such that $\Pi_{Be}$ is the potential
energy, $\Pi_{be}$ represents the strain energy, and $\Pi_{bfe}$ denotes the external work
on the element associated with bending deformation. This is to distinguish
them from the corresponding quantities associated with membrane action,
to be discussed later.

For convergence, the shape functions must be able to approximate the
state of constant curvature within an element. This requires that the shape
functions contain the complete quadratic terms. The governing differential
equation for $w$ is of fourth order. The condition number (see Sec. 18.5 and
Tong 1970c) of the resulting stiffness matrix grows as $1/\epsilon^4$, where $\epsilon$ is the
minimum dimension of all elements. Thus the condition number grows
rapidly as the size of the elements decreases, which can lead to round-off problems
in the finite element solution for fine meshes.

Determination of $C^1$ shape functions is much harder than the
determination of $C^0$ functions. Elements with shape functions that preserve
continuity of $w$ but may violate slope continuity between elements are called
non-conforming or incompatible elements (see Sec. 20). For incompatible
elements, we have to rely on the patch test to ensure the convergence of the
approximate solution. Constructing multi-dimensional $C^1$-interpolations
created considerable difficulties in the early development of the finite
element method, especially for triangular elements. Various conforming plate
elements based upon the displacement model have subsequently been
developed. Unfortunately, the resulting compatible shape functions are generally
complicated. Hybrid or mixed formulations are alternative approaches to
circumvent the difficulty.

The different elements that have evolved are distinguished by the type of
shape functions, generalized coordinates at nodes, geometrical shape, and
the number of nodes. Clough and Tocher (1966) were among the first who
successfully constructed a compatible low-order triangular bending element.
The element was so stiff that the solution using the element converges
very slowly. Bell (1969), Cowper et al. (1969), and Holand and Bell (1972)
derived conforming higher order triangular elements that use the six
quantities $w$, $w_{,x}$, $w_{,y}$, $w_{,xx}$, $w_{,xy}$, and $w_{,yy}$ at the vertices as the generalized
nodal coordinates. The subscripts with a comma denote partial
differentiation with respect to the subscript variable. This convention will be used
throughout the remainder of this book. Clough and Felippa (1968)
developed a similar bending element for quadrilaterals. The element is quite
accurate but computationally expensive. For many structural applications,
discontinuity is natural, since abrupt changes in thickness or composition
due to discrete stiffening or members meeting at angles are part of the design.
At such locations, continuity should not be enforced. Conforming elements,
especially the lower order ones, may yield inferior accuracy compared to
non-conforming elements. Therefore, non-conforming elements are widely
used in practice.

Higher order bending elements can improve the accuracy.
Unfortunately, the higher order conforming elements are even more complicated.
Therefore, if one weighs the expense against the incremental increase in
accuracy, simpler non-compatible elements can be a good choice. It should
be emphasized that in specific applications where accurate stress (bending
moments, which are proportional to the second derivatives of $w$) prediction
is important, especially in regions of high stress gradients, higher-order
elements may be necessary.

In the following, two non-compatible elements based on the variational
functional given in Eq. (10) will be developed.

Incompatible Rectangular Element. Consider the rectangular plate 
element in the x, y-plane shown in Fig. 20.1:1 and take the nodal values of 
w, W' X , w iV at the four corners as the generalized coordinates. The element 
has a total of 12 degrees-of-freedom. The transpose of qs is: 

qs = MCi, m) W'xiti , vi)w, v (£i w ,x(£i, m) w, v (6, %)] , 



800 


FINITE ELEMENT METHODS FOR PLATES AND SHELLS 


Chap. 20 



Fig. 20.1:1. A rectangular element. 


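where $\mathbf{q}_B$ is a column matrix, $\xi_1 = \eta_1 = -1$, $\xi_2 = \eta_2 = 1$, and $\xi = x/a$, 
$\eta = y/b$ are the natural coordinates of the element. We approximate w 
within the element as a polynomial in terms of 12 parameters: 

(11)  $w(x,y) = \alpha_1 + \alpha_2 x + \alpha_3 y + \alpha_4 x^2 + \alpha_5 xy + \alpha_6 y^2 + \alpha_7 x^3 + \alpha_8 x^2 y + \alpha_9 x y^2 + \alpha_{10} y^3 + \alpha_{11} x^3 y + \alpha_{12} x y^3 = \mathbf{P}\boldsymbol{\alpha},$ 

where $\mathbf{P}$ and $\boldsymbol{\alpha}$ are matrices defined as 

$\mathbf{P} = [\, 1 \;\; x \;\; y \;\; x^2 \;\; \cdots \;\; x y^3 \,]$  (1 x 12), 

$\boldsymbol{\alpha}^T = [\, \alpha_1 \;\; \alpha_2 \;\; \alpha_3 \;\; \cdots \;\; \alpha_{12} \,].$ 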

Along lines x or y = constant, the displacement w is cubic, which can be 
uniquely defined by four parameters. Since the inter-element boundaries 
are lines with either x or y being constant, the two nodal values of w and its 
tangential derivatives at the two ends of a boundary of the element define 
w along the boundary. The tangential derivative of w along the boundary 
x = constant is $w_{,y}$, whereas that along y = constant is $w_{,x}$. As a result, 
if two neighboring elements have the same nodal values, continuity of w is 
assured along their common boundary. From Eq. (11), since both $w_{,x}$ and 
$w_{,y}$ vary as cubic functions, the derivative of w normal to any boundary 
is cubic in general. The normal derivative of w along an element boundary 
is matched with that of the adjacent element only at the two end-points 
of the boundary line. Therefore, continuity in the normal derivative of w 
between the two points cannot be assured. Thus, the interpolation function 
given in Eq. (11) is nonconforming. 

Using more degrees of freedom for each node can resolve the problem just 
described. In particular, if one adds the twist $w_{,xy}$ as an additional 
degree-of-freedom at each node (thereby resulting in an element of 16 
degrees-of-freedom with four at each node), the uniqueness of the normal slope of w along 
the inter-element boundaries, and therefore compatibility, can be established. In 
this case, four additional terms ($\alpha_{13} x^2 y^2 + \alpha_{14} x^3 y^2 + \alpha_{15} x^2 y^3 + \alpha_{16} x^3 y^3$) 
are added to Eq. (11), thereby allowing higher-order matching between 
adjacent elements. 

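Further development of the element may take two alternative approaches. 
In the first approach, one expresses w in terms of q's and the shape 
functions: 

(12)  $w(x,y) = \sum_{i=1}^{2}\sum_{j=1}^{2} \left[ h_{ij}(\xi,\eta)\, w(\xi_i,\eta_j) + h_{xij}(\xi,\eta)\, w_{,x}(\xi_i,\eta_j) + h_{yij}(\xi,\eta)\, w_{,y}(\xi_i,\eta_j) \right] = \mathbf{h}\,\mathbf{q}_B,$ 

where $x = a\xi$, $y = b\eta$, and 

$\mathbf{h} = [\, h_{11} \;\; h_{x11} \;\; h_{y11} \;\; h_{21} \;\; h_{x21} \;\; h_{y21} \;\; h_{22} \;\; h_{x22} \;\; \cdots \;\; h_{y12} \,],$ 

in which 

$h_{ij} = \frac{\xi_i \eta_j}{8}\,(2 + \xi\xi_i + \eta\eta_j - \xi^2 - \eta^2)(\xi + \xi_i)(\eta + \eta_j),$ 

(13)  $h_{xij} = \frac{\eta_j}{8}\,(\xi^2 - 1)(\xi + \xi_i)(\eta + \eta_j)\, a,$ 

$h_{yij} = \frac{\xi_i}{8}\,(\eta^2 - 1)(\xi + \xi_i)(\eta + \eta_j)\, b$ 

(i, j not summed). A substitution of Eq. (12) into Eq. (10) yields 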

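(14)  $\Pi_{Be} = \tfrac{1}{2}\,\mathbf{q}_B^T \mathbf{k}_B \mathbf{q}_B - \mathbf{q}_B^T \mathbf{f}_B,$ 

where the matrices $\mathbf{k}_B$, $\mathbf{f}_B$ are 

(15)  $\mathbf{k}_B = \int_{A_e} (\mathbf{d}_e \mathbf{d}\,\mathbf{h})^T \mathbf{D}_b\, (\mathbf{d}_e \mathbf{d}\,\mathbf{h})\; dx\,dy,$ 

(16)  $\mathbf{f}_B = \int_{A_e} p\,\mathbf{h}^T\, dx\,dy - \int_{\partial A_{Mne}} \bar{M}_{nn}\, \frac{\partial \mathbf{h}^T}{\partial n}\, ds + \int_{\partial A_{Qe}} \bar{Q}\,\mathbf{h}^T\, ds.$ 

In terms of the natural coordinates, Eqs. (15) and (16) can be written as 

(17)  $\mathbf{k}_B = ab \int_{-1}^{1}\int_{-1}^{1} (\mathbf{d}_e \mathbf{d}\,\mathbf{h})^T \mathbf{D}_b\, (\mathbf{d}_e \mathbf{d}\,\mathbf{h})\; d\xi\,d\eta,$ 

(18)  $\mathbf{f}_B = ab \int_{-1}^{1}\int_{-1}^{1} p(x,y)\,\mathbf{h}^T\, d\xi\,d\eta - \int_{\partial A_{Mne}} \bar{M}_{nn}\, \frac{\partial \mathbf{h}^T}{\partial n}\, ds + \int_{\partial A_{Qe}} \bar{Q}\,\mathbf{h}^T\, ds,$ 

where 

$(\mathbf{d}_e \mathbf{d}\,\mathbf{h})^T = \left[\, \frac{1}{a^2}\frac{\partial^2 \mathbf{h}^T}{\partial \xi^2} \;\;\; \frac{1}{b^2}\frac{\partial^2 \mathbf{h}^T}{\partial \eta^2} \;\;\; \frac{2}{ab}\frac{\partial^2 \mathbf{h}^T}{\partial \xi\,\partial \eta} \,\right].$ 

They are similar in form to Eqs. (18.14:15) and (18.14:16). Equations (17) 
and (18) can be evaluated analytically if the bending modulus matrix $\mathbf{D}_b$ 
and the pressure load p are polynomials, or numerically otherwise. 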
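
The interpolation properties of the shape functions in Eq. (13), and the nonconformity discussed after Eq. (11), can be checked numerically. The following is only a sketch: the function names `shape`, `d_dxi` and `normal_slope_on_edge` are hypothetical, the node order is assumed to follow that of $\mathbf{h}$, and the derivative is taken by a central difference rather than analytically.

```python
# Corner coordinates (xi_i, eta_j), assumed ordered as in h: (1,1), (2,1), (2,2), (1,2)
NODES = [(-1.0, -1.0), (1.0, -1.0), (1.0, 1.0), (-1.0, 1.0)]


def shape(xi, eta, xi_i, eta_j, a=1.0, b=1.0):
    """Return (h, h_x, h_y) of Eq. (13) for the corner (xi_i, eta_j)."""
    common = (xi + xi_i) * (eta + eta_j)
    h = xi_i * eta_j / 8.0 * (2.0 + xi * xi_i + eta * eta_j - xi**2 - eta**2) * common
    hx = a * eta_j / 8.0 * (xi**2 - 1.0) * common
    hy = b * xi_i / 8.0 * (eta**2 - 1.0) * common
    return h, hx, hy


def d_dxi(f, xi, eta, step=1e-6):
    """Central-difference derivative of f(xi, eta) with respect to xi."""
    return (f(xi + step, eta) - f(xi - step, eta)) / (2.0 * step)


def normal_slope_on_edge(eta, a=1.0):
    """w_,x on the edge xi = +1 when only w at the far corner (-1, -1) is one.

    That corner is not shared with the neighbor across xi = +1, so a
    nonzero value here means the normal slope cannot match in general.
    """
    f = lambda s, t: shape(s, t, -1.0, -1.0)[0]
    return d_dxi(f, 1.0, eta) / a
```

The normal slope returned by `normal_slope_on_edge` vanishes at the two ends of the edge but not in between, which is the behavior described in the text for the 12-parameter element.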

In the second approach, we obtain the relation between $\alpha$'s and q's in 
the matrix form 

$\mathbf{A}\boldsymbol{\alpha} = \mathbf{q},$ 

by evaluating w and its partial derivatives at the nodes. We evaluate the 
element matrices in terms of $\alpha$'s using w as defined in Eq. (11). We then 
express $\alpha$'s in terms of q's by the transformation matrix $\mathbf{A}^{-1}$ above. The 
components of the element matrices in terms of $\alpha$'s are mostly zero if $\mathbf{D}_b$ 
and p are constant. 
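
The second approach can be sketched as follows. The code builds $\mathbf{A}$ by evaluating $w$, $w_{,x}$, $w_{,y}$ of Eq. (11) at the four corners and recovers the shape functions as $\mathbf{h} = \mathbf{P}\mathbf{A}^{-1}$. It is an illustration under stated assumptions: the helper names are hypothetical, the corners are taken in the order $(-a,-b)$, $(a,-b)$, $(a,b)$, $(-a,b)$, and exact rational arithmetic is used only to make the check clean.

```python
from fractions import Fraction

# Exponents (m, n) of the 12 monomials x^m y^n in P of Eq. (11)
EXPONENTS = [(0, 0), (1, 0), (0, 1), (2, 0), (1, 1), (0, 2),
             (3, 0), (2, 1), (1, 2), (0, 3), (3, 1), (1, 3)]


def p_rows(x, y):
    """Rows of P, dP/dx and dP/dy evaluated at the point (x, y)."""
    w, wx, wy = [], [], []
    for m, n in EXPONENTS:
        w.append(x**m * y**n)
        wx.append(m * x**(m - 1) * y**n if m else 0)
        wy.append(n * x**m * y**(n - 1) if n else 0)
    return w, wx, wy


def build_A(a, b):
    """Stack w, w_x, w_y at the corners (-a,-b), (a,-b), (a,b), (-a,b)."""
    rows = []
    for x, y in [(-a, -b), (a, -b), (a, b), (-a, b)]:
        rows.extend(p_rows(x, y))
    return rows


def invert(matrix):
    """Gauss-Jordan inversion with exact rational arithmetic."""
    n = len(matrix)
    aug = [[Fraction(v) for v in row] + [Fraction(int(i == j)) for j in range(n)]
           for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [v / scale for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [v - factor * p for v, p in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def shape_functions(x, y, a, b):
    """h = P A^{-1} at (x, y); one entry per generalized coordinate."""
    A_inv = invert(build_A(a, b))
    P = p_rows(x, y)[0]
    return [sum(P[k] * A_inv[k][j] for k in range(12)) for j in range(12)]
```

For example, `shape_functions(Fraction(0), Fraction(0), Fraction(2), Fraction(3))` gives the values of all twelve shape functions at the element center for a 4 x 6 element; the first three entries agree with Eq. (13) evaluated at $\xi = \eta = 0$.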

Distorted quadrilateral elements based on the isoparametric transformation 
of Eq. (18.13:3) perform badly. In this case, the interpolated function 
cannot represent exactly a deformation of constant curvature except for 
parallelograms (Dawe 1966, Argyris 1966). 


Incompatible Triangular Element. The early development of triangular 
elements encountered considerable difficulty when attempting to construct 
the shape functions in terms of x, y directly. For an element with 
three degrees-of-freedom per node, as shown in Fig. 20.1:2, there are only 
nine parameters whereas the complete cubic polynomial has ten terms. Several 
possibilities have been investigated to limit the number of unknowns 
to nine but various problems arose, which include the lack of symmetric 
appearance of the ten terms, the non-uniqueness of shape functions, etc. 

The asymmetric appearance can be avoided by using the triangular 
coordinates. Bazeley et al. (1966) assumed w in the form 






Fig. 20.1:2. Triangular element with three degrees-of-freedom per node. 

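$w = \alpha_1\zeta_1 + \alpha_2\zeta_2 + \alpha_3\zeta_3 + \alpha_4\zeta_1\left(\zeta_2^2 + \frac{\zeta_2\zeta_3}{2}\right) + \alpha_5\zeta_1\left(\zeta_3^2 + \frac{\zeta_2\zeta_3}{2}\right) + \cdots + \alpha_9\zeta_3\left(\zeta_2^2 + \frac{\zeta_1\zeta_2}{2}\right).$ 

Expressing $\alpha$'s in terms of the nodal values $w_i$, $w_{,xi}$, $w_{,yi}$ of the three nodes, 
one obtains the shape functions associated with the nodal parameters. 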

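The element failed the patch test for arbitrary mesh configurations. 
The shape functions were subsequently amended to pass the patch test by 
Specht (1988), who assumed w in the form 

$w = [\, \zeta_1,\; \zeta_2,\; \zeta_3,\; \zeta_1\zeta_2,\; \zeta_2\zeta_3,\; \zeta_3\zeta_1,$ 

$\quad \zeta_1^2\zeta_2 + \zeta_1\zeta_2\zeta_3\{3(1-\mu_3)\zeta_1 - (1+3\mu_3)\zeta_2 + (1+3\mu_3)\zeta_3\}/2,$ 

$\quad \zeta_2^2\zeta_3 + \zeta_1\zeta_2\zeta_3\{3(1-\mu_1)\zeta_2 - (1+3\mu_1)\zeta_3 + (1+3\mu_1)\zeta_1\}/2,$ 

$\quad \zeta_3^2\zeta_1 + \zeta_1\zeta_2\zeta_3\{3(1-\mu_2)\zeta_3 - (1+3\mu_2)\zeta_1 + (1+3\mu_2)\zeta_2\}/2 \,]\,\boldsymbol{\alpha} = \mathbf{P}\boldsymbol{\alpha},$ 

where $\mu_i = (l_k^2 - l_j^2)/l_i^2$, with i, j, k being the cyclic permutations of 
subscripts 1, 2, 3 and l's being the side lengths of the triangle. The parameter 
$\mu_i$ relates to the normal derivative on side i by 

$\frac{\partial}{\partial n} = \frac{l_i}{4A}\left[\frac{\partial}{\partial\zeta_j} + \frac{\partial}{\partial\zeta_k} - 2\frac{\partial}{\partial\zeta_i} + \mu_i\left(\frac{\partial}{\partial\zeta_k} - \frac{\partial}{\partial\zeta_j}\right)\right].$ 

The shape functions corresponding to the nodal parameters $w$, $w_{,x}$, $w_{,y}$ at 
node i are 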


^3(i-l) + l 


Ci “I - 2(P^-j-6 Pfc+6) 

^3(t—1) + 2 

= 

bj (-^c+6 b k P l+6 

.^3(i-l)+3. 




where b's and c's are defined in Eq. (18.9:7) and P's are the components of 
$\mathbf{P}$. It can be shown that the shape functions are $C^0$ continuous for arbitrary 
$\mu$'s, which are chosen to satisfy the patch test. 

The element matrices are given by Eqs. (15) and (16) with $A_e$ being the 
area of the triangular element. The differential operator for the element 
stiffness matrix in Eq. (15) is defined in Eq. (20.1:2) as 

(19)  $\mathbf{d}_e\mathbf{d} = \frac{1}{4A^2}\sum_{i,j=1}^{3} \begin{bmatrix} b_i b_j \\ c_i c_j \\ 2 b_i c_j \end{bmatrix} \frac{\partial^2}{\partial\zeta_i\,\partial\zeta_j}.$ 

The integrals in Eqs. (15) and (16) can be evaluated analytically using 
the general formula Eq. (18.10:10). However, the explicit integration is 
lengthy and error prone (Cheung et al. 1968). It is more practical to use 
numerical integration. 

It is possible to construct simpler elements of $C^0$ continuity. Morley 
(1971) first proposed an element with w at the triangle vertices and its 
normal slope at the midsides of the element as nodal parameters. The 
shape functions form a complete quadratic polynomial and can represent 
constant curvatures. If the material and the thickness of the element are 
uniform, this element is identical to the hybrid stress element of the same 
displacement at the element boundaries because the constant curvatures 
satisfy the homogeneous moment equilibrium equations. With constant 
moments, the element satisfies exactly interelement equilibrium conditions 
(Hinton and Huang 1986, Zienkiewicz et al. 1990). 

The discussion up to this point has been focused on the approaches 
to overcome compatibility difficulties of the displacement model of plates. 
An alternative is to use the hybrid stress method (Pian 1964), to be 
discussed later. The success of Pian's approach in resolving the compatibility 
dilemma lies in its flexibility in the element formulation, whereby stress 
and displacement fields are assumed over different portions of the element. 
Hybrid stress plate elements have yielded excellent results for stress 
distribution, deflection and vibration (Mau et al. 1973, Pian and Mau 1972, 
Rossettos et al. 1972, Spilker and Munir 1980). 

20.2. REISSNER-MINDLIN PLATES 

For moderately thick plates (Fig. 16.15:5) the transverse shear stresses 
$\tau_{xz}$, $\tau_{yz}$ across the thickness of the plate become significant. The 
Reissner-Mindlin plate theory (Reissner 1946) includes transverse shear by allowing 
the rotations of the fibers normal to the midplane about the x-, y-axes 
to be different from the slopes of the midplane $w_{,y}$ and $-w_{,x}$. Like thin 
plates, the in-plane stretching and shear decouple from the bending and 
transverse shear deformations and can be considered separately. The theory 
still assumes that the stress normal to the plate is zero, i.e., $\sigma_{zz} = 0$, and 
that the displacements off the midplane are 

(1)  $u = -z\theta_x, \quad v = -z\theta_y, \quad w = w(x,y),$ 

where z is the distance off the midplane, and $\theta_y$ and $-\theta_x$ are the rotations 
of the cross sectional plane about the x- and y-axes, respectively. The 
transverse shear strains are then $w_{,x} - \theta_x$ and $w_{,y} - \theta_y$. Strictly speaking, 
the assumption $\sigma_{zz} = 0$ contradicts the assumption $w = w(x,y)$. However, 
the inconsistency does not affect the usefulness of the resulting theory in 
practical engineering applications. 

Following the procedure for the Kirchhoff plate theory, one can derive 
the finite element equations for Reissner-Mindlin plates. The bending and 
transverse shear strains and the rotation matrices are defined as 

(2)  $\boldsymbol{\kappa} = [\kappa_x \;\; \kappa_y \;\; \kappa_{xy}]^T = \mathbf{d}_e\boldsymbol{\theta}, \quad \boldsymbol{\gamma} = [\gamma_x \;\; \gamma_y]^T = \mathbf{d}w - \boldsymbol{\theta}, \quad \boldsymbol{\theta} = [\theta_x \;\; \theta_y]^T.$ 

The matrix constitutive law is 

(3)  $\mathbf{M} = -\mathbf{D}_b\mathbf{d}_e\boldsymbol{\theta}, \quad \mathbf{Q} = \mathbf{D}_s(\mathbf{d}w - \boldsymbol{\theta}),$ 

where $\mathbf{D}_b$ is the bending rigidity matrix and $\mathbf{D}_s$ is the shear rigidity 
matrix. Both $\mathbf{D}_b$ and $\mathbf{D}_s$ are symmetric and positive definite. For isotropic 
materials, $\mathbf{D}_b$ is given in Eq. (20.1:5) and 

(4)  $\mathbf{D}_s = \kappa t G\,\mathbf{I},$ 

where $\kappa$ (= 5/6) is a constant to account for the nonuniform shear strain 
distribution through the thickness. The element functional for the principle 
of minimum potential energy becomes 

(5)  $\Pi_{Be} = \Pi_{be} + \Pi_{se} - \Pi_{bfe},$ 

where 

(6)  $\Pi_{be} = \frac{1}{2}\int_{A_e} (\mathbf{d}_e\boldsymbol{\theta})^T \mathbf{D}_b\, \mathbf{d}_e\boldsymbol{\theta}\; dx\,dy,$ 

(7)  $\Pi_{se} = \frac{1}{2}\int_{A_e} \boldsymbol{\gamma}^T \mathbf{D}_s\, \boldsymbol{\gamma}\; dx\,dy,$ 

(8)  $\Pi_{bfe} = \int_{A_e} p\,w\; dx\,dy - \int_{\partial A_{Mne}} \bar{M}_{nn}\theta_n\, ds - \int_{\partial A_{Mse}} \bar{M}_{ns}\theta_s\, ds + \int_{\partial A_{Qe}} \bar{Q}\,w\, ds,$ 

where $\Pi_{be}$, $\Pi_{se}$, $\Pi_{bfe}$ are associated with the bending energy, the transverse 
shear energy and the work by external loads, respectively. The boundary 
conditions are divided into three sets: 

Set 1. 

(9)  $w = \bar{w}$ on $\partial A_{we}$, 

(10)  $n_x Q_x + n_y Q_y = \bar{Q}$ on $\partial A_{Qe}$, 

Set 2. 

(11)  $n_x\theta_x + n_y\theta_y = \bar{\theta}_n$ on $\partial A_{\theta ne}$, 

(12)  $n_x^2 M_x + 2 n_x n_y M_{xy} + n_y^2 M_y = \bar{M}_{nn}$ on $\partial A_{Mne}$; 

Set 3. 

(13)  $-n_y\theta_x + n_x\theta_y = \bar{\theta}_s$ on $\partial A_{\theta se}$, 

(14)  $n_x n_y (M_y - M_x) + (n_x^2 - n_y^2) M_{xy} = \bar{M}_{ns}$ on $\partial A_{Mse}$, 

where M's and Q's are in terms of $\theta_x$, $\theta_y$ and w through the constitutive 
laws given in Eq. (3), $\theta_s$, $-\theta_n$ are the rotations about the normal and the 
tangent to the boundary, and the subscripted $\partial A$'s are part of the element 
boundaries that are also the domain boundaries with the first subscript 
denoting the prescribed quantity. At any given boundary location, one and 
only one condition from each of the three sets above is to be specified. 
Equations (9), (11) and (13) are rigid boundary conditions while Eqs. (10), 
(12) and (14) are natural for the functional Eq. (5). The admissibility 
requirements are: $\theta$'s are $C^0$ continuous, w is piecewise continuous and 
both of them satisfy rigid boundary condition(s) if prescribed. 

In the Reissner-Mindlin theory, there are three boundary conditions 
at a given point as compared to two for thin plate theory. The so-called 
simply supported condition (i.e., prescribed w and $M_{nn}$ in the thin plate 
theory) refers to the specification of w, $M_{nn}$ and $M_{ns}$ at the boundary as soft 
support (a more realistic condition) and the specification of w, $M_{nn}$ and $\theta_s$ 
as hard support. A boundary subjected to both rigid and natural conditions 
is called a mixed boundary, e.g., a boundary with prescribed displacement 
and moments. A simple support is a mixed boundary condition. 

The functional in Eq. (5) involves the first order differentials of $\theta_x$, $\theta_y$, 
and w. The bending strain energy depends on the rotations in a manner 
analogous to the dependence of membrane strain energy on the in-plane 
displacements. Thus, the formulation of the bending part of Reissner-Mindlin 
plate elements is the same as the formulation of plane elastic elements. One 
simply uses the $C^0$ shape functions derived in Sec. 18.14 [with the 
substitution of $\boldsymbol{\theta}$ for $\mathbf{u}$ in Eq. (18.14:9) and $\mathbf{D}_b$ for $\mathbf{D}_e$ in Eq. (18.14:13)] to 
construct the element matrices for bending. 

To evaluate the strain energy associated with the transverse shear, one 
can use the same type of shape functions for w. In this case, $\boldsymbol{\theta}$ will contain 
higher degree terms not present in $w_{,x}$ and $w_{,y}$. These higher degree terms 
cause transverse shear locking in thin plate applications. For example, for 
a pure bending deformation, the change of curvature is constant and the 
transverse shear is zero. This implies that $\boldsymbol{\theta}$ is linear and w quadratic. 
Therefore, unless the shape functions can correctly represent a quadratic 
field, locking will occur. Increasing the order of the approximation of $\boldsymbol{\theta}$ and 
w can eliminate or minimize locking. However, this will be at the expense 
of an increase in the number of degrees-of-freedom per element. One can get 
a similar result by increasing the order of approximation for w to one 
order higher than that of $\boldsymbol{\theta}$. 

A simple alternative is to use uniformly or selectively reduced integration. 
In uniformly reduced integration, both the bending and shear terms 
are integrated with the same lower-order integration rule. In selectively 
reduced integration, the bending term is integrated with the normal rule, 
whereas the transverse shear term is integrated with a lower-order rule. Unfortunately, 
reduced integration introduces spurious zero energy mode(s), which can 
cause serious errors in applications. A summary of integration rules for 
quadrilateral elements and their associated zero-energy modes for the case 
of parallelogram is given in Table 20.2:1. 

Table 20.2:1. Reduced integration rules and the associated zero-energy modes for thick 
parallelogram plates. 

Element | 4-node | 8-node (Serendipity) | 9-node (Lagrange) 

Shape function for $w$, $\theta_x$, $\theta_y$ | Bilinear | Incomplete biquadratic | Biquadratic 

Selectively reduced integration | 1 x 1 shear, 2 x 2 bending (S1) | 2 x 2 shear, 3 x 3 bending (S2) | 2 x 2 shear, 3 x 3 bending (S2) 

Zero-energy modes | 1. $w = \xi\eta$, $\theta_x = \theta_y = 0$ (G); 2. $w = 0$, $\theta_x = y$, $\theta_y = -x$ (G) | - | 1. $w = \xi^2\eta^2 - (\xi^2 + \eta^2)/3$, $\theta_x = \theta_y = 0$ (G) 

Uniformly reduced integration | 1 x 1 (U1) (Gauss station $\xi = \eta = 0$) | 2 x 2 (U2) (Gauss stations $\xi = \eta = \pm 1/\sqrt{3}$) | 2 x 2 (U2) 

Zero-energy modes for parallelogram, 4-node: in addition to those of S1, 
  3. $w = \theta_y = 0$, $\theta_x = \xi\eta$ (G); 
  4. $w = \theta_x = 0$, $\theta_y = \xi\eta$ (G). 

Zero-energy modes for parallelogram, 8-node ($\xi_{,x}$, $\xi_{,y}$, $\eta_{,x}$, $\eta_{,y}$ are constant): 
  1. $w = 0$ (N), 
     $\theta_x = \xi(\eta^2 - \tfrac{1}{3})\xi_{,x} - \eta(\xi^2 - \tfrac{1}{3})\eta_{,x}$, 
     $\theta_y = \xi(\eta^2 - \tfrac{1}{3})\xi_{,y} - \eta(\xi^2 - \tfrac{1}{3})\eta_{,y}$. 

Zero-energy modes for parallelogram, 9-node: in addition to that of S2, 
  2. $w = \theta_x = 0$ (G), $\theta_y = (\xi^2 - \tfrac{1}{3})(\eta^2 - \tfrac{1}{3})$; 
  3. $w = \theta_y = 0$ (G), $\theta_x = (\xi^2 - \tfrac{1}{3})(\eta^2 - \tfrac{1}{3})$; 
  4. $w = 0$ (N), 
     $\theta_x = \xi(\eta^2 - \tfrac{1}{3})\xi_{,x} - \eta(\xi^2 - \tfrac{1}{3})\eta_{,x}$, 
     $\theta_y = \xi(\eta^2 - \tfrac{1}{3})\xi_{,y} - \eta(\xi^2 - \tfrac{1}{3})\eta_{,y}$. 

S - selectively reduced integration; U - uniformly reduced integration; 
G - global communicating mode; N - non communicating mode. 


Problem 20.1. The Legendre polynomial $f_n(\xi)$ of degree n satisfies the 
recursive formula 

$(n+1)\, f_{n+1}(\xi) = (2n+1)\,\xi f_n(\xi) - n\, f_{n-1}(\xi).$ 

Show that 

(a) $f_0(\xi) = 1$, $f_1(\xi) = \xi$, $f_2(\xi) = (3\xi^2 - 1)/2$, $f_3(\xi) = \xi(5\xi^2 - 3)/2, \ldots$ 

(b) for a parallelogram, the displacement and rotations 

$w = 0, \quad \theta_x = f_n'(\xi) f_n(\eta)\,\xi_{,x} - f_n(\xi) f_n'(\eta)\,\eta_{,x},$ 
$\theta_y = f_n'(\xi) f_n(\eta)\,\xi_{,y} - f_n(\xi) f_n'(\eta)\,\eta_{,y}$ 

give zero bending and transverse shear strains at the integration points of the n x n 
Gauss quadrature, where $(\cdot)'$ denotes the derivative of the function. Note that 
$f_n(\xi)$, $f_n(\eta)$ are equal to zero at the corresponding Gauss points. 
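
The recursion of Problem 20.1 is easy to check numerically. The sketch below (the function name `legendre` is hypothetical) evaluates $f_n$ from the three-term recursion, starting from $f_0 = 1$ and $f_1 = \xi$, so that one can confirm the closed forms in part (a) and the vanishing of $f_n$ at the n-point Gauss stations used in part (b).

```python
import math


def legendre(n, x):
    """f_n(x) from (k + 1) f_{k+1} = (2k + 1) x f_k - k f_{k-1}, f_0 = 1, f_1 = x."""
    prev, cur = 1.0, x
    if n == 0:
        return prev
    for k in range(1, n):
        prev, cur = cur, ((2 * k + 1) * x * cur - k * prev) / (k + 1)
    return cur


# Gauss stations of the 2-point and 3-point rules (roots of f_2 and f_3)
GAUSS_2 = [-1.0 / math.sqrt(3.0), 1.0 / math.sqrt(3.0)]
GAUSS_3 = [-math.sqrt(0.6), 0.0, math.sqrt(0.6)]
```

Evaluating `legendre(2, x)` at the entries of `GAUSS_2`, or `legendre(3, x)` at those of `GAUSS_3`, returns zero to rounding error.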

Locking Modes of 8-Node Quadrilateral. An 8-node serendipity element 
of general quadrilateral shape other than parallelogram cannot interpolate 
$w = x^2$ correctly, because the shape functions include only the terms 
$1, \xi, \eta, \xi^2, \xi\eta, \eta^2, \xi^2\eta, \xi\eta^2$, while $x^2$ $\{= [\sum_{i=1}^{8} h_i(\xi,\eta)\,x_i]^2\}$ involves $\xi^2\eta^2$. The 
incorrect interpolation will not only introduce spurious shear, causing 
locking, but also make the element fail the patch test. A remedy is to use 
reduced (2 x 2) integration (Zienkiewicz et al. 1971). The reduced integration 
element has received wide use, but behaves poorly in the thin plate 
limit (Pugh et al. 1978). Another remedy is to add a term to the shape 
functions to ensure correct interpolation of quadratic polynomials (MacNeal 
and Harder 1992). The 9-node Lagrange element (Fig. 18.13:3) includes the 
term $\xi^2\eta^2$ in the base functions. Unfortunately, the 9-node element has an 
additional spurious communicating zero energy mode, which if not 
suppressed can destroy the accuracy of analysis. Hughes and Cohen (1978) 
introduced the Heterosis element, which modifies the 9-node Lagrange 
element by using just the 8 boundary nodes for w while using 9 nodes for $\boldsymbol{\theta}$. They 
used the 2 x 2 integration rule for transverse shear and the 3 x 3 rule for 
bending curvature. 

To illustrate locking, consider the deformation 

$\theta_x = x^2 y, \quad \theta_y = \tfrac{1}{3}x^3, \quad w = \tfrac{1}{3}x^3 y,$ 

with the curvature and transverse shear strains 

(15)  $\kappa_{xx} = 2xy, \quad \kappa_{xy} = 2x^2, \quad \gamma_x = \gamma_y = \kappa_{yy} = 0.$ 

An 8-node rectangular (2a x 2b) element with bi-quadratic interpolation 
functions, in general, cannot correctly interpolate $x^3$. The finite element 
representation of such deformation is 

$\theta_x = x^2 y, \quad \theta_y = \tfrac{1}{3}x a^2, \quad w = \tfrac{1}{3}a^2 x y,$ 

(16)  $\kappa_{xx} = 2xy, \quad \kappa_{xy} = x^2 + a^2/3, \quad \gamma_y = \kappa_{yy} = 0, \quad \gamma_x = -y(x^2 - a^2/3).$ 

The transverse shear strain is spurious and $\kappa_{xy}$ is only approximate. 
The ratio of the strain energy from the strains given in Eq. (16), which 
includes the spurious shear energy, to the correct strain energy from Eq. (15) is, 
for $\kappa = 5/6$, 

$\frac{(\Pi_{Be})_{\text{finite element}}}{(\Pi_{Be})_{\text{exact}}} = \frac{10 b^2 + 6(1-\nu)a^2 + 10(1-\nu)a^2 b^2/(3t^2)}{10 b^2 + 9(1-\nu)a^2},$ 

whose spurious part is proportional to the square of the element size-to-thickness ratio 
($a^2/t^2$), where $\nu$ is Poisson's ratio. For a thin plate element of aspect ratio 
(a/b) about 1, the above energy ratio can be very large if the element size is 
much bigger than the plate thickness, which means locking for such 
deformation. For 2 x 2 reduced integration, shears are evaluated at $x = \pm a/\sqrt{3}$. 
At the Gauss points the shears are correct and thus locking is avoided. Even 
though the finite element representation of the twist curvature $\kappa_{xy}$ is 
incorrect, the influence is not serious. The shear locking mode and the incorrect 
twist curvature are not the elastic modes required for convergence, so that 
convergence is slow but not affected when the size of elements progressively 
reduces. 
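
The effect of the integration rule on the spurious shear energy can be illustrated with a short calculation. The sketch below integrates the strains of Eqs. (15) and (16) over a 2a x 2b rectangle with Gauss rules; the function names `integrate` and `energies`, the unit Young's modulus, and the isotropic energy densities (bending rigidity $Et^3/[12(1-\nu^2)]$, shear rigidity $\kappa G t$ with $\kappa = 5/6$) are assumptions made for this illustration only.

```python
import math

# Gauss-Legendre points and weights on [-1, 1]
GAUSS = {
    2: [(-1.0 / math.sqrt(3.0), 1.0), (1.0 / math.sqrt(3.0), 1.0)],
    3: [(-math.sqrt(0.6), 5.0 / 9.0), (0.0, 8.0 / 9.0), (math.sqrt(0.6), 5.0 / 9.0)],
}


def integrate(f, a, b, order):
    """Gauss-Legendre integral of f(x, y) over the rectangle [-a, a] x [-b, b]."""
    return sum(wx * wy * f(a * px, b * py) * a * b
               for px, wx in GAUSS[order] for py, wy in GAUSS[order])


def energies(a, b, t, nu, shear_order, E=1.0, kappa=5.0 / 6.0):
    """Return (exact bending, FE bending, FE shear) energies for Eqs. (15)-(16)."""
    D = E * t**3 / (12.0 * (1.0 - nu**2))      # isotropic bending rigidity
    Gs = kappa * E * t / (2.0 * (1.0 + nu))    # shear rigidity kappa * G * t

    def bend(kxx, kxy):
        # kappa_yy = 0 in both fields, so only kxx and kxy contribute
        return lambda x, y: 0.5 * D * (kxx(x, y)**2 + 0.5 * (1.0 - nu) * kxy(x, y)**2)

    exact = integrate(bend(lambda x, y: 2 * x * y, lambda x, y: 2 * x * x), a, b, 3)
    fe_bend = integrate(bend(lambda x, y: 2 * x * y,
                             lambda x, y: x * x + a * a / 3.0), a, b, 3)
    fe_shear = integrate(lambda x, y: 0.5 * Gs * (y * (x * x - a * a / 3.0))**2,
                         a, b, shear_order)
    return exact, fe_bend, fe_shear
```

With `shear_order=3` the shear term is integrated exactly and dominates for thin plates; with `shear_order=2` the shear strain is sampled only at $x = \pm a/\sqrt{3}$, where it vanishes, so the spurious energy disappears.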

Problem 20.2. Consider a beam along the x-axis subjected to a concentrated 
load at x = 0. Show that the deflection near the origin can be expressed in 
the form $a_0 + a_1 x + a_2 x^2 + a_3 x^3 + \cdots$, where $a_3 \neq 0$. 

Four-Node Element. A four-node quadrilateral cannot correctly represent 
a quadratic displacement field. However, if there are locations at 
which the first derivatives of a quadratic field are correct, one can obtain 
the correct strains by using only the values at those points. For example, 
the spurious transverse shear strains $\gamma_x = x$ and $\gamma_y = y$ cause locking for 
pure bending ($\theta_x = x$, $\theta_y = 0$, $w = x^2/2$, or $\theta_x = 0$, $\theta_y = y$, $w = y^2/2$; 
see Sec. 18.20). But their values at x, y = 0 are correct. One can obtain 
correct shear strains by using only the value of $\gamma_x$ at x = 0 and the value 
of $\gamma_y$ at y = 0. Hughes et al. (1977) used a single-point quadrature for 
both transverse shears. MacNeal (1978) used the values of $\gamma_x$ at $\xi = 0$, 
$\eta = \pm 1/\sqrt{3}$ and the values of $\gamma_y$ at $\xi = \pm 1/\sqrt{3}$, $\eta = 0$. Both approaches 
work well for rectangular elements but fail to eliminate shear locking for 
elements of other shapes. 
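
A small numerical experiment makes the point about sampling locations concrete. In the sketch below (function names are hypothetical), the pure bending field $\theta_x = x$, $w = x^2/2$ is interpolated bilinearly on a rectangle of half-width a, and the shear strain is formed with the convention $\gamma_x = w_{,x} - \theta_x$ of Eq. (20.2:2). The interpolated w is constant, so the computed shear has magnitude |x| and vanishes only on the line x = 0.

```python
CORNERS = [(-1, -1), (1, -1), (1, 1), (-1, 1)]


def bilinear(nodal, xi, eta):
    """Bilinear interpolation of four corner values at (xi, eta)."""
    return sum(v * (1 + xi * ci) * (1 + eta * ce) / 4.0
               for v, (ci, ce) in zip(nodal, CORNERS))


def shear_x(xi, eta, a=1.0, step=1e-6):
    """gamma_x = w_,x - theta_x of the bilinear interpolant of pure bending."""
    w_nodes = [(a * ci)**2 / 2.0 for ci, _ in CORNERS]   # w = x^2 / 2 at the corners
    th_nodes = [a * ci for ci, _ in CORNERS]             # theta_x = x at the corners
    dw_dxi = (bilinear(w_nodes, xi + step, eta)
              - bilinear(w_nodes, xi - step, eta)) / (2.0 * step)
    return dw_dxi / a - bilinear(th_nodes, xi, eta)
```

Sampling `shear_x` at $\xi = 0$ returns zero, which is the correct shear for pure bending, whereas any other sampling point returns a spurious value.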

Zienkiewicz et al. (1971), Pawsey and Clough (1971), Cook (1974), 
Zienkiewicz and Hinton (1976), and Pugh et al. (1978) examined the 
performance of rectangular elements by analyzing a square plate (2a x 2a) 
subjected to a concentrated load at the center. As a/t becomes large, the 
results of the 8-node serendipity element (Fig. 18.13:3) with full integration 
deteriorate rapidly from the thin plate solution. Reduced integration 
improves the result drastically for the simply supported case. If the boundaries 
are clamped, the answers are not satisfactory even with reduced integration 
when a/t is very large. The 9-node Lagrange element with reduced 
integration performs well in most circumstances. 

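Elements with Discrete Constraints. For general quadrilateral elements, 
reduced integration generally fails to eliminate locking. One alternative 
is to impose discrete constraints on the shear strains. In evaluating 
$\Pi_{Be}$ of Eq. (5), one assumes 

(17)  $\boldsymbol{\gamma} = \begin{bmatrix}\gamma_x \\ \gamma_y\end{bmatrix} = \mathbf{P}(\xi,\eta)\,\boldsymbol{\alpha}, \quad \boldsymbol{\theta} = \begin{bmatrix}\theta_x \\ \theta_y\end{bmatrix} = \mathbf{h}_\theta\mathbf{q}_\theta, \quad w = \mathbf{h}_w\mathbf{q}_w,$ 

where $\mathbf{P}$ is a polynomial matrix of the natural coordinates, $\boldsymbol{\alpha}$ is a column 
matrix of unknown parameters, and $\mathbf{h}_\theta$, $\mathbf{h}_w$ are interpolation matrices 
associated with nodal parameter matrices $\mathbf{q}_\theta$, $\mathbf{q}_w$ for the field variables $\boldsymbol{\theta}$ and w, 
respectively. Rather than evaluating the transverse shear strains $\boldsymbol{\gamma}$ according 
to the strain/displacement relations Eq. (2), one uses the approximation 
of Eq. (17) for $\boldsymbol{\gamma}$ in Eq. (7) and obtains 

(18)  $\Pi_{se} = \frac{1}{2}\,\boldsymbol{\alpha}^T\left(\int_{A_e}\mathbf{P}^T\mathbf{D}_s\mathbf{P}\; dx\,dy\right)\boldsymbol{\alpha} = \frac{1}{2}\,\boldsymbol{\alpha}^T\mathbf{k}_{s\alpha}\boldsymbol{\alpha}.$ 

The $\alpha$'s are to be expressed in terms of the nodal values of $\boldsymbol{\theta}$ and w by 
collocating the tangential component of $\boldsymbol{\gamma}$ at selected points with that derived 
from the assumed $\boldsymbol{\theta}$ and w according to Eq. (2): 

$\boldsymbol{\alpha} = \mathbf{A}_\theta\mathbf{q}_\theta + \mathbf{A}_w\mathbf{q}_w.$ 

Then Eq. (18) can be written as 

$\Pi_{se} = \frac{1}{2}\,[\mathbf{q}_\theta^T \;\; \mathbf{q}_w^T]\;\mathbf{k}_s \begin{bmatrix}\mathbf{q}_\theta \\ \mathbf{q}_w\end{bmatrix},$ 

where 

$\mathbf{k}_s = [\mathbf{A}_\theta \;\; \mathbf{A}_w]^T\,\mathbf{k}_{s\alpha}\,[\mathbf{A}_\theta \;\; \mathbf{A}_w].$ 

The other element matrices are evaluated according to Eqs. (6) and (8). 

Hughes and Tezduyar (1981), and MacNeal (1982) assume 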


7 = 


di 

a i+3 


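for triangles and 

$\boldsymbol{\gamma} = \mathbf{J}^{-1}\bar{\boldsymbol{\gamma}} = \mathbf{J}^{-1}\begin{bmatrix} 1 & \eta & 0 & 0 \\ 0 & 0 & 1 & \xi \end{bmatrix}\boldsymbol{\alpha},$ 

for quadrilaterals, where $\bar{\boldsymbol{\gamma}}$ is the strain associated with the natural 
coordinates and $\mathbf{J}^{-1}$ is the inverse of the coordinate transformation matrix: 

$\mathbf{J}^{-1} = \frac{1}{x_{,\xi}\,y_{,\eta} - y_{,\xi}\,x_{,\eta}}\begin{bmatrix} y_{,\eta} & -y_{,\xi} \\ -x_{,\eta} & x_{,\xi} \end{bmatrix}.$ 

The tangential shear strain along each side of the assumed $\boldsymbol{\gamma}$ is set equal to 
that derived from the assumed $\boldsymbol{\theta}$ and w at the midpoint of each side. 
Discrete constraints have achieved varied degrees of success in reducing locking. 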


Problem 20.3. (a) Derive the element matrices of a quadrilateral element with 
8 degrees-of-freedom (DOFs) for $\boldsymbol{\theta}$ and 8 DOFs for w as shown. (b) Show that the 
element no longer locks for the deformations 

$\theta_x = x, \quad \theta_y = 0, \quad w = x^2/2,$ 

and 

$\theta_x = 0, \quad \theta_y = y, \quad w = y^2/2.$ 

(c) Program the new element and solve the bending of a cantilever plate using 
a rectangular mesh and the mesh as shown in Fig. P20.3. Compare the results to the 
quadrilateral elements with 8 DOFs for $\boldsymbol{\theta}$ and 4 DOFs at the vertices for w. 

Fig. P20.3. Legend: ($w$, $\theta_x$, $\theta_y$) node; w node; $\boldsymbol{\theta}$ node ($\theta_x$, $\theta_y$). 



Problem 20.4. Repeat Problem 20.3 with triangular elements of 6 DOFs 
for $\boldsymbol{\theta}$ and 6 DOFs (3 at the vertices and 3 at the midsides) for w and compare 
with the results of triangles of 6 DOFs for $\boldsymbol{\theta}$ and 3 DOFs at the vertices for w. 
The mesh is the same as before with each quadrilateral replaced by two triangles. 


20.3. MIXED FUNCTIONALS FOR REISSNER PLATE THEORY 

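A mixed Reissner functional (Herrmann 1966, Pian and Tong 1972, 
1980, 1987) for an element can be written in the same form as Eq. (20.2:5) 

(1)  $\Pi_{Be} = \Pi_{be} + \Pi_{se} - \Pi_{bfe},$ 

where $\Pi_{be}$, $\Pi_{bfe}$ are the same as those of the displacement formulation in 
Eqs. (20.2:6) and (20.2:8) while 

(2)  $\Pi_{se} = \int_{A_e}\left[\mathbf{Q}^T(\mathbf{d}w - \boldsymbol{\theta}) - \frac{1}{2}\,\mathbf{Q}^T\mathbf{D}_s^{-1}\mathbf{Q}\right] dx\,dy$ 

is the Reissner functional for the transverse shear energy where $\mathbf{Q}$ is a new 
independent field variable. An alternative expression of $\Pi_{se}$ is 

(2a)  $\Pi_{se} = \int_{A_e}\left[\boldsymbol{\gamma}^T\mathbf{D}_s(\mathbf{d}w - \boldsymbol{\theta}) - \frac{1}{2}\,\boldsymbol{\gamma}^T\mathbf{D}_s\boldsymbol{\gamma}\right] dx\,dy,$ 

where $\boldsymbol{\gamma} = [\gamma_{xz} \;\; \gamma_{yz}]^T$ is an independent field representing the transverse 
shear strains within the element. Judicious selection of $\mathbf{Q}$ or $\boldsymbol{\gamma}$ can provide 
flexibility to avoid shear locking. 

The admissibility requirements for the independent fields ($\boldsymbol{\theta}$, $\boldsymbol{\gamma}$, w) in 
Eq. (2a) are $C^0$ continuity for $\boldsymbol{\theta}$ and w or $\gamma_n$, the component of $\boldsymbol{\gamma}$ normal 
to the interelement boundaries. We assume 

(3)  $\boldsymbol{\theta} = \mathbf{h}_\theta\mathbf{q}_\theta, \quad \boldsymbol{\gamma} = \mathbf{h}_\gamma\boldsymbol{\beta}, \quad w = \mathbf{h}_w\mathbf{q}_w,$ 

and derive the element matrices from Eq. (1), where h's are shape function 
matrices and q's, $\boldsymbol{\beta}$ are parameter matrices associated with variables 
denoted by their subscripts. Normally, one chooses h's to assure that $\boldsymbol{\theta}$ and w 
are $C^0$ continuous while $\mathbf{Q}$ or $\boldsymbol{\gamma}$ is independently assumed for each element. 
In this case, $\boldsymbol{\beta}$ can be eliminated at the element level from Eq. (2a). A 
possible $\mathbf{h}_\gamma$ is 

(4)  $\mathbf{h}_\gamma = \begin{bmatrix} 1 & 0 & x & 0 & y & 0 \\ 0 & 1 & 0 & y & 0 & x \end{bmatrix}.$ 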

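One can also assume $\mathbf{h}_\gamma$ in terms of polynomials of the natural coordinates 
$\xi$, $\eta$. Judicious selection of $\mathbf{Q}$ or $\boldsymbol{\gamma}$ to relieve locking will soon be discussed. 
A substitution of Eq. (3) into Eq. (1) yields 

(5)  $\Pi_{Be} = \frac{1}{2}\,\mathbf{q}_\theta^T\mathbf{k}_b\mathbf{q}_\theta + \boldsymbol{\beta}^T(\mathbf{g}_{\gamma w}\mathbf{q}_w - \mathbf{g}_{\gamma\theta}\mathbf{q}_\theta) - \frac{1}{2}\,\boldsymbol{\beta}^T\mathbf{c}\,\boldsymbol{\beta} - \mathbf{q}_w^T\mathbf{f}_w - \mathbf{q}_\theta^T\mathbf{f}_\theta,$ 

where the first term on the right hand side is derived from $\Pi_{be}$, the second 
and third terms from $\Pi_{se}$, and the last two terms from $\Pi_{bfe}$. Thus 

$\mathbf{k}_b = \int_{A_e}(\mathbf{d}_e\mathbf{h}_\theta)^T\mathbf{D}_b\,\mathbf{d}_e\mathbf{h}_\theta\; dx\,dy = \int_{A_e}\mathbf{B}_b^T\mathbf{D}_b\mathbf{B}_b\; dx\,dy,$ 

$\mathbf{g}_{\gamma w} = \int_{A_e}\mathbf{h}_\gamma^T\mathbf{D}_s\,\mathbf{d}\mathbf{h}_w\; dx\,dy, \quad \mathbf{g}_{\gamma\theta} = \int_{A_e}\mathbf{h}_\gamma^T\mathbf{D}_s\mathbf{h}_\theta\; dx\,dy,$ 

$\mathbf{c} = \int_{A_e}\mathbf{h}_\gamma^T\mathbf{D}_s\mathbf{h}_\gamma\; dx\,dy,$ 

$\mathbf{f}_w = \int_{A_e}\mathbf{h}_w^T\,p\; dx\,dy + \int_{\partial A_{Qe}}\mathbf{h}_w^T\,\bar{Q}\; ds,$ 

$\mathbf{f}_\theta = -\int_{\partial A_{Mne}}\mathbf{h}_\theta^T\begin{bmatrix} dy/ds \\ -dx/ds \end{bmatrix}\bar{M}_{nn}\; ds - \int_{\partial A_{Mse}}\mathbf{h}_\theta^T\begin{bmatrix} dx/ds \\ dy/ds \end{bmatrix}\bar{M}_{ns}\; ds.$ 

Note that 

$\theta_s = \left[\frac{dx}{ds} \;\; \frac{dy}{ds}\right]\boldsymbol{\theta}, \quad \theta_n = \left[\frac{dy}{ds} \;\; -\frac{dx}{ds}\right]\boldsymbol{\theta}.$ 

Since $\boldsymbol{\beta}$ for each element are independently chosen, from the stationary 
condition of $\Pi_{Be}$, with $\Pi_{se}$ of Eq. (2a), with respect to $\boldsymbol{\gamma}$, we obtain 

(6)  $\mathbf{c}\,\boldsymbol{\beta} = \mathbf{g}_{\gamma w}\mathbf{q}_w - \mathbf{g}_{\gamma\theta}\mathbf{q}_\theta, \quad \text{or} \quad \boldsymbol{\beta} = \mathbf{c}^{-1}(\mathbf{g}_{\gamma w}\mathbf{q}_w - \mathbf{g}_{\gamma\theta}\mathbf{q}_\theta).$ 

Using Eq. (6), we can eliminate $\boldsymbol{\beta}$ from Eq. (5) and obtain 

(7)  $\Pi_{Be} = \frac{1}{2}\,\mathbf{q}_\theta^T\mathbf{k}_b\mathbf{q}_\theta + \frac{1}{2}\,\mathbf{q}_B^T\mathbf{k}_s\mathbf{q}_B - \mathbf{q}_B^T\mathbf{f}_B = \frac{1}{2}\,\mathbf{q}_B^T\mathbf{k}_B\mathbf{q}_B - \mathbf{q}_B^T\mathbf{f}_B,$ 

in which the matrices are defined as follows: 

(8)  $\mathbf{q}_B = \begin{bmatrix}\mathbf{q}_w \\ \mathbf{q}_\theta\end{bmatrix}, \quad \mathbf{f}_B = \begin{bmatrix}\mathbf{f}_w \\ \mathbf{f}_\theta\end{bmatrix}, \quad \mathbf{k}_s = \begin{bmatrix}\mathbf{g}_{\gamma w}^T \\ -\mathbf{g}_{\gamma\theta}^T\end{bmatrix}\mathbf{c}^{-1}\,[\mathbf{g}_{\gamma w} \;\; -\mathbf{g}_{\gamma\theta}],$ 

$\mathbf{k}_B = \begin{bmatrix}\mathbf{g}_{\gamma w}^T\mathbf{c}^{-1}\mathbf{g}_{\gamma w} & -\mathbf{g}_{\gamma w}^T\mathbf{c}^{-1}\mathbf{g}_{\gamma\theta} \\ -\mathbf{g}_{\gamma\theta}^T\mathbf{c}^{-1}\mathbf{g}_{\gamma w} & \mathbf{k}_b + \mathbf{g}_{\gamma\theta}^T\mathbf{c}^{-1}\mathbf{g}_{\gamma\theta}\end{bmatrix}.$ 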
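
The elimination of $\boldsymbol{\beta}$ in Eqs. (6) to (8) is a static condensation at the element level. A minimal sketch is given below, with matrices stored as nested lists; the helper names `matmul`, `transpose`, `solve` and `condense` are hypothetical, and the elimination without pivoting assumes that $\mathbf{c}$ is symmetric positive definite, as it is when $\mathbf{D}_s$ is positive definite and the columns of $\mathbf{h}_\gamma$ are independent.

```python
def matmul(A, B):
    """Product of two matrices stored as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]


def transpose(A):
    return [list(col) for col in zip(*A)]


def solve(c, G):
    """Return c^{-1} G by Gauss-Jordan elimination (c assumed positive definite)."""
    n = len(c)
    aug = [list(map(float, c[i])) + list(map(float, G[i])) for i in range(n)]
    for k in range(n):
        piv = aug[k][k]
        aug[k] = [v / piv for v in aug[k]]
        for r in range(n):
            if r != k:
                f = aug[r][k]
                aug[r] = [v - f * p for v, p in zip(aug[r], aug[k])]
    return [row[n:] for row in aug]


def condense(k_b, c, g_w, g_t):
    """k_B of Eq. (8) for q_B = [q_w; q_theta] after eliminating beta."""
    G = [rw + [-v for v in rt] for rw, rt in zip(g_w, g_t)]   # [g_gw, -g_gtheta]
    k_s = matmul(transpose(G), solve(c, G))                   # G^T c^{-1} G
    n_w = len(g_w[0])
    k_B = [row[:] for row in k_s]
    for i, row in enumerate(k_b):                             # add k_b to the theta block
        for j, v in enumerate(row):
            k_B[n_w + i][n_w + j] += v
    return k_B
```

For instance, with one parameter each for $\boldsymbol{\gamma}$, w and $\boldsymbol{\theta}$, `condense([[5.0]], [[2.0]], [[1.0]], [[3.0]])` returns the 2 x 2 matrix with rows (0.5, -1.5) and (-1.5, 9.5), in agreement with the block form of $\mathbf{k}_B$ in Eq. (8).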





Tong and Pian (1969), Babuska (1971) and Brezzi (1974) examined the 
stability associated with assumed functions for multiple field functional. 
One issue is the existence of spurious modes in the finite element solution. 
To avoid spurious modes, the following stability criterion 

(9) n w < max(3, n c ) 

should be satisfied, where 3 is the number of rigid body modes, $n_c$ is the 
number of constraints on the parameters for w, and $n_\gamma$, $n_w$ are the number 
of parameters for $\boldsymbol{\gamma}$ and w, respectively. This stability criterion can be 
applied to an assembly of elements if all elements are similar. However, the 
criterion is not applicable if some elements of the assembly fail while some 
pass. 

Under the stationary condition 

(10)  $\delta\Pi_{se} = \int_{A_e}\delta\boldsymbol{\gamma}^T\mathbf{D}_s(\mathbf{d}w - \boldsymbol{\theta} - \boldsymbol{\gamma})\; dx\,dy = 0,$ 

for arbitrary $\delta\boldsymbol{\gamma}$, the second variation of the functional (1) with respect to 
its field variables is 

(11)  $\delta^2\Pi_{Be} = \frac{1}{2}\int_{A_e}\left[(\mathbf{d}_e\delta\boldsymbol{\theta})^T\mathbf{D}_b(\mathbf{d}_e\delta\boldsymbol{\theta}) - \delta\boldsymbol{\gamma}^T\mathbf{D}_s\,\delta\boldsymbol{\gamma}\right] dx\,dy.$ 

Thus, the mixed functional given in Eq. (1) is a minimum-maximum 
principle (Tong and Pian 1969). In other words, for fixed $\boldsymbol{\theta}$ and w ($\mathbf{q}_\theta$'s 
and $\mathbf{q}_w$'s), the functional is a maximum principle with respect to $\boldsymbol{\gamma}$ ($\beta$'s). 
With $\boldsymbol{\gamma}$ fixed or in terms of $\boldsymbol{\theta}$ and w, the functional [such as the one given 
in Eq. (7) for an element] is a minimum principle with respect to $\boldsymbol{\theta}$ and 
w. One can also show that the mixed functional is bounded from above 
by the functional for the principle of minimum potential energy defined in 
Eq. (20.2:5). In finite element formulation, increasing the number of $\beta$'s ($n_\gamma$) 
tends to increase the value of the maximum for given $\boldsymbol{\theta}$ and w and thus 
increases the minimum of the maxima. Physically, increasing the number of 
$\beta$'s makes the model stiffer. In developing a new element, if one starts 
with the minimum $n_\gamma$ needed by Eq. (9) and the element is 
too flexible, then one should increase $n_\gamma$; on the other hand, if the element is already 
too stiff, one has to reduce the number of $\beta$'s or increase the order of $\boldsymbol{\theta}$ 
and/or w at the risk of introducing spurious mode(s). 

Field-Consistent Element. We can examine locking relief from another 
point of view. For clarity, we shall consider isotropic materials only. 
Equations (20.2:6), (20.2:7) and (2a) become 

(12a)  $\Pi_{be} = \frac{1}{2}\int_{A_e}\frac{Et^3}{12(1-\nu^2)}\left[\theta_{x,x}^2 + \theta_{y,y}^2 + 2\nu\,\theta_{x,x}\theta_{y,y} + \frac{1-\nu}{2}(\theta_{x,y} + \theta_{y,x})^2\right] dx\,dy,$ 

(12b)  $\Pi_{se} = \int_{A_e}\frac{\kappa E t}{4(1+\nu)}\left[(w_{,x} - \theta_x)^2 + (w_{,y} - \theta_y)^2\right] dx\,dy$  (displacement model), 

(12c)  $\Pi_{se} = \int_{A_e}\frac{\kappa E t}{2(1+\nu)}\left[\gamma_{xz}(w_{,x} - \theta_x) + \gamma_{yz}(w_{,y} - \theta_y) - \frac{\gamma_{xz}^2 + \gamma_{yz}^2}{2}\right] dx\,dy$  (mixed model). 

4-Node Field-Consistent Rectangle. Consider a 4-node 2l x 2h rectangular 
element with $x = l\xi$ and $y = h\eta$. We can write 

(13)  $w = a_1 + a_2\xi + a_3\eta + a_4\xi\eta, \quad \theta_x = b_1 + b_2\xi + b_3\eta + b_4\xi\eta, \quad \theta_y = c_1 + c_2\xi + c_3\eta + c_4\xi\eta,$ 

where a's, b's and c's are functions of the nodal values of the corresponding 
variables, e.g. [see Eq. (18.20:6)], 

(14)  $a_1 = (w_1 + w_2 + w_3 + w_4)/4, \quad a_2 = (-w_1 + w_2 + w_3 - w_4)/4,$ 
      $a_3 = (-w_1 - w_2 + w_3 + w_4)/4, \quad a_4 = (w_1 - w_2 + w_3 - w_4)/4.$ 

In displacement formulation, the shear strains $\gamma_{xz}$, $\gamma_{yz}$ derived from the 
assumed w and $\boldsymbol{\theta}$ are 

(15)  $\gamma_{xz} = \frac{a_2}{l} - b_1 + \left(\frac{a_4}{l} - b_3\right)\eta - b_2\xi - b_4\xi\eta,$ 
      $\gamma_{yz} = \frac{a_3}{h} - c_1 + \left(\frac{a_4}{h} - c_2\right)\xi - c_3\eta - c_4\xi\eta.$ 

As t approaches zero, the shear strains of the finite element solution approach 
zero. This is equivalent to forcing 

(16a)  $\frac{a_2}{l} - b_1 \to 0, \quad \frac{a_4}{l} - b_3 \to 0,$ 

(16b)  $b_2 \to 0, \quad b_4 \to 0.$ 

Equations (16b) involve the coefficients of one field variable only and thus 
are considered to be spurious constraints that can lead to locking in the 
limit of thin plate. There are similar spurious shear constraints for c's. 
In the mixed formulation, we assume 

(17)  $\gamma_{xz} = \beta_1 + \beta_2\eta, \quad \gamma_{yz} = \beta_3 + \beta_4\xi.$ 

Using Eqs. (10) and (13), one can show that 

(18)  $\beta_1 = \frac{a_2}{l} - b_1 = \frac{-w_1 + w_2 + w_3 - w_4}{4l} - \frac{\theta_{x1} + \theta_{x2} + \theta_{x3} + \theta_{x4}}{4},$ 
      $\beta_2 = \frac{a_4}{l} - b_3 = \frac{w_1 - w_2 + w_3 - w_4}{4l} - \frac{-\theta_{x1} - \theta_{x2} + \theta_{x3} + \theta_{x4}}{4},$ 
      $\beta_3 = \frac{a_3}{h} - c_1 = \frac{-w_1 - w_2 + w_3 + w_4}{4h} - \frac{\theta_{y1} + \theta_{y2} + \theta_{y3} + \theta_{y4}}{4},$ 
      $\beta_4 = \frac{a_4}{h} - c_2 = \frac{w_1 - w_2 + w_3 - w_4}{4h} - \frac{-\theta_{y1} + \theta_{y2} + \theta_{y3} - \theta_{y4}}{4}.$ 

From Eq. (12c), one has 

(19)  $\Pi_{se} = \frac{\kappa E t\, l h}{1+\nu}\left[\beta_1^2 + \beta_3^2 + \frac{1}{3}(\beta_2^2 + \beta_4^2)\right],$ 

which can be expressed in terms of the nodal values w's and $\theta$'s as given in 
Eq. (18). One can then derive the element matrices as defined in Eq. (8) 
in a straightforward manner. 
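
Equations (18) and (19) translate directly into a few lines of code. The sketch below is an illustration, not the element implementation: the function names are hypothetical, the nodes are assumed to be ordered $(-1,-1)$, $(1,-1)$, $(1,1)$, $(-1,1)$ in the natural coordinates (consistent with the signs in Eq. (14)), and the helper `displacement_b2` returns the coefficient $b_2$ that the displacement model would force to zero through Eq. (16b).

```python
def betas(w, tx, ty, l, h):
    """beta_1..beta_4 of Eq. (18) from nodal w, theta_x, theta_y (4 values each)."""
    a2 = (-w[0] + w[1] + w[2] - w[3]) / 4.0
    a3 = (-w[0] - w[1] + w[2] + w[3]) / 4.0
    a4 = (w[0] - w[1] + w[2] - w[3]) / 4.0
    b1 = sum(tx) / 4.0
    b3 = (-tx[0] - tx[1] + tx[2] + tx[3]) / 4.0
    c1 = sum(ty) / 4.0
    c2 = (-ty[0] + ty[1] + ty[2] - ty[3]) / 4.0
    return a2 / l - b1, a4 / l - b3, a3 / h - c1, a4 / h - c2


def displacement_b2(tx):
    """Coefficient b_2 of xi in theta_x; Eq. (16b) forces it to zero as t -> 0."""
    return (-tx[0] + tx[1] + tx[2] - tx[3]) / 4.0


def shear_energy(beta, l, h, E, t, nu, kappa=5.0 / 6.0):
    """Pi_se of Eq. (19) for a 2l x 2h rectangle."""
    b1, b2, b3, b4 = beta
    return kappa * E * t * l * h / (1.0 + nu) * (b1**2 + b3**2 + (b2**2 + b4**2) / 3.0)
```

For the pure bending field $\theta_x = x$, $\theta_y = 0$, $w = x^2/2$, the nodal values give $\beta_1 = \beta_2 = \beta_3 = \beta_4 = 0$, so the mixed element stores no shear energy, whereas `displacement_b2` returns l, the term that locks the displacement model.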


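4-Node Field-Consistent Quadrilateral. For general quadrilateral elements, 
it is difficult to determine the appropriate $\boldsymbol{\gamma}$ precisely. We shall 
determine an approximate $\boldsymbol{\gamma}$ using the covariant rotations and shear strains 
defined as 

(20)  $\begin{bmatrix}\theta_\xi \\ \theta_\eta\end{bmatrix} = \mathbf{J}\begin{bmatrix}\theta_x \\ \theta_y\end{bmatrix}, \quad \begin{bmatrix}\gamma_{\xi z} \\ \gamma_{\eta z}\end{bmatrix} = \mathbf{J}\begin{bmatrix}\gamma_{xz} \\ \gamma_{yz}\end{bmatrix}, \quad \mathbf{J} = \begin{bmatrix} x_{,\xi} & y_{,\xi} \\ x_{,\eta} & y_{,\eta}\end{bmatrix}.$ 

Assuming w, $\theta_\xi$, $\theta_\eta$ in the form of Eq. (13) and $\gamma_{\xi z}$, $\gamma_{\eta z}$ in the form of 
Eq. (17) and, instead of following Eq. (10), using the approximate stationary 
condition 

(21)  $\int_{-1}^{1}\int_{-1}^{1}\left[\delta\gamma_{\xi z}(w_{,\xi} - \theta_\xi - \gamma_{\xi z}) + \delta\gamma_{\eta z}(w_{,\eta} - \theta_\eta - \gamma_{\eta z})\right] d\xi\,d\eta = 0,$ 

we find that the $\beta$'s of $\gamma_{\xi z}$ and $\gamma_{\eta z}$ are given by Eq. (18) with $\theta_x$ replaced 
by $\theta_\xi$ and $\theta_y$ by $\theta_\eta$. The nodal values of $\theta_\xi$, $\theta_\eta$ can be transformed to those 
of $\theta_x$, $\theta_y$ by Eq. (20). We then define 

(22)  $\begin{bmatrix}\gamma_{xz} \\ \gamma_{yz}\end{bmatrix} = \mathbf{J}^{-1}\begin{bmatrix}\beta_1 + \beta_2\eta \\ \beta_3 + \beta_4\xi\end{bmatrix},$ 

with $\beta$'s in terms of the nodal values of $\theta_x$, $\theta_y$ and w. We also approximate 
$\Pi_{se}$ of Eq. (12c) as 

(23)  $\Pi_{se} = \frac{1}{2}\int_{A_e}\frac{\kappa E t}{2(1+\nu)}\left(\gamma_{xz}^2 + \gamma_{yz}^2\right) dx\,dy.$ 

Using Eqs. (12a) and (23), we can derive the element stiffness matrix. This 
element passes the patch test. 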

9-Node Field-Consistent Quadrilateral. For a 9-node quadrilateral,
$w$, $\theta_\xi$, $\theta_\eta$ are biquadratic functions of the natural coordinates in the
form

(24)    $w = \sum_{i,j=0}^{2} a_{ij} \xi^i \eta^j , \qquad \theta_\xi = \sum_{i,j=0}^{2} b_{ij} \xi^i \eta^j , \qquad \theta_\eta = \sum_{i,j=0}^{2} c_{ij} \xi^i \eta^j ,$

where the $a$'s, $b$'s and $c$'s are related to the nodal values of $w$, $\theta_\xi$ and
$\theta_\eta$, respectively. The transverse shear strains can be expressed in terms
of Legendre polynomials of $\xi$, $\eta$ as follows:

(25)    $\gamma_{\xi z} = \sum_{j=0}^{2} \left[ a_{1j} - b_{0j} - \tfrac{1}{3} b_{2j} + (2 a_{2j} - b_{1j}) \xi - b_{2j} \left( \xi^2 - \tfrac{1}{3} \right) \right] \eta^j ,$
        $\gamma_{\eta z} = \sum_{j=0}^{2} \left[ a_{j1} - c_{j0} - \tfrac{1}{3} c_{j2} + (2 a_{j2} - c_{j1}) \eta - c_{j2} \left( \eta^2 - \tfrac{1}{3} \right) \right] \xi^j .$

The coefficients of the second-order Legendre polynomials $\xi^2 - 1/3$ and
$\eta^2 - 1/3$ depend on the parameters of one of the rotations only and,
therefore, are field-inconsistent. The element derived from such
field-inconsistent strains converges slowly and will have strong quadratic
oscillations in the shear stresses within an element.

Using the orthogonality properties of the Legendre polynomials, from the
approximate stationary condition Eq. (21), we find the field-consistent
shear strains

(26)    $\tilde\gamma_{\xi z} = \sum_{j=0}^{2} \left[ a_{1j} - b_{0j} - \tfrac{1}{3} b_{2j} + (2 a_{2j} - b_{1j}) \xi \right] \eta^j ,$
        $\tilde\gamma_{\eta z} = \sum_{j=0}^{2} \left[ a_{j1} - c_{j0} - \tfrac{1}{3} c_{j2} + (2 a_{j2} - c_{j1}) \eta \right] \xi^j .$

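The approximate field-consistent Cartesian strains are

(27)    $\begin{bmatrix} \tilde\gamma_{xz} \\ \tilde\gamma_{yz} \end{bmatrix} = \mathbf J^{-1} \begin{bmatrix} \tilde\gamma_{\xi z} \\ \tilde\gamma_{\eta z} \end{bmatrix} .$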


Using Eq. (27), together with Eqs. (12a) and (23), we can derive the element
stiffness matrix. This element is free of locking and its stresses calculated
from the field-consistent strains are smooth. It is much more difficult to
establish field-consistent strains for an 8-node quadrilateral because its
polynomial terms are incomplete for representing the strains. The approximate
field-consistent approach is a rational means to construct locking-free
elements, even though exact field-consistent shear is yet to be established
for general quadrilaterals.
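
The step from Eq. (25) to Eq. (26) is an $L_2$ projection in $\xi$: the term in $\xi^2 - 1/3$ is orthogonal to both 1 and $\xi$ on $[-1, 1]$, so it drops out and leaves a contribution of $-b_{2j}/3$ in the constant. The following is a minimal sketch of that projection for a single power $\eta^j$, using exact rational arithmetic. The function names are illustrative only and are not part of the text.

```python
from fractions import Fraction


def integrate_monomial(power):
    """Exact integral of xi**power over [-1, 1]."""
    return Fraction(0) if power % 2 else Fraction(2, power + 1)


def project_onto_linear(c0, c1, c2):
    """L2-project c0 + c1*xi + c2*xi**2 onto span{1, xi} on [-1, 1].

    Since 1 and xi are orthogonal on [-1, 1], the projected coefficients
    decouple: p0 = <f, 1>/<1, 1> and p1 = <f, xi>/<xi, xi>.
    """
    coeffs = [Fraction(c) for c in (c0, c1, c2)]
    f_dot_one = sum(c * integrate_monomial(k) for k, c in enumerate(coeffs))
    f_dot_xi = sum(c * integrate_monomial(k + 1) for k, c in enumerate(coeffs))
    p0 = f_dot_one / integrate_monomial(0)
    p1 = f_dot_xi / integrate_monomial(2)
    return p0, p1


def consistent_shear_coeffs(a1j, a2j, b0j, b1j, b2j):
    """Constant and linear-in-xi parts of the projected gamma_xi_z.

    The unprojected strain for one power eta**j is
    (a1j - b0j) + (2*a2j - b1j)*xi - b2j*xi**2.
    """
    return project_onto_linear(a1j - b0j, 2 * a2j - b1j, -b2j)
```

For instance, `project_onto_linear(0, 0, 1)` returns a constant part of 1/3 and a zero linear part, which is why the constant in Eq. (26) carries $-b_{2j}/3$.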


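Problem 20.5. Show that one can express the $a$'s of Eq. (20.3:24) in terms of
the nodal values of $w$ (Fig. 18.13:3b) as

$a_{00} = w_9 , \qquad a_{01} = \frac{w_7 - w_5}{2} , \qquad a_{02} = \frac{w_7 - 2 w_9 + w_5}{2} ,$

$a_{10} = \frac{w_6 - w_8}{2} , \qquad a_{11} = \frac{w_1 - w_2 + w_3 - w_4}{4} ,$

$a_{12} = \frac{-w_1 + w_2 + w_3 - w_4}{4} + \frac{w_8 - w_6}{2} ,$

$a_{20} = \frac{w_8 - 2 w_9 + w_6}{2} , \qquad a_{21} = \frac{-w_1 - w_2 + w_3 + w_4}{4} + \frac{w_5 - w_7}{2} ,$

$a_{22} = \frac{w_1 + w_2 + w_3 + w_4}{4} - \frac{w_5 + w_6 + w_7 + w_8}{2} + w_9 .$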
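
A quick numerical check of these expressions can be sketched as follows. The node numbering is an assumption, since Fig. 18.13:3b is not reproduced here: corners 1 to 4 run counterclockwise from $(-1,-1)$, midside nodes 5 to 8 start on the side $\eta = -1$, and node 9 is at the center. The function names are illustrative.

```python
# Assumed natural coordinates of the nine nodes (see the note above).
NODES = {1: (-1, -1), 2: (1, -1), 3: (1, 1), 4: (-1, 1),
         5: (0, -1), 6: (1, 0), 7: (0, 1), 8: (-1, 0), 9: (0, 0)}


def biquadratic_coefficients(w):
    """Return a[i][j], the coefficient of xi**i * eta**j, from w[1..9]."""
    corners_sum = w[1] + w[2] + w[3] + w[4]
    a = [[0.0] * 3 for _ in range(3)]
    a[0][0] = w[9]
    a[0][1] = (w[7] - w[5]) / 2
    a[0][2] = (w[7] - 2 * w[9] + w[5]) / 2
    a[1][0] = (w[6] - w[8]) / 2
    a[1][1] = (w[1] - w[2] + w[3] - w[4]) / 4
    a[1][2] = (-w[1] + w[2] + w[3] - w[4]) / 4 + (w[8] - w[6]) / 2
    a[2][0] = (w[8] - 2 * w[9] + w[6]) / 2
    a[2][1] = (-w[1] - w[2] + w[3] + w[4]) / 4 + (w[5] - w[7]) / 2
    a[2][2] = corners_sum / 4 - (w[5] + w[6] + w[7] + w[8]) / 2 + w[9]
    return a


def evaluate(a, xi, eta):
    """Evaluate the biquadratic polynomial with coefficients a at (xi, eta)."""
    return sum(a[i][j] * xi**i * eta**j for i in range(3) for j in range(3))
```

Evaluating the polynomial at each node should return that node's value of $w$, which confirms the interpolation property for arbitrary nodal data.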


20.4. HYBRID FORMULATIONS FOR PLATES 

The hybrid finite element method is an excellent alternative to overcome 
the compatibility difficulty for thin plate elements. The method is also a 
good alternative to overcome locking for thick plates. In this section, we 
shall discuss the hybrid stress method first proposed by Pian (1964) and 
the more general hybrid formulations for plate analysis. 


Assumed Stress Formulation. The functional in Eq. (20.3:1) becomes
the hybrid stress model if the transverse shears $\mathbf Q$ satisfy a priori the
moment equilibrium equation (16.15:13). Then $\Pi_{se}$ and $\Pi_{bfe}$ become

(1)    $\Pi_{se} = \int_{A_e} \left( -\mathbf Q^T \boldsymbol\theta - \tfrac{1}{2} \mathbf Q^T \mathbf D_s^{-1} \mathbf Q \right) dx \, dy + \int_{\partial A_e} w \, \mathbf Q^T \mathbf n \, ds ,$

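(2)    $\Pi_{bfe} = - \int_{\partial A_{M_{nn} e}} \bar M_{nn} \theta_n \, ds - \int_{\partial A_{M_{ns} e}} \bar M_{ns} \theta_s \, ds + \int_{\partial A_{Q e}} \bar Q w \, ds ,$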

where $\mathbf n$ is a unit outward normal to the element boundaries and the tilde
over $\mathbf Q$ is dropped for simplicity. The hybrid stress functional, like that
for the mixed formulation, involves the three fields $\boldsymbol\theta$, $\mathbf Q$ and $w$. However,
since $w$ is now only needed along the element boundaries, we have more
freedom to select the interpolation polynomial for $w$. Similarly to the mixed
formulation, we assume

(3)    $\boldsymbol\theta = \mathbf h_\theta \mathbf q_\theta , \qquad \mathbf Q = \mathbf Q_h + \mathbf Q_p = \mathbf h_Q \boldsymbol\beta + \mathbf Q_p , \qquad w = \mathbf h_w \mathbf q_w ,$


where the matrices $\mathbf h_\theta$ and $\mathbf h_w$ are the same as those in Eq. (20.3:3)
except that $\mathbf h_w$ is needed only along $\partial A_e$, and $\mathbf Q_h$ and $\mathbf Q_p$ are the
homogeneous and particular solutions of


$\frac{\partial Q_x}{\partial x} + \frac{\partial Q_y}{\partial y} = -p ,$


and 

(4)    $\mathbf h_Q = \begin{bmatrix} 1 & 0 & x & y & 0 \\ 0 & 1 & -y & 0 & x \end{bmatrix} .$


Within an element, one can always approximate $p(x, y)$ as a polynomial
and determine $\mathbf Q_p$ accordingly. The expression for $\mathbf Q_p$ is not unique, but
it generally has no effect on the order of accuracy of the finite element
solution. The order of accuracy is determined by the order of the complete
polynomial for $\mathbf Q_h$ or $\mathbf h_Q$. A substitution of Eq. (3) into Eq. (20.3:1) with
$\Pi_{se}$ and $\Pi_{bfe}$ given by Eqs. (1) and (2) gives $\Pi_{Be}$ in terms of $\mathbf q_w$, $\mathbf q_\theta$
and $\boldsymbol\beta$ (Tong and Rossettos 1977). If $\boldsymbol\beta$ is independently assumed for each
element, it can be eliminated at the element level based on the stationary
condition of $\Pi_{se}$. One can then derive the element matrices in terms of
$\mathbf q_\theta$, $\mathbf q_w$.
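
The defining property of $\mathbf h_Q$ in Eq. (4) is that every column is divergence-free, so $\mathbf Q_h = \mathbf h_Q \boldsymbol\beta$ satisfies the homogeneous equilibrium equation for any $\boldsymbol\beta$. The sketch below checks this numerically. The particular solution shown for a uniform load $p_0$ assumes the sign convention $\partial Q_x/\partial x + \partial Q_y/\partial y = -p$ and is only one of many admissible choices. All function names are illustrative.

```python
def shear_homogeneous(beta, x, y):
    """Q_h = h_Q(x, y) * beta for the 2 x 5 matrix h_Q of Eq. (4)."""
    b1, b2, b3, b4, b5 = beta
    q_x = b1 + b3 * x + b4 * y
    q_y = b2 - b3 * y + b5 * x
    return q_x, q_y


def shear_particular_uniform(p0, x, y):
    """One particular solution for uniform p0 (assumed sign: div Q = -p0)."""
    return -0.5 * p0 * x, -0.5 * p0 * y


def divergence(field, x, y, step=1e-3):
    """Central-difference estimate of dQx/dx + dQy/dy at (x, y)."""
    dqx_dx = (field(x + step, y)[0] - field(x - step, y)[0]) / (2 * step)
    dqy_dy = (field(x, y + step)[1] - field(x, y - step)[1]) / (2 * step)
    return dqx_dx + dqy_dy
```

Because both fields are linear in $x$ and $y$, the central difference is exact up to roundoff, so the check does not depend on the step size.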


Problem 20.6. Repeat Problem 20.3 by deriving the corresponding hybrid 
stress elements. 


General Hybrid Formulation. Introducing the Lagrangian multipliers
$\tilde w$, $\tilde Q_n$, one can write $\Pi_{se}$ of Eq. (20.3:2) in the form (Pian and Tong 1969,
1972, 1980)

$\Pi_{se} = \int_{\partial A_e} \tilde Q_n (\tilde w - w) \, ds + \int_{A_e} \left[ \mathbf Q^T (\boldsymbol\partial w - \boldsymbol\theta) - \tfrac{1}{2} \mathbf Q^T \mathbf D_s^{-1} \mathbf Q \right] dx \, dy .$

The new functional has five dependent fields $\boldsymbol\theta$, $\mathbf Q$, $w$, $\tilde Q_n$, $\tilde w$, of which
$\tilde Q_n$, $\tilde w$ are defined along the element boundaries only. The admissibility
requirements are $C^0$ continuity for $\boldsymbol\theta$ and $\tilde w$ at inter-element boundaries,
smoothness of the field variables within each element and the enforcement
of prescribed rigid constraints. The finite element equations can be
established by assuming the field variables in terms of interpolation
functions with unknown parameters

$\boldsymbol\theta = \mathbf h_\theta \mathbf q_\theta , \quad \mathbf Q = \mathbf h_Q \boldsymbol\beta , \quad w = \mathbf h_w \mathbf q_w , \quad \tilde Q_n = \mathbf h_{\tilde Q} \mathbf q_{\tilde Q} , \quad \tilde w = \mathbf h_{\tilde w} \mathbf q_{\tilde w} ,$

where the $\mathbf h$'s are interpolation matrices and the $\mathbf q$'s and $\boldsymbol\beta$ are unknown
parameter column matrices. In the present formulation, there is no
restriction on the selection of $\mathbf h_Q$ and $\mathbf h_w$, while $\mathbf h_{\tilde Q}$, $\mathbf h_{\tilde w}$ are assumed
along the element boundaries only. The functional is named the hybrid
assumed strain model (Tang 1983), if $\mathbf Q$ is expressed in terms of the
transverse shear $\boldsymbol\gamma$.

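If we set $\tilde{\boldsymbol\gamma} = \boldsymbol\gamma = \boldsymbol\partial w - \boldsymbol\theta$, $\mathbf Q = \mathbf D_s \boldsymbol\gamma$ and $\tilde Q_n = \mathbf n^T \mathbf D_s \boldsymbol\gamma$ in $\Pi_{se}$
above, where $\mathbf n$ is a unit outward normal to $\partial A_e$, the functional becomes

(5)    $\Pi_{se} = \int_{\partial A_e} \mathbf n^T \mathbf D_s \boldsymbol\gamma \, (\tilde w - w) \, ds + \frac{1}{2} \int_{A_e} \boldsymbol\gamma^T \mathbf D_s \boldsymbol\gamma \, dx \, dy ,$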

for the hybrid assumed displacement formulation (Tong 1970). The
admissibility requirements are $C^0$ continuity for $\boldsymbol\theta$ and $\tilde w$ at inter-element
boundaries and the enforcement of prescribed rigid constraint(s) for $\boldsymbol\theta$ and
$\tilde w$. There is no restriction on $w$. The functional has three field variables
$\boldsymbol\theta$, $\tilde w$ and $w$. One can assume

$\boldsymbol\theta = \mathbf h_\theta \mathbf q_\theta , \qquad \tilde w = \mathbf h_{\tilde w} \mathbf q_{\tilde w} , \qquad w = \mathbf h_w \mathbf q_w$

to derive the element matrices from Eq. (20.3:1) based on the new $\Pi_{se}$. The
stability criterion is

(6) n w >nQ, 
which is similar to Eq. (20.3:9). 

Problem 20.7. Repeat Problem 20.3 by deriving the corresponding hybrid 
displacement elements. 

Remarks. One often judges a thick plate element based on its performance
in thin plate applications. The 9-node Lagrangian and the heterosis
elements (9-node for $\boldsymbol\theta$ and 8-node for $w$) with selectively reduced
integration perform well in examples of clamped and simply supported square
thin plates with rectangular meshes (Zienkiewicz and Taylor 1989, Hughes
1984). The results deteriorate rapidly when the mesh is distorted to include
non-rectangular elements. Mixed (field-consistent) and discrete constraint
methods are alternatives for eliminating locking for general triangular and
quadrilateral elements. The field-consistent and the discretely-constrained
elements discussed in the previous section, and the mixed triangular
elements developed by Xu (1986), Arnold and Falk (1987), Zienkiewicz and
Lefebvre (1988) all perform reasonably well. Locking can be removed using
hybrid formulations (Pian 1993). General hybrid approaches have not been
fully explored.

The thick plate theory requires three boundary conditions while the
thin plate theory needs only two. We may expect the two theories to give
different deformations for different boundary conditions when the plate is
thick ($l/t$ of order 10, where $l$ is a dimension of the plate). Generally the
thick plate theory gives considerably larger deflections. For plates with $l/t$
of the order 100 or more, the thick element deflections generally converge
to the Kirchhoff plate results for hard support conditions (specification of
$w$, $M_{nn}$ and $\theta_s$), but to slightly higher values for soft support conditions
(prescription of $w$, $M_{nn}$ and $M_{ns}$). Babuska and Scapolla (1989) solved a
rhombic plate of $l/t = 100$ as a three-dimensional structure with support
conditions of the soft type and found that the result is very close to the
thick plate solution with soft support.


20.5. SHELL AS AN ASSEMBLY OF PLATE ELEMENTS 

A shell, like a plate, is a particular form of three-dimensional solid whose
thickness is small compared with its other dimensions. The use of general
3-dimensional finite elements in thin shell applications usually encounters
'locking' (see Sec. 18.20) unless the element size is of the order of the
thickness. The Gaussian curvature (i.e., the product of the principal
curvatures) of the midsurface of an unloaded shell is either zero (as in a
cylinder), positive (as in an ellipsoid or spheroid) or negative (as in a
hyperboloidal shell). Due to the curvature of the midsurface, the stress
resultants can resist pressure normal to the middle surface. In fact, this
mechanism is very efficient and is utilized to carry a major part of the
external load in engineering applications of shells. In the linearized plate
theory, by contrast, a flat plate supports external loads through bending.

The combination of stretching and bending plus the complexities of the
curved geometry make the derivation of two-dimensional governing equations
for shells difficult and lead to various shell theories. Modeling the
different shell theories results in different shell elements and is
a subject of continued research. In this section, we shall consider the
approximation of a shell as an assembly of flat elements and thus avoid the
complication induced by the curvatures of the shell surface. In this model,
the flat plate assembly approximates the lateral load carrying mechanism
of shells through the interaction of stress resultants between adjacent
elements. Through the discrete angle at the junction of neighboring elements,
part of the midplane stress resultants of an element is transmitted to its
immediate neighbors as transverse shear force to resist the lateral load. As
the size of the subdivision decreases, the geometry converges. Intuitively, we
expect that the flat-element assembly behaves like the shell in the limit.

One advantage of treating shells as an assemblage of flat elements is
that no new elements are needed. An added advantage is the ease of coupling
with edge beams and stiffeners in practical applications. Note, however,
that flat elements are not compatible in shell applications if the order of
the polynomials used for the 'in-plane' displacements differs from that for
the out-plane displacement.

A general shell can always be divided into a triangular mesh. The vertices
of a triangle are always co-planar. For meshes involving elements of
more than three nodes, the elements may not be flat. Only some shells
such as cylinders or spheroids, which can be covered by orthogonal
curvilinear coordinates, are representable by flat quadrilaterals. (At the
poles of a spheroid, triangles are used. In the finite element method, a
triangle can be considered as a degenerated quadrilateral.) The number of
surfaces totally covered by orthogonal curvilinear coordinates is known to be
8. The analysis of such shells (cylinders and spheres) using flat
quadrilateral elements met early success. For more general surfaces, due to
the poor bending performance of the early triangular plate elements, shell
analysis with triangular elements was not satisfactory. Current improved
plate elements can represent the behavior of general shells well.

The flat elements in shell applications are subjected to simultaneous
in-plane stretching and out-plane bending. Thus, we must include both
deformations in the energy expression. Let the local coordinates $x$ and $y$
be in the midplane of the plate element and $z$ be normal to the plane. The
functional is the sum of the potential energy of membrane stretching and
plate bending, i.e.,

(1)    $\Pi_e = \Pi_{Be} + \Pi_{Me} ,$

where $\Pi_{Be}$ is the bending potential energy given in Eq. (20.1:10) based on
the Kirchhoff theory, or Eq. (20.2:5) for the Reissner-Mindlin theory, and

$\Pi_{Me} = \Pi_{me} - \Pi_{mfe} ,$

where

$\Pi_{me} = \int_{A_e} \frac{t}{2} (\boldsymbol\partial_e \mathbf u)^T \mathbf D_e (\boldsymbol\partial_e \mathbf u) \, dA ,$

$\Pi_{mfe} = \int_{A_e} t \, \mathbf u^T \mathbf b \, dA + \int_{\partial A_{\sigma e}} t \, \mathbf u^T \bar{\mathbf T} \, ds .$

In the equations above, $\mathbf u$ is the in-plane displacement, $\Pi_{Me}$ is the
membrane potential energy of the element, which is the same as $\Pi_e$ of
Eq. (18.14:6) except for the factor $t$ to reflect the thickness of the element,
and $\mathbf D_e$ is the elastic modulus matrix for plane stress. Note that $\Pi_{me}$ is
the membrane strain energy and $\Pi_{mfe}$ is the external work associated with
membrane action.

Following the earlier derivation, we can write the functional in the
matrix form

(2)    $\Pi_e = \tfrac{1}{2} \mathbf q_M^T \mathbf k_M \mathbf q_M + \tfrac{1}{2} \mathbf q_B^T \mathbf k_B \mathbf q_B - \mathbf q_M^T \mathbf f_M - \mathbf q_B^T \mathbf f_B ,$

where the subscripts $M$ and $B$ denote membrane and bending actions,
respectively. The column matrix $\mathbf q_M$ is the nodal parameter matrix of the
in-plane displacements $u$ and $v$. The matrix $\mathbf q_B$ is the nodal parameter
matrix of the out-plane displacement $w$ and its derivatives $w_{,x}$ and $w_{,y}$
for thin plates, or those of $w$, $\theta_x$ and $\theta_y$ for thick plates. In subsequent
discussion, for expediency, we will also use $\theta_x$ and $\theta_y$ to denote $w_{,x}$ and
$w_{,y}$ for thin plates. The element matrices $\mathbf k_M$ and $\mathbf f_M$ are $\mathbf k$ and $\mathbf f$ of
Eqs. (18.14:13) and (18.14:14) multiplied by the plate thickness $t$, while
$\mathbf k_B$ and $\mathbf f_B$ are given in Eqs. (20.1:16) and (20.1:17) for thin plates, and in
Eq. (20.3:8) or derived from Eqs. (20.2:6)-(20.2:8) for thick plates. Let us
introduce an additional nodal parameter matrix $\mathbf q_{\theta z}$ $(= [\theta_{z1} \; \theta_{z2} \; \ldots]^T)$,
which is associated with $\theta_z$, the rotation about the $z$-axis (normal to the
midplane of the plate), and define $\mathbf q$ as the unknown parameter matrix of the
element such that

(3)    $\mathbf q = [\mathbf q_M^T \;\; \mathbf q_B^T \;\; \mathbf q_{\theta z}^T]^T .$

Now each node has six degrees-of-freedom, three translations and three
rotations. The rotation about the normal generally does not appear in
plate equations but is needed in shell applications. We can write Eq. (2) in
the standard matrix form

(4)    $\Pi_e = \tfrac{1}{2} \mathbf q^T \mathbf k_{MB} \mathbf q - \mathbf q^T \mathbf f_{MB} ,$

where the matrices $\mathbf k_{MB}$ and $\mathbf f_{MB}$ are defined as

(5)    $\mathbf k_{MB} = \begin{bmatrix} \mathbf k_M & \mathbf 0 & \mathbf 0 \\ \mathbf 0 & \mathbf k_B & \mathbf 0 \\ \mathbf 0 & \mathbf 0 & \mathbf 0 \end{bmatrix} , \qquad \mathbf f_{MB} = \begin{bmatrix} \mathbf f_M \\ \mathbf f_B \\ \mathbf 0 \end{bmatrix} ,$

where the bold '$\mathbf 0$' denotes a null matrix. Note that membrane and
bending actions within the element are uncoupled and that the rotation $\theta_z$
does not contribute to stiffness. This leads to a singular matrix for the
system if all elements meeting at a node are coplanar.
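
The block structure of Eq. (5) and the resulting zero stiffness for $\theta_z$ can be illustrated with a small sketch. The $2 \times 2$ blocks used in the usage note are placeholders, not actual membrane or bending stiffness matrices, and the helper names are hypothetical.

```python
def block_diagonal(*blocks):
    """Assemble square blocks (lists of lists) into one block-diagonal matrix."""
    size = sum(len(block) for block in blocks)
    out = [[0.0] * size for _ in range(size)]
    offset = 0
    for block in blocks:
        for i, row in enumerate(block):
            for j, value in enumerate(row):
                out[offset + i][offset + j] = value
        offset += len(block)
    return out


def zero_stiffness_dofs(k, tol=1e-12):
    """Return indices whose entire row and column vanish (no stiffness)."""
    n = len(k)
    return [i for i in range(n)
            if all(abs(k[i][j]) <= tol and abs(k[j][i]) <= tol
                   for j in range(n))]
```

With placeholder blocks `k_M = [[2.0, -1.0], [-1.0, 2.0]]`, `k_B = [[4.0, 1.0], [1.0, 3.0]]` and a null $2 \times 2$ block for the normal rotations, `zero_stiffness_dofs(block_diagonal(k_M, k_B, null))` flags the last two degrees-of-freedom. Those are the ones that make the assembled system singular when all elements at a node are coplanar.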


Coordinate Transformation. In the previous discussion, $x$, $y$ are the
in-plane coordinates and $z$ the out-plane coordinate with respect to the
midplane of the plate element. All generalized coordinates are therefore
also referred to this local coordinate frame. Transformation of the local
generalized coordinates to a common global coordinate system is necessary
before assembling the elements. We shall establish the transformation rule
below. Let the global coordinates be $\bar x$, $\bar y$, $\bar z$ with unit base vectors
$\bar{\mathbf e}_x$, $\bar{\mathbf e}_y$, $\bar{\mathbf e}_z$, the local coordinates be $x$, $y$, $z$ with unit base vectors
$\mathbf e_x$, $\mathbf e_y$, $\mathbf e_z$, and the position of node $i$ be specified by the global
coordinates $(\bar x_i, \bar y_i, \bar z_i)$. We choose the first node (an arbitrary choice) of
the element as the origin of the local coordinates and establish the
transformation from the global to the local coordinates in the form (see
Sec. 2.8)

$[x \;\; y \;\; z]^T = \mathbf R (\bar{\mathbf x} - \bar{\mathbf x}_1) ,$

where

$\mathbf R = \begin{bmatrix} R_{xx} & R_{xy} & R_{xz} \\ R_{yx} & R_{yy} & R_{yz} \\ R_{zx} & R_{zy} & R_{zz} \end{bmatrix}$

is a rotation matrix with $\mathbf R^{-1} = \mathbf R^T$. The base vectors of the two coordinate
systems are related by

$\begin{bmatrix} \mathbf e_x \\ \mathbf e_y \\ \mathbf e_z \end{bmatrix} = \mathbf R \begin{bmatrix} \bar{\mathbf e}_x \\ \bar{\mathbf e}_y \\ \bar{\mathbf e}_z \end{bmatrix} .$


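Let the $x$-axis be along the side from node 1 to node 2 (an arbitrary
choice also), then

$\mathbf e_x = x_{21} \bar{\mathbf e}_x + y_{21} \bar{\mathbf e}_y + z_{21} \bar{\mathbf e}_z ,$

where

$x_{ji} = (\bar x_j - \bar x_i)/l_{ji} , \qquad y_{ji} = (\bar y_j - \bar y_i)/l_{ji} , \qquad z_{ji} = (\bar z_j - \bar z_i)/l_{ji} ,$

in which $l_{ji} = \sqrt{(\bar x_j - \bar x_i)^2 + (\bar y_j - \bar y_i)^2 + (\bar z_j - \bar z_i)^2}$ is the distance between
nodes $i$ and $j$. Since $\mathbf e_x = R_{xx} \bar{\mathbf e}_x + R_{xy} \bar{\mathbf e}_y + R_{xz} \bar{\mathbf e}_z$, one has

$R_{xx} = x_{21} , \qquad R_{xy} = y_{21} , \qquad R_{xz} = z_{21} .$

Let $\mathbf e$ be a unit vector pointing from node 1 to node 3,

$\mathbf e = x_{31} \bar{\mathbf e}_x + y_{31} \bar{\mathbf e}_y + z_{31} \bar{\mathbf e}_z .$

Assuming nodes 1, 2, 3 are not collinear, we find $\mathbf e_z$ from the cross product
of $\mathbf e_x$ and $\mathbf e$,

$\mathbf e_z = \frac{\mathbf e_x \times \mathbf e}{|\mathbf e_x \times \mathbf e|} = R_{zx} \bar{\mathbf e}_x + R_{zy} \bar{\mathbf e}_y + R_{zz} \bar{\mathbf e}_z ,$

where

$R_{zx} = \frac{y_{21} z_{31} - y_{31} z_{21}}{\Delta} , \qquad R_{zy} = \frac{z_{21} x_{31} - z_{31} x_{21}}{\Delta} , \qquad R_{zz} = \frac{x_{21} y_{31} - x_{31} y_{21}}{\Delta} ,$

$\Delta = \sqrt{(y_{21} z_{31} - y_{31} z_{21})^2 + (z_{21} x_{31} - z_{31} x_{21})^2 + (x_{21} y_{31} - x_{31} y_{21})^2} .$

Finally, we have

$\mathbf e_y = \mathbf e_z \times \mathbf e_x = R_{yx} \bar{\mathbf e}_x + R_{yy} \bar{\mathbf e}_y + R_{yz} \bar{\mathbf e}_z ,$

where

$R_{yx} = R_{zy} R_{xz} - R_{xy} R_{zz} , \qquad R_{yy} = R_{zz} R_{xx} - R_{zx} R_{xz} , \qquad R_{yz} = R_{zx} R_{xy} - R_{xx} R_{zy} .$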
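
The construction of $\mathbf R$ from three non-collinear nodes can be sketched in a few lines. This is a minimal illustration of the steps above, not code from the text: the function names are hypothetical, and the rows of the returned matrix are the local base vectors $\mathbf e_x$, $\mathbf e_y$, $\mathbf e_z$ expressed in global components.

```python
import math


def _sub(a, b):
    return [a[k] - b[k] for k in range(3)]


def _cross(a, b):
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]


def _unit(v):
    norm = math.sqrt(sum(c * c for c in v))
    if norm < 1e-14:
        raise ValueError("nodes 1, 2, 3 are coincident or collinear")
    return [c / norm for c in v]


def local_rotation_matrix(p1, p2, p3):
    """Rows are e_x (node 1 to node 2), e_y = e_z x e_x, and e_z."""
    e_x = _unit(_sub(p2, p1))
    e = _unit(_sub(p3, p1))          # unit vector from node 1 to node 3
    e_z = _unit(_cross(e_x, e))      # normal to the element plane
    e_y = _cross(e_z, e_x)           # already of unit length
    return [e_x, e_y, e_z]


def to_local(R, point, origin):
    """Local coordinates [x, y, z] = R (point - origin)."""
    d = _sub(point, origin)
    return [sum(R[i][k] * d[k] for k in range(3)) for i in range(3)]
```

As a check, the three nodes used to build $\mathbf R$ must have zero local $z$, node 2 must lie on the local $x$-axis, and $\mathbf R \mathbf R^T$ must be the identity.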

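Let $\mathbf q_i$ be the generalized coordinate matrix of the $i$th node defined as

$\mathbf q_i = [u_i \;\; v_i \;\; w_i \;\; \theta_{xi} \;\; \theta_{yi} \;\; \theta_{zi}]^T ,$

and $\bar{\mathbf q}_i$ the corresponding column matrix referenced to the global coordinate
system. The components of the two column matrices are related through
rotation:

$\mathbf q_i = \mathcal R \bar{\mathbf q}_i , \qquad \mathcal R = \begin{bmatrix} \mathbf R & \mathbf 0 \\ \mathbf 0 & \mathbf R' \end{bmatrix} ,$

where

$\mathbf R' = \begin{bmatrix} R_{yy} & -R_{yx} & -R_{yz} \\ -R_{xy} & R_{xx} & R_{xz} \\ -R_{zy} & R_{zx} & R_{zz} \end{bmatrix} .$

The matrix $\mathbf R'$ is modified from $\mathbf R$ to account for mapping two local
rotations to three components in the global system and the fact that $\theta_y$, $-\theta_x$
are, respectively, the rotations about the $x$, $y$-axes. If $\mathbf q$ is arranged in the
order that $\mathbf q = [\mathbf q_1^T \;\; \mathbf q_2^T \;\; \cdots]^T$, then

$\mathbf q = \begin{bmatrix} \mathbf q_1 \\ \mathbf q_2 \\ \vdots \end{bmatrix} = \begin{bmatrix} \mathcal R & \mathbf 0 & \cdots \\ \mathbf 0 & \mathcal R & \cdots \\ \vdots & \vdots & \ddots \end{bmatrix} \begin{bmatrix} \bar{\mathbf q}_1 \\ \bar{\mathbf q}_2 \\ \vdots \end{bmatrix} = \begin{bmatrix} \mathcal R & \mathbf 0 & \cdots \\ \mathbf 0 & \mathcal R & \cdots \\ \vdots & \vdots & \ddots \end{bmatrix} \bar{\mathbf q} .$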
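When the parameter matrices are expressed in terms of those in the global
coordinates, Eq. (4) becomes

(6)    $\Pi_e = \tfrac{1}{2} \bar{\mathbf q}^T \bar{\mathbf k}_{MB} \bar{\mathbf q} - \bar{\mathbf q}^T \bar{\mathbf f}_{MB} ,$

where

$\bar{\mathbf k}_{MB} = \begin{bmatrix} \mathcal R^T & \mathbf 0 & \cdots \\ \mathbf 0 & \mathcal R^T & \cdots \\ \vdots & \vdots & \ddots \end{bmatrix} \mathbf k_{MB} \begin{bmatrix} \mathcal R & \mathbf 0 & \cdots \\ \mathbf 0 & \mathcal R & \cdots \\ \vdots & \vdots & \ddots \end{bmatrix} , \qquad \bar{\mathbf f}_{MB} = \begin{bmatrix} \mathcal R^T & \mathbf 0 & \cdots \\ \mathbf 0 & \mathcal R^T & \cdots \\ \vdots & \vdots & \ddots \end{bmatrix} \mathbf f_{MB} .$

The element matrices $\bar{\mathbf k}_{MB}$, $\bar{\mathbf f}_{MB}$ can be readily assembled. If the global
coordinates are not the same for all nodes of the element, a situation that
often occurs at boundaries with mixed boundary conditions, one must use the
appropriate rotation matrix $\mathcal R$ for the specific nodes involved.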

A difficulty arises if all the elements meeting at a node are co-planar.
This occurs at straight boundaries along the cylindrical axis. Due to the zero
stiffness for $\theta_z$ as shown in Eq. (5), the assembled stiffness matrix is still
singular after boundary constraints are imposed. When this occurs, the
formulation is still valid because the generalized force corresponding to
this degree-of-freedom is also zero. However, the singularity of the matrix
poses a problem in numerical computation. If the global and local coordinate
directions are different, detection of such a singularity will be more
involved.

Drilling Degree of Freedom. When singularity occurs, one may re¬ 
move the degrees-of-freedom associated with the normal rotation or add 
arbitrary stiffness to those degrees-of-freedom. An alternative is to mod¬ 
ify the element formulation such that the rotation about a normal to the 
midplane generates strain whether co-planar or not. Such a rotation is 
called the drilling degree-of-freedom. Abu-Gazaleh (1965) initially pro¬ 
posed the concept. Early results were disappointing (Irons and Ahmad 
1980), but it has become a topic of much study (Robinson 1980, Mohr 
1982, Bergan and Felippa 1985, Cook 1986, Allman 1988, MacNeal and 
Harder 1988, Hughes and Brezzi 1989, Iura and Atluri 1992, Chen and Pan 
1992, Sze and Ghali 1993). One can use the drilling degree-of-freedom as a 
tool to improve the performance of low order membrane and solid elements. 

For a 4-node quadrilateral, Cook (1986) used drilling degrees-of-freedom
$\theta_z$ to replace the in-plane displacements normal to the sides of the element
at the midside nodes as follows:

(7)    $u = \sum_{i=1}^{4} h_i(\xi, \eta) u_i + \sum_{i=1}^{4} h_{i+4}(\xi, \eta) (y_j - y_i) (\theta_{zj} - \theta_{zi}) / 8 ,$
       $v = \sum_{i=1}^{4} h_i(\xi, \eta) v_i + \sum_{i=1}^{4} h_{i+4}(\xi, \eta) (x_i - x_j) (\theta_{zj} - \theta_{zi}) / 8 ,$

where $h_i$ are the shape functions for the 4-node quadrilateral
[Eq. (18.13:2)], and $h_{i+4}$ is associated with the midside node of an 8-node
quadrilateral between nodes $i$ and $j$, with $i$, $j$, $k$ being the cyclic
permutations of 1, 2, 3, 4. The shape functions for midside nodes are given
in Table 18.3:1. This choice of shape functions for the $\theta_z$'s is to make the
in-plane shear strain constant and consistent with the deformation of
in-plane bending for one-dimensional problems. The element now has 3
degrees-of-freedom per node for in-plane displacements (two translations and
$\theta_z$) and thus 12 degrees-of-freedom in total as opposed to 8 for the regular
4-node element without drilling degrees-of-freedom. In shell applications,
three components of rotation at a node are usually needed. Thus the added
drilling freedom does not increase the number of degrees-of-freedom per node.
While the quadrilateral element with drilling degrees-of-freedom gives good
results in plane stress analyses, a corresponding 3-node triangular element
performed poorly unless the tangential midside displacements are included.
This makes the element equivalent to the 6-node triangle.

Elements with drilling freedoms as defined in Eq. (7) have spurious
modes. One is zero translation and constant normal rotation ($u_i = v_i = 0$,
$i = 1, 2, 3, 4$, and $\theta_{z1} = \cdots = \theta_{z4}$). This mode is global in a plane.
The element also has a second spurious mode ($\theta_{z1} = -\theta_{z2} = \theta_{z3} = -\theta_{z4}$).
The last mode is non-communicable. There could be even more spurious
modes if reduced integration is employed. One way of suppressing the
spurious modes is to introduce artificial rotational stiffness by adding


(8)    $\Pi_d = \int_{A} \alpha t G \left[ \theta_z - \frac{1}{2} \left( \frac{\partial v}{\partial x} - \frac{\partial u}{\partial y} \right) \right]^2 dA$

to Eq. (1), where $G$ is the shear modulus and $\alpha$ a small constant. For large
values of $\alpha$, the solution approaches that of an element without drilling
freedoms. MacNeal and Harder (1988) gave $10^{-6}$ as the threshold of $\alpha$ for
a spherical shell with a radius-to-thickness ratio of 250. In practice, one
often uses the one-point integration rule to evaluate $\Pi_d$. Due to the shell
curvature, the drilling degree-of-freedom in a curved shell element couples
the normal rotation to bending strains. The coupling effect on the bending
performance requires further study.
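
The displacement interpolation of Eq. (7) and its first spurious mode can be illustrated with a short sketch. Two things are assumptions here: the corner numbering (counterclockwise from $(-1,-1)$) and the standard serendipity midside functions, which stand in for Table 18.3:1. Indices are zero-based and the function names are illustrative.

```python
CORNERS = [(-1, -1), (1, -1), (1, 1), (-1, 1)]
# Midside node i+4 lies between corners i and i+1 (cyclic), zero-based here.
MIDSIDES = [(0, -1), (1, 0), (0, 1), (-1, 0)]


def corner_shape(i, xi, eta):
    """Bilinear shape function of corner i."""
    xi_i, eta_i = CORNERS[i]
    return 0.25 * (1 + xi_i * xi) * (1 + eta_i * eta)


def midside_shape(i, xi, eta):
    """Serendipity (8-node) shape function of the midside node after corner i."""
    xi_m, eta_m = MIDSIDES[i]
    if xi_m == 0:
        return 0.5 * (1 - xi * xi) * (1 + eta_m * eta)
    return 0.5 * (1 + xi_m * xi) * (1 - eta * eta)


def drilling_displacement(xy, u, v, theta_z, xi, eta):
    """In-plane displacement (u, v) at (xi, eta) following Eq. (7)."""
    disp_u = sum(corner_shape(i, xi, eta) * u[i] for i in range(4))
    disp_v = sum(corner_shape(i, xi, eta) * v[i] for i in range(4))
    for i in range(4):
        j = (i + 1) % 4
        jump = (theta_z[j] - theta_z[i]) / 8
        h_mid = midside_shape(i, xi, eta)
        disp_u += h_mid * (xy[j][1] - xy[i][1]) * jump
        disp_v += h_mid * (xy[i][0] - xy[j][0]) * jump
    return disp_u, disp_v
```

With zero nodal translations and equal $\theta_z$ at all four nodes, every difference $\theta_{zj} - \theta_{zi}$ vanishes and the displacement field is identically zero. This is the first spurious mode described above, and it is the mode the penalty term of Eq. (8) is meant to suppress.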

One can use the drilling freedom to improve the in-plane deformation
of low order membrane and solid elements in plate and shell applications.
One derives the element matrices by including $\Pi_d$ as defined in Eq. (8) for
plane elasticity, or



for 3-dimensional elasticity, where $\alpha_x$, $\alpha_y$, $\alpha_z$ are constants. The optimum
values for these constants depend on the aspect ratio of the element. Atluri
(1980) introduced variational functionals that include rotations as
dependent variables. Ibrahimbegovic (1990), Iura and Atluri (1992a), Chen and
Pan (1992), and Sze and Ghali (1993) have studied the use of variational
functionals to formulate elements including drilling freedoms.

Employing drilling freedoms increases the number of degrees-of-freedom 
per node but improves the accuracy of 3- and 4-node elements. The use 
of displacements as defined in Eq. (7) (MacNeal 1994) improves the accu¬ 
racy in strains and provides substantial relief from shear locking, but has 
little effect on dilatation locking. Pian (1993) and MacNeal (1994) gave
comparisons of the performance of various plane elements with and without
drilling freedoms.


Four-Node Flat Element Approximation. To approximate a non-coplanar
4-node shell midsurface by a 4-node flat element, one often sets
the midplane of the element parallel to the two diagonals of the 4-node
midsurface at midway between the diagonals. Thus the nodes of the shell
lie on the normal lines through the nodes of the element and are at equal
distance from the midplane. Let the coordinates of the nodes of the shell be
$(x_i, y_i, z_i)$; then the absolute values of all $z_i$ are equal and the
coordinates of the nodes of the element are $(x_i, y_i, 0)$. We assume that
node $i$ of the element and node $i$ of the shell move together as a rigid body
and the two points have the same rotation parameters. The displacement
parameters $(u_i', v_i', w_i')$ of the shell are related to those of the element by

$\mathbf u_i' = \mathbf u_i - z_i [\theta_{xi} \;\; \theta_{yi} \;\; 0]^T , \qquad i = 1, \ldots, 4 .$

Defining the nodal parameter matrices as

$\mathbf q_i' = [u_i' \;\; v_i' \;\; w_i' \;\; \theta_{xi} \;\; \theta_{yi} \;\; \theta_{zi}]^T , \qquad \mathbf q_i = [u_i \;\; v_i \;\; w_i \;\; \theta_{xi} \;\; \theta_{yi} \;\; \theta_{zi}]^T ,$

we have

(9)    $\mathbf q = \mathbf G \mathbf q' , \qquad \mathbf q = \begin{bmatrix} \mathbf q_1 \\ \mathbf q_2 \\ \vdots \end{bmatrix} , \qquad \mathbf q' = \begin{bmatrix} \mathbf q_1' \\ \mathbf q_2' \\ \vdots \end{bmatrix} , \qquad \mathbf G = \mathrm{diag} [\mathbf G_1 \;\; \mathbf G_2 \;\; \mathbf G_3 \;\; \mathbf G_4] ,$

where $(G_{14})_i = (G_{25})_i = z_i$ and all other $(G_{jk})_i = \delta_{jk}$. A substitution of
Eq. (9) into Eq. (4) yields

$\Pi_e = \tfrac{1}{2} \mathbf q'^T \mathbf k_{MB}' \mathbf q' - \mathbf q'^T \mathbf f_{MB}' ,$

where

$\mathbf k_{MB}' = \mathbf G^T \mathbf k_{MB} \mathbf G , \qquad \mathbf f_{MB}' = \mathbf G^T \mathbf f_{MB} .$

In this case, $\mathbf k_{MB}$ and $\mathbf f_{MB}$ are replaced by $\mathbf k_{MB}'$ and $\mathbf f_{MB}'$ in Eq. (6). As
such a transformation may accidentally excite the bending modes, one may
replace the induced moments in $\mathbf f_{MB}'$ with vertical couples (the nodal force
times $z_i$ at the $i$th node).
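
The nodal offset transformation of Eq. (9) can be sketched for a single node as follows. The entries $(G_{14})_i = (G_{25})_i = z_i$ follow from $\mathbf u_i = \mathbf u_i' + z_i [\theta_{xi} \;\; \theta_{yi} \;\; 0]^T$. The helper names are hypothetical and the matrices are plain nested lists.

```python
def node_offset_matrix(z_i):
    """6 x 6 matrix G_i: identity plus z_i in positions (1,4) and (2,5)."""
    G = [[1.0 if r == c else 0.0 for c in range(6)] for r in range(6)]
    G[0][3] = z_i    # u = u' + z_i * theta_x
    G[1][4] = z_i    # v = v' + z_i * theta_y
    return G


def mat_vec(A, x):
    """Matrix-vector product A x."""
    return [sum(a * b for a, b in zip(row, x)) for row in A]


def transpose(A):
    return [list(col) for col in zip(*A)]
```

Applying `transpose(node_offset_matrix(z_i))` to a nodal load with in-plane forces only produces moments equal to the forces times $z_i$. These are the induced moments that the text suggests replacing by vertical couples.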

One may improve the flat element approximation by shallow shell
elements where the element plane is just the projection of the shallow shell.
The displacements $(u, v, w)$ and rotations $(\theta_y, -\theta_x, \theta_z)$ are defined as the
tangential and normal components of the displacement and rotation of the
shell surface. The strains are defined in terms of their derivatives with
respect to the coordinates of the projection plane. The in-plane strains
involve both the tangential and normal displacements. A formulation of
shallow shell elements was given by Cowper et al. (1970). The shallow shell
formulation will introduce membrane locking. Because of the additional
complication of shallow shell elements, they are not often used in practice.

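Problem 20.8. Using the definition of the nodal parameter matrix $\mathbf q_i =
[u_i \;\; v_i \;\; w_i \;\; \theta_{xi} \;\; \theta_{yi} \;\; \theta_{zi}]^T$ for node $i$, determine the corresponding $\mathbf k$ and $\mathbf f$
in terms of the components of $\mathbf k_b$, $\mathbf k_s$, $\mathbf f_b$ and $\mathbf f_s$ of Sec. 20.2.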

Problem 20.9. A common choice of the local coordinates for a flat quadri¬ 
lateral is to use a bisector of the angles between the element’s diagonals as the 
x- or y- axes. Find the transformation matrix R (in terms of the nodal coordi¬ 
nates referred to the global system) that transforms the global coordinates to the 
local ones. 

Problem 20.10. Physically, there is a difference between a shell formed
by flat surfaces and the approximation of the assembly of flat plate elements
(Fig. P20.10). For the shell of flat surfaces, the cross-section is a
quadrilateral as shown. The line connecting the top and bottom surfaces along
the $\zeta$ direction is not generally normal to the midplane. Such differences
can be significant for thick shells. Consider the middle element in
Fig. P20.10(b): (a) determine the coordinate transformation from the natural
coordinates $(\xi, \zeta)$ to the $x$, $z$ Cartesian coordinates; (b) find the angle
between the $\zeta$- and $\xi$-directions; (c) assuming that the displacements $u$, $w$
are bilinear in $\xi$ and $\zeta$, determine the strains $e_{xx}$, $e_{zz}$, $e_{xz}$ in terms of
$\xi$, $\zeta$; and (d) find the difference between the strains of the quadrilateral
element and those of the flat plate element [Fig. P20.10(a)].

Fig. P20.10. Difference between shell of flat surfaces and assembly of flat
elements. (a) Assembly of flat plate elements; (b) shell with flat surfaces.

20.6. GENERAL SHELL ELEMENTS 

Conceptually, shells can be modeled by curved 3-dimensional elements
of reduced dimension in the shell thickness direction. Unfortunately, simple
reduction of the thickness dimension leads to overly stiff elements in
thin shell applications unless the in-plane dimensions of the element are
of the same order as the thickness dimension. To avoid such difficulty,
as for plates, finite element formulations were introduced to directly model
the Kirchhoff thin shells and the Reissner-Mindlin moderately thick shells.
Reviews of the subject are given by Gilewski and Radwanska (1991) and
Ibrahimbegovic (1997). Atluri and Pian (1972) are among the many who
developed the early general thin shell elements based on variational
approaches. Chang et al. (1989) developed a 9-node mixed Lagrange shell
element. Ahmad et al. (1970) constructed the early curved shell element
following the Reissner-Mindlin assumptions for thick plates. The Ahmad
element has severe locking problems in thin shell applications.

The finite element method for general thin shells is a difficult subject.
Due to the coupling of bending and stretching, it is harder than in plate
applications to satisfy the compatibility requirements along the
interelement boundaries. The coupling causes further locking, especially for
lower order elements. Thus a general shell element is either complicated, or
simple but accurate only for limited shell configurations.

Direct discretization of 3-dimensional solids in terms of midsurface nodal
variables is attractive for its simplicity. It does not depend on the
various forms of shell theories. With attention to the thickness deformation
and transverse shears, one can effectively remove locking in thin shell
applications. Elements so derived are called degenerated solid shell
elements. In subsequent discussion, we shall develop the degeneration
process.

Shell Geometry and Coordinates. To account for the double curvatures,
the plane stress conditions and the inextensibility of the fibers through the
shell thickness, we shall use five coordinate systems for different purposes.
They are: (1) the local coordinates with Cartesian unit base vectors
$\mathbf e_x$, $\mathbf e_y$, $\mathbf e_z$ for the geometry description of the element; (2) the natural
coordinates with covariant base vectors $\mathbf g_\xi$, $\mathbf g_\eta$, $\mathbf g_\zeta$ for
strain-displacement relations;

Fig. 20.6:1. Curved shell element (panel a: quadrilateral).

(3) the fiber coordinates with unit base vectors ef,e^,e t for fiber rotations; 

(4) the normal coordinates with orthogonal unit base vectors e^,e n ,e^ for 
defining stress strain relations and enforcing plane stress condition; and 

(5) the global coordinates with Cartesian unit base vectors e x ,e y ,e z for 
assembling of elements. Using the tensor transformation rules (Chapter 2), 
one can easily transform quantities from one coordinate system to another. 
The first and last coordinate systems are generally fixed Cartesian coor¬ 
dinates. Strictly speaking, they can be the same. However, retaining a 
local system offers some conveniences especially for mixed boundary value 
problems. The first four systems are local. The second one is usually non- 
orthogonal for general shells. The third one is always and the fourth is 
usually orthogonal and curvilinear. 

Let $x$, $y$, $z$ be the local Cartesian coordinates with unit base vectors $\mathbf e_x$,
$\mathbf e_y$, $\mathbf e_z$. The location of a point in the shell (Fig. 20.6:1) is

(1)    $\mathbf x = \mathbf x_0(\xi, \eta) + \zeta \mathbf g_\zeta(\xi, \eta) = x_0 \mathbf e_x + y_0 \mathbf e_y + z_0 \mathbf e_z + \zeta \mathbf g_\zeta(\xi, \eta) ,$

where the subscript 0 denotes the value at the reference surface, $\mathbf x_0(\xi, \eta)$
is the position vector of the reference surface and $\mathbf g_\zeta$ is the covariant base
vector in the direction of a fiber through the point at $\xi$, $\eta$ on the
reference surface. In finite element modeling, the relations between the
local Cartesian and the natural coordinates are defined as follows:

(2)    $\mathbf x_0(\xi, \eta) = \sum_i h_i(\xi, \eta) (\mathbf x_0)_i ,$

(3)    $\mathbf g_\zeta = \sum_i h_i(\xi, \eta) (t \mathbf s)_i / 2 ,$

in which the $h$'s are the shape functions defined in Chapter 18, $i$ sums over
all the nodes defining the element reference surface, the subscript $i$
indicates association with node $i$, e.g., $(\mathbf x_0)_i$ is the position vector of
node $i$ of the reference surface, and $t_i$ and $\mathbf s_i$ are the thickness and the
unit vector in the fiber direction at node $i$ defined as

(4)    $t_i = |\mathbf x_i^+ - \mathbf x_i^-| , \qquad \mathbf s_i = (\mathbf x_i^+ - \mathbf x_i^-) / t_i ,$

in which $\mathbf x_i^+$, $\mathbf x_i^-$ are the position vectors at the top and the bottom surfaces
of the shell at node $i$. The reference surface is usually the neutral surface
of the shell. We shall assume the midsurface as the reference surface and
leave the derivation for the general case to the readers. Thus the natural
coordinates $\xi$, $\eta$, $\zeta$ each range from $-1$ to $1$.

From Eq. (1), we can derive the covariant base vectors [Eq. (2.14:7)] of
the natural coordinates of the shell:

(5) $\mathbf{g}_\xi = \mathbf{x}_{,\xi}$ , $\mathbf{g}_\eta = \mathbf{x}_{,\eta}$ , $\mathbf{g}_\zeta = \mathbf{x}_{,\zeta}$ .


These base vectors are not orthogonal for general shells. 
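
As an illustration of Eqs. (1)-(5), the following is a minimal sketch that evaluates the covariant base vectors of a 4-node shell element from the nodal positions on the top and bottom surfaces. The bilinear shape functions, the node ordering, and all function names are assumptions made for this sketch only; the shape functions actually intended are those of Chapter 18.

```python
def shape(xi, eta):
    """Bilinear shape functions h_i and their xi-, eta-derivatives.

    Assumed node order: (-1,-1), (1,-1), (1,1), (-1,1).
    """
    sx = (-1.0, 1.0, 1.0, -1.0)
    se = (-1.0, -1.0, 1.0, 1.0)
    h = [(1 + a * xi) * (1 + b * eta) / 4 for a, b in zip(sx, se)]
    dh_dxi = [a * (1 + b * eta) / 4 for a, b in zip(sx, se)]
    dh_deta = [b * (1 + a * xi) / 4 for a, b in zip(sx, se)]
    return h, dh_dxi, dh_deta


def combine(weights, vectors):
    """Return sum_i weights[i] * vectors[i] for 3-component vectors."""
    return [sum(w * v[k] for w, v in zip(weights, vectors)) for k in range(3)]


def covariant_base_vectors(top, bottom, xi, eta, zeta):
    """Covariant base vectors g_xi, g_eta, g_zeta of Eq. (5)."""
    # Midsurface nodal positions (x_0)_i and fiber vectors (t s)_i, Eq. (4).
    x0 = [[(p[k] + q[k]) / 2 for k in range(3)] for p, q in zip(top, bottom)]
    ts = [[p[k] - q[k] for k in range(3)] for p, q in zip(top, bottom)]
    # Nodal values of x_0 + zeta (t s)/2, so that x = sum_i h_i * x_nodes[i].
    x_nodes = [[a[k] + zeta * b[k] / 2 for k in range(3)]
               for a, b in zip(x0, ts)]
    h, dh_dxi, dh_deta = shape(xi, eta)
    g_xi = combine(dh_dxi, x_nodes)            # x_{,xi}
    g_eta = combine(dh_deta, x_nodes)          # x_{,eta}
    g_zeta = [c / 2 for c in combine(h, ts)]   # x_{,zeta}, Eq. (3)
    return g_xi, g_eta, g_zeta
```

For a flat rectangular plate the three vectors come out orthogonal and constant; for a curved or tapered element they vary with the natural coordinates and are in general not orthogonal.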

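In the fiber base coordinates, one of the coordinates is along the fiber
direction $\mathbf{g}_\zeta$. The other two are defined to be orthogonal to $\mathbf{g}_\zeta$, otherwise
arbitrary. We choose

(6) $\mathbf{e}_\zeta = \dfrac{\mathbf{g}_\zeta}{|\mathbf{g}_\zeta|}$ , $\mathbf{e}_\eta = \dfrac{\mathbf{e}_\zeta \times \mathbf{g}_\xi}{|\mathbf{e}_\zeta \times \mathbf{g}_\xi|}$ , $\mathbf{e}_\xi = \mathbf{e}_\eta \times \mathbf{e}_\zeta$ ,

as the unit base vectors, where "×" denotes the cross product. Thus, $\mathbf{e}_\zeta = \mathbf{s}_i$
at node i and $\mathbf{e}_\eta$ is parallel to the contravariant base vector $\mathbf{g}^\eta$ of the natural
coordinates. If $\mathbf{g}_\xi$, $\mathbf{g}_\eta$ are orthogonal, $\mathbf{e}_\xi = \mathbf{g}_\xi/|\mathbf{g}_\xi|$ and $\mathbf{e}_\eta = \mathbf{g}_\eta/|\mathbf{g}_\eta|$
coincide in direction with $\mathbf{g}^\xi$, $\mathbf{g}^\eta$, respectively. Using the above, one can
determine the contravariant base vectors of the natural coordinates (Sec. 2.14):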


( 7 ) 


,e = g*> xS < a v = _c . sc 

teellfifcj xg cl ’ |gr;l |«c x g e | ’ |g c | 2 ' 


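Note that $|\mathbf{g}_\eta \times \mathbf{e}_\zeta|^2 = (\mathbf{g}_\eta \cdot \mathbf{g}_\eta)(\mathbf{e}_\zeta \cdot \mathbf{e}_\zeta) - (\mathbf{e}_\zeta \cdot \mathbf{g}_\eta)^2$.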

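The normal base coordinate system has one axis normal to the surfaces
$\zeta$ = constant. The other two base vectors are in the same plane as $\mathbf{g}_\xi$, $\mathbf{g}_\eta$
and are chosen to coincide with the material coordinates of the surfaces
$\zeta$ = constant. For isotropic materials, these two base vectors, other than
being normal to the surfaces $\zeta$ = constant, can be arbitrarily chosen. Similar
to the fiber base coordinates, we define the unit base vectors as

(8) $\hat{\mathbf{e}}_\zeta = \dfrac{\mathbf{g}_\xi \times \mathbf{g}_\eta}{|\mathbf{g}_\xi \times \mathbf{g}_\eta|}$ , $\hat{\mathbf{e}}_\eta = \dfrac{\hat{\mathbf{e}}_\zeta \times \mathbf{g}_\xi}{|\hat{\mathbf{e}}_\zeta \times \mathbf{g}_\xi|}$ , $\hat{\mathbf{e}}_\xi = \hat{\mathbf{e}}_\eta \times \hat{\mathbf{e}}_\zeta$ .

If $\mathbf{g}_\xi$, $\mathbf{g}_\eta$ are orthogonal, then $\mathbf{e}_\xi$, $\mathbf{e}_\eta$ and $\hat{\mathbf{e}}_\xi$, $\hat{\mathbf{e}}_\eta$ coincide.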
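
The two orthonormal triads of Eqs. (6) and (8) can be sketched as follows. This is only an illustration: the function names are hypothetical, and the inputs are assumed to be the covariant base vectors already evaluated at the point of interest.

```python
import math


def cross(a, b):
    """Cross product of two 3-component vectors."""
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]


def unit(a):
    """Return a scaled to unit length."""
    n = math.sqrt(sum(c * c for c in a))
    return [c / n for c in a]


def fiber_triad(g_xi, g_zeta):
    """Fiber base vectors (e_xi, e_eta, e_zeta) of Eq. (6)."""
    e_zeta = unit(g_zeta)               # along the fiber
    e_eta = unit(cross(e_zeta, g_xi))   # orthogonal to the fiber and to g_xi
    e_xi = cross(e_eta, e_zeta)         # completes the right-handed triad
    return e_xi, e_eta, e_zeta


def normal_triad(g_xi, g_eta):
    """Normal base vectors of Eq. (8); the third one is the surface normal."""
    n_zeta = unit(cross(g_xi, g_eta))   # normal to the surface zeta = const
    n_eta = unit(cross(n_zeta, g_xi))
    n_xi = cross(n_eta, n_zeta)
    return n_xi, n_eta, n_zeta
```

The only difference between the two constructions is the choice of the third axis: the fiber direction in Eq. (6) and the surface normal in Eq. (8). When the fiber is normal to the surface and $\mathbf{g}_\xi$, $\mathbf{g}_\eta$ are orthogonal, both functions return the same triad.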


Covariant Strains in Natural Coordinates. We shall derive the covariant
strains in the natural coordinates. Let $\theta_\eta$, $-\theta_\xi$ represent the rotations
of the fiber about the axes along $\mathbf{e}_\xi$, $\mathbf{e}_\eta$, respectively. The displacement
vector can be written as

(9) $\mathbf{u} = \mathbf{u}_0(\xi,\eta) - \zeta t(\theta_\xi\mathbf{e}_\xi + \theta_\eta\mathbf{e}_\eta)/2$ ,

where $\mathbf{u}_0$ is the displacement of the midsurface. The second term denotes
the fiber rotation. Equation (9) allows a fiber to rotate, but not stretch. In
finite element modeling, we represent the displacement by

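(10) $\mathbf{u} = \sum_i h_i(\xi,\eta)(\mathbf{u}_0)_i - \zeta\sum_i h_i(\xi,\eta)[(\theta_\xi\mathbf{e}_\xi + \theta_\eta\mathbf{e}_\eta)t]_i/2$ ,

where the shape functions h's will be different from those in Eq. (2) if
the nodes for interpolating displacements differ from those for defining
the curved shell surface. The interpolation functions of a 9-node element
are given in Table 18.13.1. The first sum in Eq. (10) is the interpolated
displacement of the midsurface. The second sum is that induced by fiber
rotation. An alternative is to represent the displacement by

(10a) $\mathbf{u} = \sum_i h_i(\xi,\eta)(\mathbf{u}_0)_i - \zeta\sum_i h_i(\xi,\eta)[(\theta_\xi\hat{\mathbf{e}}_\xi + \theta_\eta\hat{\mathbf{e}}_\eta)t]_i/2$ ,

with $\theta_\eta$, $-\theta_\xi$ denoting the rotations of the normal about the axes along
$\hat{\mathbf{e}}_\xi$, $\hat{\mathbf{e}}_\eta$, respectively. In this case, there is no need for the fiber base
coordinates.

From Eq. (10), we can derive the deformation gradient,

(11) $\dfrac{\partial\mathbf{u}}{\partial\xi} = \sum_i \dfrac{\partial h_i}{\partial\xi}\left[\mathbf{u}_0 - \dfrac{\zeta t}{2}(\theta_\xi\mathbf{e}_\xi + \theta_\eta\mathbf{e}_\eta)\right]_i$ ,

$\dfrac{\partial\mathbf{u}}{\partial\eta} = \sum_i \dfrac{\partial h_i}{\partial\eta}\left[\mathbf{u}_0 - \dfrac{\zeta t}{2}(\theta_\xi\mathbf{e}_\xi + \theta_\eta\mathbf{e}_\eta)\right]_i$ ,

$\dfrac{\partial\mathbf{u}}{\partial\zeta} = -\sum_i h_i\left[\dfrac{t}{2}(\theta_\xi\mathbf{e}_\xi + \theta_\eta\mathbf{e}_\eta)\right]_i$ .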

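The covariant in-plane strains are

(12) $e_{\xi\xi} = \dfrac{\partial\mathbf{u}}{\partial\xi}\cdot\mathbf{g}_\xi$ , $e_{\eta\eta} = \dfrac{\partial\mathbf{u}}{\partial\eta}\cdot\mathbf{g}_\eta$ , $e_{\xi\eta} = \dfrac12\left(\dfrac{\partial\mathbf{u}}{\partial\xi}\cdot\mathbf{g}_\eta + \dfrac{\partial\mathbf{u}}{\partial\eta}\cdot\mathbf{g}_\xi\right)$ ,

and the covariant transverse shear strains are

(13) $e_{\xi\zeta} = \dfrac12\left(\dfrac{\partial\mathbf{u}}{\partial\zeta}\cdot\mathbf{g}_\xi + \dfrac{\partial\mathbf{u}}{\partial\xi}\cdot\mathbf{g}_\zeta\right)$ , $e_{\eta\zeta} = \dfrac12\left(\dfrac{\partial\mathbf{u}}{\partial\zeta}\cdot\mathbf{g}_\eta + \dfrac{\partial\mathbf{u}}{\partial\eta}\cdot\mathbf{g}_\zeta\right)$ .

From Eqs. (10)-(13), one finds the covariant strains in terms of the nodal
values of $\mathbf{u}_0$, $\theta_\xi$, $\theta_\eta$. From Eqs. (11) and (13), we have

$e_{\xi\zeta} = \dfrac12\sum_i \dfrac{\partial h_i}{\partial\xi}\,\mathbf{g}_\zeta\cdot(\mathbf{u}_0)_i - \dfrac14\sum_i h_i\,\mathbf{g}_\xi\cdot[(\theta_\xi\mathbf{e}_\xi + \theta_\eta\mathbf{e}_\eta)t]_i - \dfrac{\zeta}{4}\sum_i \dfrac{\partial h_i}{\partial\xi}\,\mathbf{g}_\zeta\cdot[(\theta_\xi\mathbf{e}_\xi + \theta_\eta\mathbf{e}_\eta)t]_i$ .

The last sum represents the contribution of curvature changes to the
transverse shear. It vanishes everywhere for flat elements, but in general
only at the nodes. A similar expression can be derived using Eq. (10a).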

The engineering shear strains can be obtained by multiplying the physical
components of the shear strains by 2. Since $\mathbf{g}_\xi$ and $\mathbf{g}_\eta$ are linear
in $\zeta$, $e_{\xi\xi}$, $e_{\eta\eta}$, $e_{\xi\eta}$ are quadratic and $e_{\xi\zeta}$, $e_{\eta\zeta}$ are linear in $\zeta$. Note that
$e_{\zeta\zeta} = \mathbf{g}_\zeta\cdot\partial\mathbf{u}/\partial\zeta$ is only approximately equal to zero due to the
approximation of Eq. (10) or (10a). One often neglects $e_{\zeta\zeta}$ in practical
computation. The use of the representation of Eq. (10) or (10a) avoids the
explicit calculation of the Euclidean Christoffel symbols [Eq. (2.15:4)] in
deriving the deformation gradient.
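
Once the deformation gradient and the covariant base vectors are known at a point, Eqs. (12) and (13) reduce to a few dot products. The sketch below is illustrative only; the function and key names are hypothetical, and the physical components are scaled as in Eq. (4.10:11), i.e. divided by the lengths of the two base vectors involved.

```python
import math


def dot(a, b):
    """Dot product of two 3-component vectors."""
    return sum(p * q for p, q in zip(a, b))


def covariant_strains(du_dxi, du_deta, du_dzeta, g_xi, g_eta, g_zeta):
    """Covariant strains of Eqs. (12) and (13) from the deformation gradient."""
    return {
        "xi_xi": dot(du_dxi, g_xi),
        "eta_eta": dot(du_deta, g_eta),
        "xi_eta": 0.5 * (dot(du_dxi, g_eta) + dot(du_deta, g_xi)),
        "xi_zeta": 0.5 * (dot(du_dzeta, g_xi) + dot(du_dxi, g_zeta)),
        "eta_zeta": 0.5 * (dot(du_dzeta, g_eta) + dot(du_deta, g_zeta)),
    }


def physical_component(e_ab, g_a, g_b):
    """Physical strain component e_ab / (|g_a| |g_b|)."""
    return e_ab / (math.sqrt(dot(g_a, g_a)) * math.sqrt(dot(g_b, g_b)))
```

A useful check is a small rigid rotation, for which each derivative of the displacement is the rotation vector crossed with the corresponding base vector: all five strains then vanish, even when the base vectors are not orthogonal.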


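Example-Ring Element. Consider a 2-node ring element of thickness t
and radius $r_0$ over the angle $-\varphi_0 \le \varphi \le \varphi_0$. The nodes are at $r = r_0$,
$\varphi_{1,2} = \mp\varphi_0$. The unit base vectors $\mathbf{e}_r$, $\mathbf{e}_\varphi$ of the fiber base coordinates
coincide with those of the cylindrical coordinates at the nodes. The finite
element representations of the ring and the displacements are

$\mathbf{x} = \mathbf{x}_0 + \zeta t\mathbf{e}_r/2 = (r_0 + \zeta t/2)\sum_{i=1}^{2}(\mathbf{e}_r)_i h_i(\xi)$ ,

$\mathbf{u} = \sum_{i=1}^{2} h_i(\xi)(\mathbf{u}_0 + \zeta t\theta\mathbf{e}_\varphi/2)_i$ ,

where

$\zeta = 2(r - r_0)/t$ , $\xi = \varphi/\varphi_0$ , $h_1 = (1 - \xi)/2$ , $h_2 = (1 + \xi)/2$ .

The covariant base vectors and the deformation gradient are

$\mathbf{g}_\xi = \dfrac12\left(r_0 + \dfrac{\zeta t}{2}\right)[(\mathbf{e}_r)_2 - (\mathbf{e}_r)_1]$ , $\mathbf{g}_\zeta = \dfrac{t}{2}\sum_i(\mathbf{e}_r)_i h_i(\xi)$ ,

$\dfrac{\partial\mathbf{u}}{\partial\xi} = \dfrac{(\mathbf{u}_0)_2 - (\mathbf{u}_0)_1}{2} + \dfrac{(\theta\mathbf{e}_\varphi)_2 - (\theta\mathbf{e}_\varphi)_1}{4}\,\zeta t$ , $\dfrac{\partial\mathbf{u}}{\partial\zeta} = \dfrac{t}{2}\sum_i(\theta\mathbf{e}_\varphi)_i h_i$ .

The covariant strains of the natural coordinates are

$e_{\xi\xi} = \dfrac14\left(r_0 + \dfrac{\zeta t}{2}\right)\left[(\mathbf{u}_0)_2 - (\mathbf{u}_0)_1 + \dfrac{(\theta\mathbf{e}_\varphi)_2 - (\theta\mathbf{e}_\varphi)_1}{2}\,\zeta t\right]\cdot[(\mathbf{e}_r)_2 - (\mathbf{e}_r)_1]$ ,

$e_{\xi\zeta} = \dfrac{t}{8}\left\{\left(r_0 + \dfrac{\zeta t}{2}\right)[(\mathbf{e}_r)_2 - (\mathbf{e}_r)_1]\cdot\sum_i(\theta\mathbf{e}_\varphi)_i h_i + \left[(\mathbf{u}_0)_2 - (\mathbf{u}_0)_1 + \dfrac{(\theta\mathbf{e}_\varphi)_2 - (\theta\mathbf{e}_\varphi)_1}{2}\,\zeta t\right]\cdot\sum_i(\mathbf{e}_r)_i h_i\right\}$ .

The physical components of the strains [Eq. (4.10:11)] are

$\varepsilon_{\xi\xi} = e_{\xi\xi}/|\mathbf{g}_\xi|^2$ , $\varepsilon_{\xi\zeta} = e_{\xi\zeta}/(|\mathbf{g}_\xi|\,|\mathbf{g}_\zeta|)$ .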

Constitutive Equation. A general plane stress constitutive equation can
be written with respect to the normal base coordinates in the form

(14) $\boldsymbol{\sigma} = \mathbf{D}_{3e}\,\boldsymbol{\varepsilon}$ ,

where $\boldsymbol{\sigma}$, $\boldsymbol{\varepsilon}$ are the in-plane stress and strain matrices defined as

(15) $\boldsymbol{\sigma} = [\sigma_{\xi\xi}\ \sigma_{\eta\eta}\ \sigma_{\xi\eta}\ \sigma_{\xi\zeta}\ \sigma_{\eta\zeta}]^T$ , $\boldsymbol{\varepsilon} = [\varepsilon_{\xi\xi}\ \varepsilon_{\eta\eta}\ 2\varepsilon_{\xi\eta}\ 2\varepsilon_{\xi\zeta}\ 2\varepsilon_{\eta\zeta}]^T$ ,

$\mathbf{D}_{3e}$ is a 5 x 5 symmetric elastic modulus matrix for plane stress derived
from the 6 x 6 modulus matrix for 3-dimensional elasticity [Eq. (18.16:4)]
by enforcing the zero normal stress condition ($\sigma_{\zeta\zeta} = 0$) in the direction
normal to the shell surface. We use the subscript 3 to distinguish this elastic
modulus matrix from the 2-dimensional case. For materials with transverse
shear strains $\varepsilon_{\xi\zeta}$, $\varepsilon_{\eta\zeta}$ decoupled from the rest of the strain components in
the constitutive equation, we have

(16) $\mathbf{D}_{3e} = \begin{bmatrix}\mathbf{D}_e & \mathbf{0}\\ \mathbf{0} & \mathbf{D}_s\end{bmatrix}$ ,

where $\mathbf{D}_e$ is a 3 x 3 elastic modulus matrix for plane stress [see Eq. (18.14:23)
for orthotropic materials] and $\mathbf{D}_s$ is a 2 x 2 transverse shear modulus matrix.
To attain results consistent with classical bending theory, a shear factor
needs to be introduced, which usually amounts to multiplying the transverse
shearing moduli by $\kappa = 5/6$. For isotropic materials, $\mathbf{D}_s$ is a diagonal
matrix given in Eq. (20.2:4).
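
For an isotropic material the block structure of Eq. (16) can be written out explicitly. The sketch below assumes the strain ordering of Eq. (15) with engineering shear strains, the standard isotropic plane stress matrix for $\mathbf{D}_e$, and $\mathbf{D}_s = \kappa G\,\mathbf{I}$; the last two are the usual textbook forms and are stated here as assumptions, since Eq. (20.2:4) is not reproduced in this section. The function name is hypothetical.

```python
def isotropic_shell_moduli(E, nu, kappa=5.0 / 6.0):
    """5 x 5 modulus matrix of Eq. (16) for an isotropic material.

    Assumed strain order: [e_xixi, e_etaeta, 2e_xieta, 2e_xizeta, 2e_etazeta].
    """
    c = E / (1.0 - nu * nu)        # plane stress factor
    G = E / (2.0 * (1.0 + nu))     # shear modulus, equal to c * (1 - nu) / 2
    D = [[0.0] * 5 for _ in range(5)]
    # D_e block: in-plane plane stress moduli.
    D[0][0] = D[1][1] = c
    D[0][1] = D[1][0] = c * nu
    D[2][2] = G
    # D_s block: transverse shear moduli scaled by the shear factor kappa.
    D[3][3] = D[4][4] = kappa * G
    return D
```

Applying the matrix to the strain state of uniaxial tension, $[1/E,\ -\nu/E,\ 0,\ 0,\ 0]$, returns a unit stress in the first component and zero in the others, which is a quick way to confirm that the plane stress condition has been built in.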


Strain Transformation. Strain transformation between coordinate systems
follows the tensor transformation law (see Sec. 2.5). The physical
components of the strain tensor in the normal base coordinates can be
expressed in terms of those in the natural coordinates as




£ « 



£ tv 

£ «C 

£ in 

Zryri 

£ vC 

= r t 

£ (v 

^rjr) 

£ vC 

% 

% 

£ CC 


£ « 

£ v( 

£ Cf _ 


(18) 


g< -e c 


g £ • 
g C -e v 


g^ ' 
g v -e c 
g C • e c 


where (·) denotes the dot product. From Eqs. (17) and (18) together with
Eqs. (12) and (13), we can obtain $\varepsilon_{ij}$ in terms of shape functions and nodal
values of $\mathbf{u}_0$, $\theta_\xi$, $\theta_\eta$. To be consistent with the shell theory, one can neglect
$\varepsilon_{\zeta\zeta}$ in Eq. (17). The engineering shears $\gamma$'s can be obtained by multiplying
the corresponding shear strains $\varepsilon$'s by 2. Then we can write the
strain-displacement relation in the matrix form

(19) $\boldsymbol{\varepsilon} = \mathbf{B}\mathbf{q} = \sum_{i=1}^{p}\mathbf{B}_i\mathbf{q}_i$ ,

where $\mathbf{B}$ is the strain displacement matrix, p is the number of nodes of the
element, the subscript i denotes association with node i and

(20) $\mathbf{q}_i = [(\mathbf{u}_0)_i^T\ (\theta_\xi)_i\ (\theta_\eta)_i]^T$ , $\mathbf{q} = [\mathbf{q}_1^T\ \mathbf{q}_2^T\ \cdots\ \mathbf{q}_p^T]^T$ .

The calculation of $\mathbf{B}$ from Eqs. (1)-(7), (10)-(13), (17) and (18) is tedious
but straightforward.


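Element Matrices. The shell element shown in Fig. 20.6:1 is a solid whose
top and bottom surfaces are curved and whose sides across the thickness
are generated by straight lines (fibers). We shall derive the matrices for the
shell element from the potential energy given in Eq. (18.16:9), converted to
the shell geometry below,

(21) $\Pi_e = \int_{A_e}\int_{-1}^{1}\left(\dfrac12\boldsymbol{\varepsilon}^T\mathbf{D}_{3e}\boldsymbol{\varepsilon} - \mathbf{u}^T\mathbf{b}\right)\dfrac{t}{2}J_A\,d\zeta\,dA - \int_{\partial A_{\sigma e}}\int_{-1}^{1}\mathbf{u}^T\bar{\mathbf{T}}\,\dfrac{t}{2}J_s\,d\zeta\,ds + \int_{A_e} p\,\mathbf{u}_0(\xi,\eta)\cdot\hat{\mathbf{e}}_{\zeta 0}\,dA$ ,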
where zero shear on the top and bottom shell surfaces is assumed and

(22) $p = \bar{T}^- J_A^- - \bar{T}^+ J_A^+ \approx \bar{T}^- - \bar{T}^+$ ,

in which $\bar{T}^+$, $\bar{T}^-$ denote prescribed normal traction on the upper and lower
shell surfaces, respectively. The quantities $J_A$, $J_s$ are, respectively, the
Jacobians for the volume and the surface integrals normalized by their
corresponding values at the midsurface defined as

(23) $J_A = \mathbf{g}_\zeta\cdot(\mathbf{g}_\xi\times\mathbf{g}_\eta)/|\mathbf{g}_\zeta\cdot(\mathbf{g}_{\xi 0}\times\mathbf{g}_{\eta 0})|$ ,

(24) $J_s = |\mathbf{g}_\zeta\times\mathbf{g}_\alpha|/|\mathbf{g}_\zeta\times\mathbf{g}_{\alpha 0}|$ ,

where $\alpha = \xi$ if the boundary is along the curve $\eta$ = constant, and $\alpha = \eta$ if
the boundary is along the curve $\xi$ = constant. Both $\mathbf{g}_\xi$ and $\mathbf{g}_\eta$ are functions
of $\xi, \eta, \zeta$ and so are $J_A$ and $J_s$. If the boundary of the shell element is part
of a cylindrical surface, $J_s$ is 1. In the case of a plate, $J_A$ is also 1. In
practice, one may approximate $J_A$ and $J_s$ as unity if the shell is thin.

Using Eq. (10) or (10a) and Eq. (19), we can derive the element matrices
from Eq. (21). The element potential energy takes the form

(25) $\Pi_e = \dfrac12\mathbf{q}^T\mathbf{k}\mathbf{q} - \mathbf{q}^T\mathbf{f}$ ,

where $\mathbf{f}$ collects all integrals that are linear in $\mathbf{u}$ and $\mathbf{u}_0$ in Eq. (21) and
the element stiffness matrix is

(26) $\mathbf{k} = \int_{A_e}\int_{-1}^{1}\dfrac{t}{2}\,\mathbf{B}^T\mathbf{D}_{3e}\mathbf{B}\,J_A\,d\zeta\,dA$ .

Gaussian quadrature is most efficient for integration along the fibers if the 
shell consists of a single homogeneous layer. Usually a two-point Gauss 
rule is adequate to sense the bending and membrane actions. For layered 
laminates of different materials through the thickness, the one or two-point 
Gauss rule has to be applied to each layer. 
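
The remark about layered laminates can be made concrete with a small sketch of the through-thickness integration over $-1 \le \zeta \le 1$. The two-point rule is exact for cubic polynomials, so it integrates the quadratic variation of the strain energy density in a homogeneous layer exactly, but it must be applied layer by layer when the modulus jumps. The function name and the two-layer data below are illustrative assumptions.

```python
import math

# Two-point Gauss rule on [-1, 1]: abscissas and weights.
GAUSS2 = ((-1.0 / math.sqrt(3.0), 1.0), (1.0 / math.sqrt(3.0), 1.0))


def integrate_thickness(f, layers=((-1.0, 1.0),)):
    """Integrate f(zeta) with the two-point Gauss rule applied to each layer.

    layers is a sequence of (zeta_bottom, zeta_top) intervals.
    """
    total = 0.0
    for a, b in layers:
        half, mid = (b - a) / 2.0, (a + b) / 2.0
        for x, w in GAUSS2:
            total += w * half * f(mid + half * x)
    return total


def layered_energy_density(zeta):
    """Bending-type integrand E(zeta) * zeta**2 with a modulus jump at 0.5."""
    modulus = 1.0 if zeta < 0.5 else 3.0
    return modulus * zeta * zeta


single_rule = integrate_thickness(layered_energy_density)
per_layer = integrate_thickness(layered_energy_density,
                                layers=((-1.0, 0.5), (0.5, 1.0)))
```

Here the exact value is 1.25, which the per-layer rule reproduces, while one rule over the whole thickness gives 4/3 because its sampling points cannot see where the interface lies.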


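Global Coordinates. We must transform $\mathbf{q}$ of the element to the global
coordinates before assembling. In Eq. (20), the components of $\mathbf{u}_0$ of node i
refer to the local coordinates and $\theta_\xi$, $\theta_\eta$ refer to the fiber base coordinates.
Different transformations are needed to transform them to the global
system. Let $\bar{\mathbf{e}}_x, \bar{\mathbf{e}}_y, \bar{\mathbf{e}}_z$ be the unit base vectors of the global system and let
$\bar{\mathbf{q}}$, the displacement and rotation matrix in global coordinates, be defined as

(27) $\bar{\mathbf{q}} = [\bar{\mathbf{q}}_1^T\ \bar{\mathbf{q}}_2^T\ \cdots\ \bar{\mathbf{q}}_p^T]^T$ , $\bar{\mathbf{q}}_i = [\bar{\mathbf{u}}_0^T\ \bar{\boldsymbol{\theta}}^T]_i^T$ , $\bar{\boldsymbol{\theta}} = [\bar\theta_x\ \bar\theta_y\ \bar\theta_z]^T$ .

Note that $\bar{\mathbf{q}}_i$ is a column matrix of six components and that $\bar\theta_y$, $-\bar\theta_x$ denote
the rotations about the $\bar{x}$, $\bar{y}$-axes. Then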


(28) 


(29) 



e x • Q y e x • g z 


® y ' ® z I (^o)i j 


&z ' e x e z ' e y e z ' e z 


V 


® T) ‘ 


e v ' e C 

dr, 

j 

_ 


S € ' . 


Substituting Eqs. (28) and (29) into Eq. (25), we obtain the element
matrices associated with displacements and rotations in global coordinates to
be assembled. Note that if we represent the displacement by Eq. (10a), we
will have to replace $\mathbf{e}_\xi, \mathbf{e}_\eta, \mathbf{e}_\zeta$ by $\hat{\mathbf{e}}_\xi, \hat{\mathbf{e}}_\eta, \hat{\mathbf{e}}_\zeta$.


Drilling Degree-of-Freedom and Coordinate Transformation.
Equation (29) transforms, at each node, two rotations in local coordinates to
three rotations in global coordinates. As in plate assembly, the assembled
stiffness matrix will be singular if all elements meeting at a node have the
same fiber direction at the common node, which is most likely the case.
To avoid this potential singularity, one can modify the element formulation
by making the rotations about the nodal fibers contribute to strains using
drilling freedoms similar to those described in Sec. 20.5. An alternative is
to identify a global set of fiber base coordinates for each node. All rotations
at a common node are transformed to these coordinates before assembling.
In this case, there will be only two rotations (or five degrees-of-freedom
in total) per node. Another alternative is to use the normal base coordinates
for rotations as given in Eq. (10a). Due to the approximation of shell geometry
by Eq. (1), the normal vectors to different elements at the common node
will usually be different. Thus singularity may be avoided.


Remark. A comment on the strain derivation is in order. In the literature, the
normal and fiber base coordinates are often called the running local
Cartesian coordinates and the strains referred to these coordinates are simply
defined as

(30) $\varepsilon_{\xi\eta} = \dfrac12\left(\dfrac{\partial u_\xi}{\partial\eta} + \dfrac{\partial u_\eta}{\partial\xi}\right)$ , etc.

where u's, $\xi$'s are the components in the referenced coordinate system. One
then expresses u's and $\xi$'s in terms of their corresponding components in the
local fixed Cartesian coordinates based on the coordinate transformation
rule and proceeds with the partial differentiation. The transformation does
not change the fact that the strains in Eq. (30) are defined based only on
partial differentiation rather than on covariant differentiation (Sec. 2.12).
Even though the normal and the fiber base coordinates are orthogonal, the
orientation of these running local coordinates changes with location within
the element for general shells. This change contributes to strains through the
terms containing the Euclidean Christoffel symbols. In other words, strains
should be derived from the deformation gradient, $\partial\mathbf{u}/\partial\xi$, $\partial\mathbf{u}/\partial\eta$, etc. in
order to account for the covariant or contravariant differentiation, rather
than just from the partial derivatives of the components of the displacement.

In the exercises below, we use $\theta$ as the circumferential coordinate of the
cylindrical coordinates of unit base vectors $\mathbf{e}_r$, $\mathbf{e}_\theta$, and $\alpha$ for the rotation
about the z-axis.


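Problem 20.11. Consider a ring element of thickness t and radius $r_0$ over
$-\theta_0 \le \theta \le \theta_0$,

$\mathbf{x} = \mathbf{x}_0 + \zeta t\mathbf{e}_r/2 = r_0(\cos\theta\,\mathbf{e}_x + \sin\theta\,\mathbf{e}_y) + \zeta t\mathbf{e}_r/2 = (r_0 + \zeta t/2)\mathbf{e}_r$ ,

where

$\mathbf{e}_r = \cos\theta\,\mathbf{e}_x + \sin\theta\,\mathbf{e}_y$ , $\zeta = 2(r - r_0)/t$ , $\xi = \theta/\theta_0$ .

The element has three nodes at $\theta_{1,2} = \mp\theta_0$, $\theta_3 = 0$. Assuming the displacement
in the form

$\mathbf{u} = \sum_{i=1}^{3} h_i(\xi)\left(\mathbf{u}_0 + \dfrac{\zeta t}{2}\alpha\mathbf{e}_\theta\right)_i$ , $\mathbf{e}_\theta = -\sin\theta\,\mathbf{e}_x + \cos\theta\,\mathbf{e}_y$ ,

$h_1 = -\dfrac{\xi(1 - \xi)}{2}$ , $h_2 = \dfrac{\xi(1 + \xi)}{2}$ , $h_3 = 1 - \xi^2$ ,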

show that the covariant base vectors and the metric tensor are 



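and that (c) the physical component of the strain in the radial direction is

$\varepsilon_{\zeta\zeta} = \left(\dfrac{2}{t}\right)^2\mathbf{g}_\zeta\cdot\dfrac{\partial\mathbf{u}}{\partial\zeta} = \alpha_1 h_1\sin[\theta_0(\xi + 1)] + \alpha_2 h_2\sin[\theta_0(\xi - 1)] + \alpha_3 h_3\sin(\theta_0\xi)$ ,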

which is not zero whereas its representation in the shell theory is zero. This 
quantity is of the order 0?, for small element (i.e., 8 0 is small). [Hint: Use the 
following relations: 


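$\mathbf{e}_{\theta i}\cdot\mathbf{e}_\theta = \mathbf{e}_{ri}\cdot\mathbf{e}_r = \cos(\theta - \theta_i)$ , $\mathbf{e}_{\theta i}\cdot\mathbf{e}_r = -\mathbf{e}_{ri}\cdot\mathbf{e}_\theta = \sin(\theta - \theta_i)$ ,

$\mathbf{u}_{0i} = u_{0i}\mathbf{e}_x + v_{0i}\mathbf{e}_y = u_{r0i}\mathbf{e}_{ri} + u_{\theta 0i}\mathbf{e}_{\theta i}$ , $\theta_2 = -\theta_1 = \theta_0$ , $\theta_3 = 0$ .]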


Problem 20.12. Repeat Problem 20.11 using 

3 

x = ^*(£)( x o + Cee/2 )i , ( x o)i = ro(e t cos0i + e^sin^). 


Problem 20.13. Let the displacements of the ring discussed above be

$\mathbf{u} = u_r(\theta)\mathbf{e}_r + u_\theta(\theta)\mathbf{e}_\theta + \alpha(\theta)(r - r_0)\mathbf{e}_\theta$ .

(a) Derive the physical strain components $\varepsilon_{rr}$, $\varepsilon_{r\theta}$, $\varepsilon_{\theta\theta}$.

(b) Let 

3 3 

u r (8)e r + u e {8)ee = ^(u 0 )h,(£) > a(0) = ^ . 

i=l »=i 

Find the differences between strains derived from the displacements above and 
those from Problem 20.12. Show the difference is of the order of 0% for small 0q. 

Problem 20.14. Using the tensor transformation rule, determine the com¬ 
ponents of D e , Ds in the fiber base coordinates in terms of those of D e , D s in the 
normal base coordinates. [Hint: e T D e e = e T D e e.] 

Problem 20.15. Use Eqs. (20.6:17) and (20.6:18) and e T D 3e e = e T D 3 e £, 
express the components of D 3 ,. in terms of those of £> 3 ,.. 


20.7. LOCKING AND STABILIZATION IN SHELL APPLICATIONS

Doubly curved shell elements can lock in thin shell applications. We
illustrate the potential for locking with the following example. Let us examine
the characteristics of $e_{\xi\xi}$ and $e_{\xi\zeta}$ at the midsurface of a 4-node quadrilateral
element of constant thickness with $\mathbf{x}_{0i}$ being the position of node i. From
Eqs. (20.6:5) and (20.6:11)-(20.6:13), we have


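(1) $\mathbf{g}_\xi = \mathbf{a}_2 + \mathbf{a}_4\eta$ , $\mathbf{g}_\zeta = \dfrac{t}{2}\sum_{i=1}^{4}h_i\mathbf{s}_i$ ,

(2) $\dfrac{\partial\mathbf{u}}{\partial\xi} = \mathbf{b}_2 + \mathbf{b}_4\eta$ , $\dfrac{\partial\mathbf{u}}{\partial\zeta} = -\dfrac{t}{2}\sum_{i=1}^{4}[\theta_\xi\mathbf{e}_\xi + \theta_\eta\mathbf{e}_\eta]_i h_i$ ,

(3a) $e_{\xi\xi} = (\mathbf{a}_2 + \mathbf{a}_4\eta)\cdot(\mathbf{b}_2 + \mathbf{b}_4\eta)$ ,

(3b) $e_{\xi\zeta} = \dfrac{t}{4}\left[(\mathbf{b}_2 + \mathbf{b}_4\eta)\cdot\sum_{i=1}^{4}h_i\mathbf{s}_i - (\mathbf{a}_2 + \mathbf{a}_4\eta)\cdot\sum_{i=1}^{4}(\theta_\xi\mathbf{e}_\xi + \theta_\eta\mathbf{e}_\eta)_i h_i\right]$ ,

where $\mathbf{s}_i$ is a unit vector in the fiber direction at node i and

(4) $\mathbf{a}_2 = \dfrac{-\mathbf{x}_{01} + \mathbf{x}_{02} + \mathbf{x}_{03} - \mathbf{x}_{04}}{4}$ , $\mathbf{a}_4 = \dfrac{\mathbf{x}_{01} - \mathbf{x}_{02} + \mathbf{x}_{03} - \mathbf{x}_{04}}{4}$ ,

(5) $\mathbf{b}_2 = \dfrac{-\mathbf{u}_1 + \mathbf{u}_2 + \mathbf{u}_3 - \mathbf{u}_4}{4}$ , $\mathbf{b}_4 = \dfrac{\mathbf{u}_1 - \mathbf{u}_2 + \mathbf{u}_3 - \mathbf{u}_4}{4}$ .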

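In the limit of inextensional bending and zero transverse shear for thin
shells, the coefficients of the polynomials for $e_{\xi\xi}$, $e_{\xi\zeta}$ vanish. This
represents constraints on the parameters appearing in the coefficients. Since
$e_{\xi\xi}$ is quadratic in $\eta$ and independent of $\xi$ and $e_{\xi\zeta}$ is quadratic in $\eta$ but
linear in $\xi$, there are 9 possible constraints (3 from $e_{\xi\xi}$ and 6 from $e_{\xi\zeta}$)
for the parameters. For general shells, it is difficult to identify which of
these constraints is field-consistent (Sec. 20.3) and which is not. For flat
parallelograms ($\mathbf{a}_4 = 0$, $\mathbf{s}_1 = \mathbf{s}_2 = \mathbf{s}_3 = \mathbf{s}_4$), one can show that only the
transverse shear has spurious terms, which come from the coefficients of $\xi$
and $\xi\eta$ in the last term in Eq. (3b). We define

$\sum_{i=1}^{4}(\theta_\xi\mathbf{e}_\xi + \theta_\eta\mathbf{e}_\eta)_i h_i = \sum_{i=1}^{4}\boldsymbol{\Psi}_i h_i = \boldsymbol{\Theta}_1 + \boldsymbol{\Theta}_2\xi + \boldsymbol{\Theta}_3\eta + \boldsymbol{\Theta}_4\xi\eta$ ,

where

$\boldsymbol{\Theta}_1 = \dfrac{\boldsymbol{\Psi}_1 + \boldsymbol{\Psi}_2 + \boldsymbol{\Psi}_3 + \boldsymbol{\Psi}_4}{4}$ , $\boldsymbol{\Theta}_2 = \dfrac{-\boldsymbol{\Psi}_1 + \boldsymbol{\Psi}_2 + \boldsymbol{\Psi}_3 - \boldsymbol{\Psi}_4}{4}$ ,

$\boldsymbol{\Theta}_3 = \dfrac{-\boldsymbol{\Psi}_1 - \boldsymbol{\Psi}_2 + \boldsymbol{\Psi}_3 + \boldsymbol{\Psi}_4}{4}$ , $\boldsymbol{\Theta}_4 = \dfrac{\boldsymbol{\Psi}_1 - \boldsymbol{\Psi}_2 + \boldsymbol{\Psi}_3 - \boldsymbol{\Psi}_4}{4}$ .

The spurious constraints are then

(6) $\boldsymbol{\Theta}_2\cdot\mathbf{a}_2 \to 0$ , $\boldsymbol{\Theta}_4\cdot\mathbf{a}_2 \to 0$ .

Similarly we can determine the spurious constraints for $e_{\eta\zeta}$. Then the
field-consistent transverse shear strains are (Note: $\sum_{i=1}^{4}h_i = 1$)

(7) $e_{\xi\zeta} = \dfrac{t}{4}\left[\dfrac{\mathbf{b}_2 + \mathbf{b}_4\eta}{4}\cdot\sum_{i=1}^{4}\mathbf{s}_i - \mathbf{a}_2\cdot(\boldsymbol{\Theta}_1 + \boldsymbol{\Theta}_3\eta)\right]$ ,

$e_{\eta\zeta} = \dfrac{t}{4}\left[\dfrac{\mathbf{b}_3 + \mathbf{b}_4\xi}{4}\cdot\sum_{i=1}^{4}\mathbf{s}_i - \mathbf{a}_3\cdot(\boldsymbol{\Theta}_1 + \boldsymbol{\Theta}_2\xi)\right]$ ,

which are to be used in calculating the shear strain energy to remove locking
and in evaluating the transverse shear stresses to avoid linear oscillations
within an element. This is equivalent to evaluating $e_{\xi\zeta}$ at $\xi = 0$ and $e_{\eta\zeta}$
at $\eta = 0$.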
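
The coefficient vectors of Eqs. (1), (4) and (5) are simply signed averages of the nodal values, so the flat parallelogram condition $\mathbf{a}_4 = 0$ is easy to check numerically. The sketch below assumes the node ordering implied by Eq. (4), with natural coordinates (-1,-1), (1,-1), (1,1), (-1,1) for nodes 1 to 4; the sign patterns for index 1 and 3 are inferred by analogy rather than quoted from the text, and the function names are hypothetical.

```python
# Sign patterns of the bilinear expansion v = c1 + c2*xi + c3*eta + c4*xi*eta.
SIGNS = {
    1: (1, 1, 1, 1),
    2: (-1, 1, 1, -1),    # matches a_2, b_2 in Eqs. (4) and (5)
    3: (-1, -1, 1, 1),    # assumed by analogy
    4: (1, -1, 1, -1),    # matches a_4, b_4 in Eqs. (4) and (5)
}


def bilinear_coefficient(k, nodal_vectors):
    """Return the k-th coefficient vector of the bilinear expansion."""
    n = len(nodal_vectors[0])
    return [sum(s * v[c] for s, v in zip(SIGNS[k], nodal_vectors)) / 4.0
            for c in range(n)]


def g_xi_midsurface(nodes, eta):
    """Covariant base vector g_xi = a_2 + a_4 * eta of Eq. (1)."""
    a2 = bilinear_coefficient(2, nodes)
    a4 = bilinear_coefficient(4, nodes)
    return [p + q * eta for p, q in zip(a2, a4)]
```

The same function applied to nodal displacements gives $\mathbf{b}_2$ and $\mathbf{b}_4$. For a parallelogram the fourth coefficient vanishes and $\mathbf{g}_\xi$ is constant over the element; for a general quadrilateral it varies linearly with $\eta$ and equals half of the edge vector on each of the two edges $\eta = \pm 1$.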

The literature is full of techniques for locking removal. They include
uniformly reduced integration, selectively reduced integration, drilling
degrees-of-freedom, mixed formulation, etc. No single technique can completely
eliminate locking for general shell elements. The 4-node quadrilateral with
uniform one-point integration and the 9-node quadrilateral with 2 x 2 reduced
integration are most frequently used in practice. Reduced integration offers
substantial savings in computer time in deriving the element matrices because
fewer integration operations are needed. Further savings can be achieved by
taking advantage of the symmetric and anti-symmetric properties of
$\partial h_i/\partial\xi$, $\partial h_i/\partial\eta$ at integration points. Choice of an element in applications
must balance the computational efficiency against the desired accuracy of
the results.

Stabilization. Uniformly reduced integration always and selectively re¬ 
duced integration usually introduce spurious zero energy modes (kinematic 
modes). These modes occasionally give spurious solution in static analysis 
and often give spatially oscillatory solutions in dynamic simulation. The os¬ 
cillations are called the hourglass modes. Stabilization schemes have been 
devised to eliminate the kinematic modes (Hughes and Tezduyar 1981). 
Belytschko and his associates (Belytschko and Tsay 1983, Belytschko et 
al. 1984, 1989) developed a general stabilization matrix for 4-node quadri¬ 
lateral plate bending element with one point integration. The stabilization 
matrix is to be added to the element stiffness matrix. For the nodal pa¬ 
rameter matrix 

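(8) $\mathbf{q} = [\mathbf{w}^T\ \boldsymbol{\theta}_x^T\ \boldsymbol{\theta}_y^T]^T$ ,

in which

(9) $\mathbf{w} = [w_1\ w_2\ w_3\ w_4]^T$ , $\boldsymbol{\theta}_x = [\theta_{x1}\ \cdots\ \theta_{x4}]^T$ , $\boldsymbol{\theta}_y = [\theta_{y1}\ \cdots\ \theta_{y4}]^T$ ,

with w's (out-of-plane displacements) and $\theta$'s (rotations) being the nodal
values, they defined the following column matrices

(10) $\mathbf{r} = [1\ 1\ 1\ 1]^T$ , $\mathbf{g} = [1\ {-1}\ 1\ {-1}]^T$ ,

$\mathbf{x} = [x_1\ x_2\ x_3\ x_4]^T$ , $\mathbf{y} = [y_1\ y_2\ y_3\ y_4]^T$ ,

$\mathbf{b}_1 = \dfrac12[y_2 - y_4\quad y_3 - y_1\quad y_4 - y_2\quad y_1 - y_3]^T$ ,

$\mathbf{b}_2 = -\dfrac12[x_2 - x_4\quad x_3 - x_1\quad x_4 - x_2\quad x_1 - x_3]^T$ .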
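The rigid body modes are

(11) $\mathbf{q} = [\mathbf{r}^T\ \mathbf{0}\ \mathbf{0}]^T$ , $[\mathbf{x}^T\ \mathbf{r}^T\ \mathbf{0}]^T$ , $[\mathbf{y}^T\ \mathbf{0}\ \mathbf{r}^T]^T$ .

The kinematic modes are

(12) $\mathbf{q} = [\mathbf{g}^T\ \mathbf{0}\ \mathbf{0}]^T$ , $[\mathbf{0}\ \mathbf{g}^T\ \mathbf{0}]^T$ , $[\mathbf{0}\ \mathbf{0}\ \mathbf{g}^T]^T$ , $[a_1\mathbf{x}^T + a_2\mathbf{y}^T\quad \mathbf{x}^T\quad \mathbf{y}^T]^T$ ,

where

$a_1 = \mathbf{r}^T\mathbf{y}/4$ , $a_2 = \mathbf{r}^T\mathbf{x}/4$ .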

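The stabilization matrix is in the form

(13) $\mathbf{k}_h = \begin{bmatrix} c_1\mathbf{a}\mathbf{a}^T & \mathbf{0} & \mathbf{0}\\ \mathbf{0} & c_2\mathbf{a}\mathbf{a}^T & \mathbf{0}\\ \mathbf{0} & \mathbf{0} & c_2\mathbf{a}\mathbf{a}^T \end{bmatrix}$ ,

where

(14) $\mathbf{a} = \mathbf{g} - \dfrac{1}{\mathbf{x}^T\mathbf{b}_1}[(\mathbf{g}^T\mathbf{x})\mathbf{b}_1 + (\mathbf{g}^T\mathbf{y})\mathbf{b}_2]$ ,

is chosen to be orthogonal to the rigid body motion. The choice is to assure
that the stabilization does not affect the genuine zero strain energy modes.
Note that

$\mathbf{r}^T\mathbf{g} = \mathbf{r}^T\mathbf{b}_1 = \mathbf{r}^T\mathbf{b}_2 = 0$ , $\mathbf{g}^T\mathbf{b}_1 = \mathbf{g}^T\mathbf{b}_2 = \mathbf{y}^T\mathbf{b}_1 = \mathbf{x}^T\mathbf{b}_2 = 0$ ,
$\mathbf{x}^T\mathbf{b}_1 = \mathbf{y}^T\mathbf{b}_2$ .

The constants c's are to be judiciously selected. If they are too large, the
element will lock just like the fully integrated element, whereas too
small a value will fail to control oscillations in the solution. Note that $\mathbf{a}$
does not control the last kinematic mode of Eq. (12) because $\mathbf{a}$ is orthogonal
to it. In practice, it does not matter because the last kinematic mode cannot
exist in a mesh of two or more elements.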
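
The orthogonality properties quoted for Eq. (14) can be verified numerically for any quadrilateral. The sketch below builds $\mathbf{b}_1$, $\mathbf{b}_2$ and $\mathbf{a}$ from the nodal coordinates; it assumes that the scalar factor in Eq. (14) is $1/(\mathbf{x}^T\mathbf{b}_1)$, which is the value that makes $\mathbf{a}$ orthogonal to $\mathbf{x}$ and $\mathbf{y}$ given the identities listed above. The function name is hypothetical.

```python
def dot(a, b):
    """Dot product of two equal-length sequences."""
    return sum(p * q for p, q in zip(a, b))


def stabilization_vector(x, y):
    """Return (a, b1, b2) for a 4-node quadrilateral with nodal coordinates x, y."""
    g = [1.0, -1.0, 1.0, -1.0]   # hourglass pattern
    b1 = [0.5 * (y[1] - y[3]), 0.5 * (y[2] - y[0]),
          0.5 * (y[3] - y[1]), 0.5 * (y[0] - y[2])]
    b2 = [-0.5 * (x[1] - x[3]), -0.5 * (x[2] - x[0]),
          -0.5 * (x[3] - x[1]), -0.5 * (x[0] - x[2])]
    area = dot(x, b1)            # x^T b1, equal to y^T b2
    gx, gy = dot(g, x), dot(g, y)
    # Remove from g the part that a linear field in x and y would excite.
    a = [g[i] - (gx * b1[i] + gy * b2[i]) / area for i in range(4)]
    return a, b1, b2
```

For a counterclockwise node numbering the quantity $\mathbf{x}^T\mathbf{b}_1$ is the area of the quadrilateral. For a parallelogram $\mathbf{g}^T\mathbf{x} = \mathbf{g}^T\mathbf{y} = 0$, so that $\mathbf{a}$ reduces to $\mathbf{g}$.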

Belytschko and Tsay (1983) proposed to define c's in the form

$c_1 = c_w\dfrac{\kappa G t^3}{12(\mathbf{x}^T\mathbf{b}_1)^2}(\mathbf{b}_1^T\mathbf{b}_1 + \mathbf{b}_2^T\mathbf{b}_2)$ , $c_2 = c_\theta\dfrac{E t^3}{192(1 - \nu^2)\,\mathbf{x}^T\mathbf{b}_1}(\mathbf{b}_1^T\mathbf{b}_1 + \mathbf{b}_2^T\mathbf{b}_2)$ ,

where $c_w$, $c_\theta$ are universal constants independent of the shape and material
properties of the element. The range of $c_w$ is between 0.03 and 0.1. The solution
is generally insensitive to $c_\theta$, which can have a value between 0.001 and 10. The
first kinematic mode ($\mathbf{w} = \mathbf{g}$), controlled by $c_1$, is the most sensitive.


Remarks. Development of shell elements is conceptually straightforward
but practically complicated. The advent of robust simple flat plate elements
has made modeling of complex shell structures a routine matter. The trend
is toward using shell elements deduced directly from solids. Locking is among
the most perplexing problems in applications to general thin shells. It is
less of a problem for higher order elements, except for the fully integrated
8-node quadrilaterals. General techniques for locking mitigation include
reduced integration, mixed and hybrid formulations, addition of drilling
freedoms, etc. Reduced integration is very popular because it is simple to
implement and offers substantial savings in computer time from the reduced
number of integration operations. Locking removal introduces rank deficiency
(zero energy modes), which requires stabilization, especially for dynamic
simulations. Research is continuing in the area of locking and rank deficiency
removal. A simple robust (does not lock, passes the patch test, has no
hourglass mode, requires the least number of multiplications, etc.) general
shell element is still needed.

In practical applications, the inclination is to use simple elements,
especially for dynamic simulations involving large deformation such as metal
forming and crashworthiness studies. Choice of element must balance the
computational effort against the desired accuracy of the results. Four-node
stabilized quadrilateral elements with one point integration are among the
most popular.


FINITE ELEMENT MODELING OF NONLINEAR
ELASTICITY, VISCOELASTICITY,
PLASTICITY, VISCOPLASTICITY AND CREEP


Most of the accomplishments of the classical mechanics are based on the 
principle of superposition, which is rooted in the linearity of the governing 
equations. Every branch of mechanics is exploring beyond the linear world 
to examine the effects of nonlinearity. Nonlinear dynamics of the simple 
Duffing’s equation describing the motion of a single particle opened our eyes 
to the richness of the field. The theory of turbulence in fluid mechanics tells 
us how deep, controversial, and refractory to research efforts the subject 
can be. In solid mechanics, the following topics are awaiting attention:
large deformation, nonlinear constitutive equations, nonlinear boundary 
conditions, mechanical properties of living tissues in biomechanics, new 
materials of extreme complexity in structural composition, materials in 
micromachines and nanotechnology, living cells and tissues that cannot 
be identified as continua in the mathematical sense, micromechanics of 
heat transfer with large heat flux that occur in computers and electronic 
communication hardware, and so on. When mechanics has to deal with 
photonics, DNA, proteins, enzymes, some axioms of the current mechanics 
have to be changed. Every new change of an axiom leads to a new scientific 
field. Obviously, nonlinear mechanics is the mechanics of the future. It is 
obvious also that computational methods have an important role to play. 

In Chapter 17, it is proposed that the incremental approach is an ef¬ 
ficient way to deal with some nonlinear problems in solid mechanics. In 
Chapters 18, 19, and 20, the finite element method is singled out for a 
more detailed discussion in its applications to linear problems. In this final 
chapter, we discuss the application of the finite element method to nonlinear
problems in solid mechanics including large deformation, viscoelasticity,
plasticity, viscoplasticity, and creep. The purpose is to see some
computational features that arise because of nonlinearity.

In the incremental approach, small incremental steps are used to linearize
the governing equations to be solved by the finite element method. The
final solution may involve iterative processes. The computational tools
need extreme care when they are used in unknown territories. One of the 
distinct characteristics of nonlinear problems is that the solution may not be 
unique. In the nonuniqueness, there lies much of nature’s secret. Examples 
in solid mechanics are the buckling of thin shells, self-equilibrating residual 
stresses and strains, three-dimensional solutions in bodies with apparently 
two-dimensional boundary conditions, and many problems in plasticity. 

21.1. UPDATED LAGRANGIAN SOLUTION FOR LARGE 
DEFORMATION 

In Chapter 17, it is shown that the updated Lagrangian approach for 
large deformation problems leads to a set of linear equations for the stress 

° o* ° o 

increment rates T, r , S, r, the deformation rates D, and/or the displace¬ 
ment rate v, depending on the specific formulation. In this section, and 
this section only, we follow the derivation and notations of Chapter 17, and
use boldfaced letters as tensors rather than matrices and follow the tensor
operation rules, unless specifically stated otherwise. Note that we use the
lower case subscript or superscript n to denote the current configuration.
In solving for the rates, the information needed includes the current
values of the Cauchy stress $\boldsymbol{\tau}\,(= \boldsymbol{\tau}_n)$, the increment rate of boundary traction
$\overset{\circ}{\mathbf{T}}$ on $S_{\sigma n}$, the increment rate of boundary displacement $\bar{\mathbf{v}}$ on $S_{un}$, the
deformed geometry of the body in $D_n$, and the constitutive law that
relates the desired stress rates to the deformation rates. The constitutive law
generally involves the total deformation gradient $\mathbf{F}^n$ and the current Cauchy
stress $\boldsymbol{\tau}$.

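The finite element equations can be based on the integral formulation
outlined in Chapter 18. Consider the purely kinematic variational
principle of Eq. (17.6:12) in terms of the rate-potential $\overset{\circ}{W}$ defined in
Eq. (17.5:25):

(1) $\Pi_3 = \sum_{\text{all elements}}(\Pi_3)_e$ ,

where

(2) $(\Pi_3)_e = \int_{V_{en}}\left[\overset{\circ}{W}(\Delta\mathbf{e}) + \dfrac12\boldsymbol{\tau}:(\Delta\mathbf{L}\cdot\Delta\mathbf{L}^T) - \rho_n\Delta\mathbf{b}\cdot\Delta\mathbf{u}\right]dV - \int_{S_{\sigma n}}\Delta\bar{\mathbf{T}}\cdot\Delta\mathbf{u}\,dA$ ,

in which

$\Delta\mathbf{u} = \mathbf{v}\Delta t$ , $\Delta\mathbf{e} = (\nabla_n\Delta\mathbf{u})_s$ ,

$\Delta\mathbf{L} = \nabla_n\Delta\mathbf{u}$ , $\Delta\mathbf{b} = \overset{\circ}{\mathbf{b}}\Delta t$ , $\Delta\bar{\mathbf{T}} = \overset{\circ}{\mathbf{T}}\Delta t$ ,

with $\Delta\mathbf{u}$ being the displacement increment. The prescribed pressure and
dead-weight types of boundary traction increment rate $\overset{\circ}{\mathbf{T}}$ are given in
Eqs. (17.10:2) and (17.10:5), respectively. The prescribed body force
increment is given in Eq. (17.10:6). The compatibility requirements are that
$\Delta\mathbf{u}$ is continuous and equal to $\Delta\bar{\mathbf{u}}$ on $S_{un}$ as a rigid constraint.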

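Assuming $\Delta\mathbf{u}$ in the form

$\Delta\mathbf{u} = \mathbf{h}\Delta\mathbf{q}$

with $\mathbf{h}$ and $\Delta\mathbf{q}$ being the interpolation matrix and the parameter matrix,
respectively, we can derive the element stiffness and force matrices from
Eq. (2). The incremental finite element equations are obtained from the
stationary condition of $\Pi_3$ with respect to $\Delta\mathbf{q}$, i.e.,

(3) $\sum_{\text{all elements}}\left\{\int_{V_{en}}\left[\delta\overset{\circ}{W}(\Delta\mathbf{e}) + \boldsymbol{\tau}:(\Delta\mathbf{L}\cdot\delta\Delta\mathbf{L}^T) - \rho_n\Delta\mathbf{b}\cdot\delta\Delta\mathbf{u}\right]dV - \int_{S_{\sigma n}}\Delta\bar{\mathbf{T}}\cdot\delta\Delta\mathbf{u}\,dA\right\} = 0$ ,

subject to the rigid constraint $\delta\Delta\mathbf{q} = 0$ over the surface where
displacements are prescribed. Oden (1967) gave the early finite element solutions
for large deformation of incompressible materials.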

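From the finite element solution, we can calculate the 2nd Piola-Kirchhoff
stress increment in the updated Lagrangian description [Eq. (17.5:10)]

(4) $\Delta\mathbf{S} = \dfrac{\partial\overset{\circ}{W}}{\partial\Delta\mathbf{e}}$ ,

and the Cauchy stress increment [Eq. (17.9:13)]

(5) $\Delta\boldsymbol{\tau} = \Delta\mathbf{S} + \Delta\mathbf{L}\cdot\boldsymbol{\tau} + \boldsymbol{\tau}\cdot\Delta\mathbf{L}^T - \mathrm{tr}(\Delta\mathbf{e})\boldsymbol{\tau}$ .

Note that $\Delta\mathbf{S}$ equals the Truesdell rate of the Cauchy stress. The updated
displacements, coordinates and Cauchy stresses are

(6) $\mathbf{u}_{n+1} = \mathbf{u}_n + \Delta\mathbf{u}$ , $\mathbf{x}_{n+1} = \mathbf{x}_n + \Delta\mathbf{u} = \mathbf{x}_0 + \mathbf{u}_{n+1}$ ,

$\boldsymbol{\tau}_{n+1} = \boldsymbol{\tau} + \Delta\boldsymbol{\tau}$ .

Here $\mathbf{u}_n$ is the total displacement vector at the nth state.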
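
The stress update of Eq. (5) is a short sequence of 3 x 3 matrix operations. The following sketch, with hypothetical function names and tensors stored as nested lists, forms the Cauchy stress increment from a given $\Delta\mathbf{S}$, displacement gradient increment $\Delta\mathbf{L}$ and current stress $\boldsymbol{\tau}$.

```python
def matmul(A, B):
    """Product of two 3 x 3 matrices stored as nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def transpose(A):
    """Transpose of a 3 x 3 matrix."""
    return [[A[j][i] for j in range(3)] for i in range(3)]


def cauchy_stress_increment(dS, dL, tau):
    """Eq. (5): d_tau = dS + dL.tau + tau.dL^T - tr(de) tau."""
    # tr(de) equals tr(dL) because de is the symmetric part of dL.
    tr_de = dL[0][0] + dL[1][1] + dL[2][2]
    L_tau = matmul(dL, tau)
    tau_Lt = matmul(tau, transpose(dL))
    return [[dS[i][j] + L_tau[i][j] + tau_Lt[i][j] - tr_de * tau[i][j]
             for j in range(3)] for i in range(3)]
```

For a pure rotation increment ($\Delta\mathbf{L}$ skew-symmetric, $\Delta\mathbf{S} = 0$) the update only rotates the existing stress: a uniaxial stress of unit magnitude along x, rotated by a small angle w about z, acquires shear components equal to w and no first order change in the normal components.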

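One may include the rotation increment $\Delta\boldsymbol{\Omega}$ as an independent field by
modifying Eq. (2) as [Eq. (17.6:13)],

(7) $(\Pi_3)_e = \int_{V_{en}}\left\{\overset{\circ}{W}(\Delta\mathbf{e}) + \dfrac12\boldsymbol{\tau}:(\Delta\mathbf{L}\cdot\Delta\mathbf{L}^T) + c\,[(\nabla_n\Delta\mathbf{u})_a - \Delta\boldsymbol{\Omega}]^2 - \rho_n\Delta\mathbf{b}\cdot\Delta\mathbf{u}\right\}dV - \int_{S_{\sigma n}}\Delta\bar{\mathbf{T}}\cdot\Delta\mathbf{u}\,dA$ ,

where c is a given positive constant of the order of the elastic modulus, and

$(\nabla_n\Delta\mathbf{u})_a = [\nabla_n\Delta\mathbf{u} - (\nabla_n\Delta\mathbf{u})^T]/2$ ,

$[(\nabla_n\Delta\mathbf{u})_a - \Delta\boldsymbol{\Omega}]^2 = [(\nabla_n\Delta\mathbf{u})_a - \Delta\boldsymbol{\Omega}]:[(\nabla_n\Delta\mathbf{u})_a - \Delta\boldsymbol{\Omega}]$ .

Equation (7) is equivalent to the penalty function approach in enforcing

(8) $\Delta\boldsymbol{\Omega} = (\nabla_n\Delta\mathbf{u})_a$ .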


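If we express the functional in the form

(9) $(\Pi_3)_e = \int_{V_{en}}\left[P(\Delta\mathbf{e}) + \dfrac12\boldsymbol{\tau}:(\Delta\mathbf{L}\cdot\Delta\mathbf{L}^T - \Delta\mathbf{e}\cdot\Delta\mathbf{e}) - \rho_n\Delta\mathbf{b}\cdot\Delta\mathbf{u}\right]dV - \int_{S_{\sigma n}}\Delta\bar{\mathbf{T}}\cdot\Delta\mathbf{u}\,dA$ ,

then [Eq. (17.5:10)]

(10) $\Delta\boldsymbol{\sigma}^r = \dfrac{\partial P}{\partial\Delta\mathbf{e}}$

gives the increment of the corotational stress, which is the Jaumann rate of
the Cauchy stress. From Eq. (17.9:12), we have

(11) $\Delta\boldsymbol{\tau} = \Delta\boldsymbol{\sigma}^r + \Delta\boldsymbol{\Omega}\cdot\boldsymbol{\tau} - \boldsymbol{\tau}\cdot\Delta\boldsymbol{\Omega}$

for updating the Cauchy stress.

To improve objectivity in computation, we use the midpoint rule of
Eqs. (17.9:6) and (17.9:9), that is,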

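$\nabla_{N+1/2} = \dfrac{\partial}{\partial\mathbf{x}_{N+1/2}}$ ,

$\Delta\mathbf{L} = \nabla_{N+1/2}\Delta\mathbf{u}$ ,

$\Delta\mathbf{e} = \dfrac12[\nabla_{N+1/2}\Delta\mathbf{u} + (\nabla_{N+1/2}\Delta\mathbf{u})^T]$ ,

$\Delta\boldsymbol{\Omega} = \dfrac12[\nabla_{N+1/2}\Delta\mathbf{u} - (\nabla_{N+1/2}\Delta\mathbf{u})^T]$ .

One also updates the total deformation gradient, the related deformation
matrices, and the rate-potential using the midpoint rule, e.g., Eq. (17.9:8)
for $\mathbf{F}$ and Eq. (17.9:7) or (17.9:11) for $\mathbf{R}$. For the formulation of Eq. (7),
we use $\Delta\boldsymbol{\Omega}$ directly from the finite element solution in Eq. (11).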

21.2. INCREMENTAL SOLUTION 

Assume that the body force and the prescribed surface traction are in
the form 1

     $\mathbf{b} = p\,\mathbf{b}_0$,  $\mathbf{T} = p\,\mathbf{T}_0$,

where p is a loading parameter. Let $\mathbf{q}_n$ be the current solution
corresponding to $p_n$. The assembled and constrained incremental finite element
equation derived from Eq. (21.1:3) can be written in the form

(1)  $\mathbf{K}\,\Delta\mathbf{q} = \Delta\mathbf{F} = (p_{n+1} - p_n)\,\mathbf{F}_0 = \Delta p\,\mathbf{F}_0$,

where K and $\Delta\mathbf{F}$, which can be functions of $\mathbf{q}_n$, are the assembled and
constrained stiffness matrix and the load increment matrix, respectively.
The updated q is

     $\mathbf{q}_{n+1} = \mathbf{q}_n + \Delta\mathbf{q}$.

Equation (1) is like a first order ordinary differential equation in p, and can
be solved by integration. A more accurate $\Delta\mathbf{q}$ (Pian and Tong 1971) can be
obtained by the predictor-corrector integration method (Ralston 1965).
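As an illustration of integrating Eq. (1) with respect to p, the following sketch compares a plain Euler step with a two-stage predictor-corrector step. The particular corrector (Heun's rule), the one-degree-of-freedom hardening spring with internal force $q + q^3$, and the names `tangent_stiffness` and `integrate_load` are assumptions made for this example only; they are not taken from the cited references.

```python
import numpy as np

def tangent_stiffness(q):
    # Hypothetical one-degree-of-freedom hardening spring: internal force q + q**3.
    return np.array([[1.0 + 3.0 * q[0] ** 2]])

def integrate_load(K, F0, q0, p_end, steps, corrector=True):
    """Integrate K(q) dq = dp F0 from p = 0 to p_end in equal load steps."""
    q = np.array(q0, dtype=float)
    dp = p_end / steps
    for _ in range(steps):
        # Predictor: Eq. (1) with K evaluated at the current state.
        dq_pred = np.linalg.solve(K(q), dp * F0)
        if corrector:
            # Corrector: average the increments from the start and predicted end states.
            dq_end = np.linalg.solve(K(q + dq_pred), dp * F0)
            dq = 0.5 * (dq_pred + dq_end)
        else:
            dq = dq_pred
        q = q + dq
    return q

F0 = np.array([2.0])   # at p = 1 the exact solution of q + q**3 = 2 is q = 1
q_euler = integrate_load(tangent_stiffness, F0, [0.0], 1.0, 10, corrector=False)
q_heun = integrate_load(tangent_stiffness, F0, [0.0], 1.0, 10, corrector=True)
```

With ten load steps the corrected solution lies noticeably closer to the exact value q = 1 than the uncorrected one, at the cost of one additional stiffness evaluation and solve per step.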

If K becomes singular at any solution stage, say at $p = p_c$, then the
numerical integration with respect to p cannot proceed further. Near

1 A more general form of the external load is

     $\mathbf{b} = p\,c_1 \mathbf{b}_0 + \mathbf{b}'$,  $\mathbf{T} = p\,c_2 \mathbf{T}_0 + \mathbf{T}'$,

where $c_1$, $c_2$ are given constants and $\mathbf{b}'$, $\mathbf{T}'$ are given preloads. This allows the body
force and the surface traction to increase at different rates.
$p_c$, a small increment in p can lead to a large increment in q. Thus, near
$p_c$, it is better to carry out the integration with respect to a parameter
associated with deformation (Pian and Tong 1971, Zienkiewicz 1971, Bergan
1981). For example, let the new parameter be $q_r$, a component of q that
changes rapidly with respect to p. Partitioning Eq. (1) in the form

     $\begin{bmatrix} \mathbf{K}_{11} & \mathbf{K}_{12} \\ \mathbf{K}_{21} & K_{22} \end{bmatrix} \begin{Bmatrix} \Delta\mathbf{q}' \\ \Delta q_r \end{Bmatrix} = \Delta p \begin{Bmatrix} \mathbf{F}' \\ F_r \end{Bmatrix}$,

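we can rewrite Eq. (1) as

(2)  $\begin{bmatrix} \mathbf{K}_{11} & -\mathbf{F}' \\ \mathbf{K}_{21} & -F_r \end{bmatrix} \begin{Bmatrix} \Delta\mathbf{q}'/\Delta q_r \\ \Delta p/\Delta q_r \end{Bmatrix} = \begin{Bmatrix} -\mathbf{K}_{12} \\ -K_{22} \end{Bmatrix}$,

where $\mathbf{K}_{11}$ is nonsingular and symmetric. Equation (2) can now be
integrated with respect to the new parameter $q_r$ (set the sign of $\Delta q_r$ the same
as that of the previous step) to determine $\Delta\mathbf{q}'$ and $\Delta p$.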

Let $q_r = (q_r)_c$ at $p = p_c$. If $\Delta p/\Delta q_r$ changes sign at $(q_r)_c$, no solution
for p exists beyond $p_c$ in the vicinity of $(q_r)_c$, while multiple solutions exist
below $p_c$. Thus, an infinitesimal increment in p beyond $p_c$ can lead to a
finite increment in $q_r$ or the collapse of the structure. This is a phenomenon
of nonuniqueness in the solution due to nonlinearity. The structure is said
to be unstable, and the phenomenon is called buckling (Fung and Sechler 1960,
Timoshenko and Gere 1961). On the other hand, if $\Delta p/\Delta q_r$ is a monotonic
function of $q_r$, the structure is stable and can sustain load beyond $p_c$. This
is called stable buckling when $q_r$ changes rapidly in the vicinity of $p_c$.

The coefficient matrix of $[\Delta\mathbf{q}'/\Delta q_r \;\; \Delta p/\Delta q_r]^T$ is in general
asymmetric. We can, however, solve Eq. (2) as follows to avoid dealing with the
asymmetric matrix: let


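(3)  $\dfrac{\Delta\mathbf{q}'}{\Delta q_r} = \dfrac{\Delta\mathbf{q}'_1}{\Delta q_r} + \dfrac{\Delta p}{\Delta q_r}\,\dfrac{\Delta\mathbf{q}'_2}{\Delta q_r}$,

where

(4)  $\mathbf{K}_{11}\,\dfrac{\Delta\mathbf{q}'_1}{\Delta q_r} = -\mathbf{K}_{21}^T$,

(5)  $\mathbf{K}_{11}\,\dfrac{\Delta\mathbf{q}'_2}{\Delta q_r} = \mathbf{F}'$.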


A substitution of Eq. (3) into the second row of Eq. (2) yields

     $\dfrac{\Delta p}{\Delta q_r} = \dfrac{K_{22} + \mathbf{K}_{21}\,(\Delta\mathbf{q}'_1/\Delta q_r)}{F_r - \mathbf{K}_{21}\,(\Delta\mathbf{q}'_2/\Delta q_r)}$.

Thus we can obtain the rate of change with respect to the new parameter
$q_r$ without having to deal with an asymmetric matrix.
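The two symmetric solves of Eqs. (4) and (5) can be sketched as follows. The function name `displacement_control_rates`, the dense `numpy` solve, and the small 3 x 3 test matrix are assumptions for illustration; a production code would reuse one factorization of $\mathbf{K}_{11}$ for both right-hand sides.

```python
import numpy as np

def displacement_control_rates(K, F, r):
    """Return (dq'/dq_r, dp/dq_r) using only solves with the symmetric block K11."""
    n = K.shape[0]
    rest = [i for i in range(n) if i != r]
    K11 = K[np.ix_(rest, rest)]
    K12 = K[rest, r]              # equals K21^T for a symmetric K
    K21 = K[r, rest]
    K22 = K[r, r]
    dq1 = np.linalg.solve(K11, -K12)       # Eq. (4)
    dq2 = np.linalg.solve(K11, F[rest])    # Eq. (5)
    # Second row of Eq. (2) after substituting Eq. (3).
    dp_dqr = (K22 + K21 @ dq1) / (F[r] - K21 @ dq2)
    dq_dqr = dq1 + dp_dqr * dq2            # Eq. (3)
    return dq_dqr, dp_dqr

# Hypothetical symmetric stiffness matrix and load pattern.
K = np.array([[4.0, 1.0, 0.5],
              [1.0, 3.0, 0.2],
              [0.5, 0.2, 2.0]])
F0 = np.array([1.0, 2.0, 3.0])
r = 2
step = 0.1                                  # prescribed increment of q_r
dq_dqr, dp_dqr = displacement_control_rates(K, F0, r)
dq = np.empty(3)
dq[[0, 1]] = dq_dqr * step
dq[r] = step
dp = dp_dqr * step
```

The increments `dq` and `dp` obtained this way satisfy the original Eq. (1), which gives a quick consistency check on the partitioning.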


21.3. DYNAMIC SOLUTION 

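For dynamic simulation, we do not have to solve for $\Delta\mathbf{u}$ or $\Delta\mathbf{q}$ of
Eq. (21.2:1) if an explicit time integration scheme is used. We can derive
the equations of motion using the principle of virtual work discussed
in Sec. 18.17, i.e.,

(1)  $\delta\Pi = \int_{V_{n+1}} [\delta\mathbf{u}^T \ddot{\mathbf{u}}\,\rho_{n+1} + (\delta\mathbf{e})^T \boldsymbol{\tau}_{n+1} - \rho_{n+1}\,\delta\mathbf{u}^T \mathbf{b}_{n+1}]\, dV - \int_{(\partial V_\sigma)_{n+1}} \delta\mathbf{u}^T \mathbf{T}_{n+1}\, dS = 0$,

in which

(2)  $\boldsymbol{\tau}_{n+1} = [\tau_{11}\ \tau_{22}\ \cdots\ \tau_{31}]^T_{n+1}$,  $\delta\mathbf{u} = \mathbf{h}\,\delta\mathbf{q}$,  $\delta\mathbf{e} = \mathbf{B}\,\delta\mathbf{q}$,

where $\boldsymbol{\tau}_{n+1}$ is the updated Cauchy stress given in Eq. (21.1:6), $\delta\mathbf{e}$ is a
strain variation as defined in Eq. (18.16:5), and h and B are interpolation
matrices relating the parameter $\delta\mathbf{q}$ to the displacement and strain variations
as defined in Eq. (18.16:12). The boundary force is calculated from the
surface integral in Eq. (1). For pressure surface load

(3)  $\mathbf{T}_{n+1} = p_{n+1}\,\mathbf{N}_{n+1}$,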


and for dead-weight load

(4)  $\mathbf{T}_{n+1} = (\mathbf{T}_0)_{n+1}\,\dfrac{dS_0}{dS_{n+1}} = (\mathbf{T}_0)_{n+1}\,\dfrac{1}{J}\left(\sqrt{\mathbf{N}^T \mathbf{F}\mathbf{F}^T \mathbf{N}}\right)_{n+1}$,

where $p_{n+1}$ is the prescribed pressure referred to the deformed surface in
the $D_{n+1}$ configuration, and $(\mathbf{T}_0)_{n+1}$ is the prescribed traction at the (n+1)th
stage referred to the undeformed surface. The element mass matrix is
derived from the first term in the first integral:

(5)  $\mathbf{m}_{n+1}\ddot{\mathbf{q}} = \int_{V_{n+1}} \rho_{n+1}\,\mathbf{h}^T \mathbf{h}\, dV\; \ddot{\mathbf{q}}$,

which is in the same form as that for small deformation discussed in
Sec. 18.17 except that the integration in Eq. (5) is over the deformed
element volume. In practice, one often uses the lumped mass matrix obtained by
summing all terms of each row to the diagonal.
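The row-sum lumping can be illustrated in a few lines. The element used here, a two-node bar with the standard consistent mass matrix $(\rho A L/6)[2\ 1;\ 1\ 2]$, and the numerical values are assumptions chosen only to keep the example small.

```python
import numpy as np

def lump_by_row_sum(m):
    """Return the diagonal mass matrix whose entries are the row sums of m."""
    return np.diag(m.sum(axis=1))

# Hypothetical two-node bar element: density, cross-sectional area, length.
rho, area, length = 7800.0, 1.0e-4, 0.5
m_consistent = rho * area * length / 6.0 * np.array([[2.0, 1.0],
                                                     [1.0, 2.0]])
m_lumped = lump_by_row_sum(m_consistent)
```

Each node receives half of the element mass, and the total mass is unchanged. A diagonal mass matrix makes the acceleration in an explicit scheme available by simple division rather than by solving a linear system.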
From Eq. (1), one can calculate the acceleration at stage n+1. Using an
explicit integration scheme, one determines $\Delta\mathbf{q}$ (i.e., $\Delta\mathbf{u}$) at the (n+1)th
stage, then the increment $\Delta\boldsymbol{\tau}$ from Eq. (21.1:5) or (21.1:11) and the updated
solution from Eq. (21.1:6), and then proceeds to the next stage.

21.4. NEWTON-RAPHSON ITERATION METHOD 

The Newton-Raphson method (Ralston 1965) is one of the most commonly
used methods for solving nonlinear algebraic equations. The method
can be used to solve the nonlinear finite element equations of plasticity,
viscoplasticity and creep. Almost all solution methods for nonlinear problems
involve some sort of iteration, and the Newton-Raphson method is no
exception. The process is illustrated in Fig. 21.4:1. Consider a set of
nonlinear algebraic equations in the form

(1)  $\mathbf{R}(\mathbf{q}) = \boldsymbol{\gamma}(\mathbf{q}) - \mathbf{F} = 0$,

where $\boldsymbol{\gamma}$ is a nonlinear matrix function of q and F is a loading matrix
normally independent of q. Assuming that the solution $\mathbf{q}_n$ at $\mathbf{F} = \mathbf{F}_n$
is known, one seeks the solution $\mathbf{q}_{n+1}$ at $\mathbf{F} = \mathbf{F}_{n+1} = \mathbf{F}_n + \Delta\mathbf{F}_n$. With
$\mathbf{q}^i_{n+1}$ a known approximate solution for $\mathbf{q}_{n+1}$ at the ith iteration, one



Fig. 21.4:1. Newton-Raphson method. 
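improves the approximation in the form $\mathbf{q}^{i+1}_{n+1} = \mathbf{q}^i_{n+1} + d\mathbf{q}^i$. To the first
order approximation, Eq. (1) can be written as

(2)  $\mathbf{K}^i_T\, d\mathbf{q}^i + \mathbf{R}^i_n = 0$,

where the tangent stiffness matrix $\mathbf{K}^i_T$ and the residual matrix $\mathbf{R}^i_n$ are
defined as

     $\mathbf{K}^i_T = \left.\dfrac{\partial \boldsymbol{\gamma}}{\partial \mathbf{q}}\right|_{\mathbf{q} = \mathbf{q}^i_{n+1}}$,  $\mathbf{R}^i_n = \boldsymbol{\gamma}(\mathbf{q}^i_{n+1}) - \boldsymbol{\gamma}(\mathbf{q}_n) - \Delta\mathbf{F}_n = \boldsymbol{\gamma}(\mathbf{q}^i_{n+1}) - \mathbf{F}_n - \Delta\mathbf{F}_n$.

One repeats the process until

     $(\mathbf{R}^i_n)^T \mathbf{R}^i_n < \epsilon_2\,(\Delta\mathbf{F}_n)^T \Delta\mathbf{F}_n$,  (i, n not summed)

where $\epsilon_2$ is a chosen small constant to control the accuracy of the solution
for $\mathbf{q}_{n+1}$ and $(\mathbf{R}^i_n)^T \mathbf{R}^i_n$ is the inner product of the residual with itself,
usually called the square norm.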

There are variants of the Newton-Raphson method. The most common
one is the modified Newton-Raphson method, which uses a fixed $\mathbf{K}^i_T$ for
several iterations without update. Solving for $d\mathbf{q}^i$ in Eq. (2) requires
refactorization of the matrix (see Sec. 18.5) if $\mathbf{K}^i_T$ is updated. Factorization
is a time-consuming process. Updating less frequently saves computing
time. One can even use the same $\mathbf{K}_T$ throughout the entire iteration process of
determining $\mathbf{q}_{n+1}$. In this case, $\mathbf{K}_T$ is factored only once. Such a procedure is
likely to converge at a slower rate, but the computer time required to reach
a converged solution may actually be reduced.
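The iteration of Eq. (2), its convergence test, and the modified variant can be sketched together as below. The function `newton_raphson`, the parameter `refactor_every`, and the two-degree-of-freedom cubic spring system are hypothetical choices for illustration; the dense solve stands in for the stored factorization that a finite element code would reuse.

```python
import numpy as np

def newton_raphson(gamma, tangent, q_n, F_n, dF, eps2=1e-16,
                   refactor_every=1, max_iter=200):
    """Solve gamma(q) = F_n + dF starting from q_n.

    refactor_every=1 gives the full method; a very large value keeps the
    first tangent matrix throughout (modified Newton-Raphson).
    """
    q = np.array(q_n, dtype=float)
    K = None
    for i in range(max_iter):
        R = gamma(q) - F_n - dF                  # residual of Eq. (2)
        if R @ R < eps2 * (dF @ dF):             # square-norm convergence test
            return q, i
        if i % refactor_every == 0:
            K = tangent(q)                       # update (and refactor) K_T
        q = q + np.linalg.solve(K, -R)
    raise RuntimeError("Newton-Raphson iteration did not converge")

# Hypothetical system: linear springs plus a cubic hardening term.
A = np.array([[2.0, -1.0], [-1.0, 2.0]])
gamma = lambda q: A @ q + q ** 3
tangent = lambda q: A + np.diag(3.0 * q ** 2)
zero = np.zeros(2)
dF = np.array([0.2, 0.1])

q_full, it_full = newton_raphson(gamma, tangent, zero, zero, dF)
q_mod, it_mod = newton_raphson(gamma, tangent, zero, zero, dF,
                               refactor_every=10**9)
```

Both calls reach the same solution; the modified version needs more iterations but evaluates the tangent matrix only once, which is the trade-off described above.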

A linear search procedure can be used to accelerate the convergence of
the Newton-Raphson method. A schematic of the procedure is shown
in Fig. 21.4:2. Instead of using $\mathbf{q}^i_{n+1} + d\mathbf{q}^i$, we define

     $\mathbf{q}^{i+1}_{n+1} = \mathbf{q}^i_{n+1} + (1 + \eta)\, d\mathbf{q}^i$

as an updated solution. The quantity $\eta$ is determined by requiring the
projection of the residual on the search direction $d\mathbf{q}^i$ to be zero, i.e.,

     $G_i(\eta) = (d\mathbf{q}^i)^T \mathbf{R}[\mathbf{q}^i_{n+1} + (1 + \eta)\, d\mathbf{q}^i]$

     $\qquad = (d\mathbf{q}^i)^T \{\boldsymbol{\gamma}[\mathbf{q}^i_{n+1} + (1 + \eta)\, d\mathbf{q}^i] - \mathbf{F}_{n+1}\} = 0$  (i not summed).
Fig. 21.4:2. Linear search schematic: (a) interpolation, (b) extrapolation.

The solution for $\eta$ can be obtained by iteration with $\eta = 0$ as the initial
approximation. The linear search needs additional evaluations of the residual
R. One normally undertakes the search only if

     $|G(\eta)| > \epsilon_3\, |(d\mathbf{q}^i)^T [\boldsymbol{\gamma}(\mathbf{q}^i_{n+1}) - \mathbf{F}_{n+1}]|$,

where $\epsilon_3$ is a number close to 0.5 (Matthies and Strang 1979).
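A minimal sketch of the search for $\eta$ follows. The text only says that $\eta$ is found by iteration; the secant iteration used here, the function name `line_search_eta`, and the scalar test residual $R(q) = \arctan q$ (for which a full Newton step overshoots) are assumptions made for this example.

```python
import numpy as np

def line_search_eta(residual, q, dq, eps3=0.5, tol=1e-10, max_iter=50):
    """Find eta such that dq . R(q + (1 + eta) dq) = 0 by secant iteration."""
    G = lambda eta: dq @ residual(q + (1.0 + eta) * dq)
    g_start = dq @ residual(q)          # projection at the start of the step
    g_ref = abs(g_start)
    g0 = G(0.0)                         # projection after the full Newton step
    if abs(g0) <= eps3 * g_ref:
        return 0.0                      # full step is good enough; skip the search
    # eta = -1 corresponds to the start of the step, eta = 0 to the full step.
    eta_prev, g_prev = -1.0, g_start
    eta, g = 0.0, g0
    for _ in range(max_iter):
        if abs(g) <= tol * g_ref or g == g_prev:
            break
        eta_next = eta - g * (eta - eta_prev) / (g - g_prev)
        eta_prev, g_prev = eta, g
        eta, g = eta_next, G(eta_next)
    return eta

# Hypothetical scalar problem: R(q) = arctan(q), tangent 1 / (1 + q**2).
residual = lambda q: np.arctan(q)
q = np.array([1.5])
dq = -residual(q) * (1.0 + q ** 2)      # Newton step, which overshoots the root
eta = line_search_eta(residual, q, dq)
q_new = q + (1.0 + eta) * dq
```

In this example the full step would land on the far side of the root, the search is triggered, and a negative $\eta$ shortens the step (the interpolation case of Fig. 21.4:2).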

In many problems, for a loading in the form

     $\mathbf{F} = p\,\mathbf{F}_0$,  $\Delta\mathbf{F}_n = (p_{n+1} - p_n)\,\mathbf{F}_0 = \Delta p_n\,\mathbf{F}_0$,

no solution exists above a certain maximum value of p. When this occurs,
one can recast the formulation in terms of the increment of a single displacement
component as shown in Sec. 21.2.

21.5. VISCOELASTICITY 

Viscoelasticity is characterized by a constitutive law in which the current
state of stress depends not only on the current strains, but also on the full
history of their development as described in Chapter 15. In the present
study, we assume the constitutive equation in the form

(1)  $\boldsymbol{\sigma} = \mathbf{D}_r(t)\,\mathbf{e}(0^+) + \int_0^t \mathbf{D}_r(t - \tau)\,\dfrac{\partial \mathbf{e}}{\partial \tau}\, d\tau$,

as given in Eq. (15.1:6), where $\boldsymbol{\sigma}$, $\mathbf{e}$, $\mathbf{D}_r(t)$ are the stress, strain, and relaxation
matrices, respectively. If the material is inhomogeneous, $\mathbf{D}_r(t)$ will be a
function of the spatial coordinates also. Furthermore, in some problems,
$\mathbf{D}_r(t)$ may depend on $\boldsymbol{\sigma}$ and/or $\mathbf{e}$. In finite element analysis, we write
Eq. (1) in the incremental form

(2)  $\boldsymbol{\sigma}_{n+1} - \boldsymbol{\sigma}_n = \dot{\boldsymbol{\sigma}}_n \Delta t = \mathbf{D}_r(0)\,\Delta\mathbf{e} + [\dot{\mathbf{D}}_r(t_n)\,\mathbf{e}(0^+) + \mathbf{Q}_n]\,\Delta t$,

where $\boldsymbol{\sigma}_{n+1}$, $\boldsymbol{\sigma}_n$ denote the stresses at $t = t_{n+1}, t_n$, respectively,

(3)  $\Delta\mathbf{e} = \mathbf{e}_{n+1} - \mathbf{e}_n = \mathbf{B}\,\Delta\mathbf{q}$,

(4)  $\mathbf{Q}_n = \int_0^{t_n} \dot{\mathbf{D}}_r(t_n - \tau)\,\dfrac{\partial \mathbf{e}}{\partial \tau}\, d\tau$,

in which B is the strain-displacement matrix relating the generalized
coordinates associated with displacements to strains. In Eq. (2), the first term
on the rightmost side is the elastic solution with $\mathbf{D}_r(0)$ equivalent to the
elastic modulus matrix and the second term is the viscoelastic contribution.

The finite element equation of motion can be written in the form
[Eq. (18.16:22)]

(5)  $\mathbf{R} = \int_V \mathbf{B}^T \boldsymbol{\sigma}\, dV - \mathbf{F} = 0$,

where F is the loading matrix. Substituting Eq. (2) into Eq. (5) and using
the strain-displacement relation $\Delta\mathbf{e} = \mathbf{B}\,\Delta\mathbf{q}$, we find

(6)  $\mathbf{K}^0 \Delta\mathbf{q}_n = \mathbf{R}_n$,

where $\mathbf{K}^0$ is the standard elastic stiffness matrix

(7)  $\mathbf{K}^0 = \int_V \mathbf{B}^T \mathbf{D}_r(0)\,\mathbf{B}\, dV$

and $\mathbf{R}_n$ is the equilibrium residual defined as

(8)  $\mathbf{R}_n = \mathbf{F}_{n+1} - \mathbf{F}_n - \Delta t \int_V \mathbf{B}^T [\dot{\mathbf{D}}_r(t_n)\,\mathbf{e}(0^+) + \mathbf{Q}_n]\, dV$.

One can integrate Eq. (6) with the initial condition $\mathbf{e}(0^+)$ given. The
practical limitation in evaluating Eq. (6) is the requirement of the full history
of e as shown in the integration of Q in Eq. (4). When t becomes large,
this will require extensive computer storage and computation. In practice,
$\mathbf{D}_r$ is often an exponentially decaying function of the form

     $\mathbf{D}_r(t) \propto \exp(-t/\tau_0)$,

where $\tau_0$ is a relaxation constant. Equation (4) can be approximated as

(9)  $\mathbf{Q}_n \approx \int_{t_n - a\tau_0}^{t_n} \dot{\mathbf{D}}_r(t_n - \tau)\,\dfrac{\partial \mathbf{e}}{\partial \tau}\, d\tau$,

where a is a constant of order 1. Then it is only necessary to store the
recent history of e from $t_n - a\tau_0$ to $t_n$.
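The effect of truncating the history integral in Eq. (9) can be checked on a scalar example. The relaxation function $D_r(t) = D_\infty + D_1 \exp(-t/\tau_0)$, the constant strain rate, the trapezoidal quadrature, and all names and numbers below are assumptions for illustration only.

```python
import numpy as np

def history_integral(t_n, strain_rate, d_rate, window=None, n=2001):
    """Trapezoidal estimate of Q_n; `window` limits the stored history length."""
    t_start = 0.0 if window is None else max(0.0, t_n - window)
    tau = np.linspace(t_start, t_n, n)
    f = d_rate(t_n - tau) * strain_rate(tau)
    return float(np.sum(0.5 * (f[1:] + f[:-1]) * np.diff(tau)))

# Hypothetical scalar material: D_r(t) = D_inf + D_1 exp(-t / tau0).
D1, tau0 = 100.0, 2.0
d_rate = lambda t: -(D1 / tau0) * np.exp(-t / tau0)      # time derivative of D_r
rate = 0.01
strain_rate = lambda tau: rate * np.ones_like(tau)       # constant strain rate

t_n, a = 20.0, 5.0
Q_full = history_integral(t_n, strain_rate, d_rate)                    # Eq. (4)
Q_trunc = history_integral(t_n, strain_rate, d_rate, window=a * tau0)  # Eq. (9)
```

For this history the exact values are $-D_1 \dot e\,(1 - e^{-t_n/\tau_0})$ and $-D_1 \dot e\,(1 - e^{-a})$, so the relative truncation error is about $e^{-a}$; the value of a therefore sets the trade-off between storage and accuracy.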

For linear problems, one can proceed to the next time step after $\Delta\mathbf{q}_n$
is determined from Eq. (6). For nonlinear viscoelastic problems, B can be
a function of q. One may have to use the Newton-Raphson iteration to
obtain the updated $\Delta\mathbf{q}_n$.


21.6. PLASTICITY 

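The linear incremental plasticity theory for infinitesimal deformation
described in Chapter 6 is summarized below in both matrix and indicial
notations:

(1)  $\mathbf{e} = \mathbf{e}^e + \mathbf{e}^p$,  $e_{ij} = e^e_{ij} + e^p_{ij}$  (elastic and plastic strain split),

(2)  $\boldsymbol{\tau} = \mathbf{D}^e \mathbf{e}^e = \mathbf{D}^e(\mathbf{e} - \mathbf{e}^p)$,
     $\tau_{ij} = D^e_{ijkl}\, e^e_{kl} = D^e_{ijkl}(e_{kl} - e^p_{kl})$  (elastic constitutive equation),

(3)  $\dot{\mathbf{e}}^p = \dot\lambda\, \dfrac{\partial h}{\partial \boldsymbol{\tau}} = \dot\lambda\, \mathbf{h}_\tau$,  $\dot e^p_{ij} = \dot\lambda\, \dfrac{\partial h}{\partial \tau_{ij}}$  (normality condition flow rule), 2

(4)  $\boldsymbol{\eta} = \boldsymbol{\tau} - \boldsymbol{\alpha}$,  $\eta_{ij} = \tau_{ij} - \alpha_{ij}$,

(5)  $\boldsymbol{\eta}' = \boldsymbol{\tau}' - \boldsymbol{\alpha}'$,
     $\eta'_{ij} = \tau'_{ij} - \alpha'_{ij} = \eta_{ij} - \delta_{ij}\eta_{kk}/3$  (deviatoric components),

(6)  $\dot{\boldsymbol{\alpha}} = \dot\lambda\, \mathbf{g}$,  $\dot\alpha_{ij} = \dot\lambda\, g_{ij}$  (back stresses),

(7)  $\dot{\bar\varepsilon}^p = \dot\lambda \left( \dfrac{2}{3}\, \dfrac{\partial h}{\partial \tau_{ij}} \dfrac{\partial h}{\partial \tau_{ij}} \right)^{1/2}$  (equivalent plastic-strain rate),

(8)  $h = h(\eta_{ij}, \xi_i)$  (yield potential),

(9)  $f(\eta_{ij}, \xi_i) = 0$  (yield surface),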


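2 In Eq. (21.6:3), h is considered to be a function of 6 stress components, i.e., for $i \ne j$,
$\tau_{ij}$, $\tau_{ji}$ are the same variable, leading to $\dot{\mathbf{e}}^p = \dot\lambda\, \partial h/\partial \boldsymbol{\tau}$ in matrix notation, while h
is considered to be a function of 9 stress components, i.e., for $i \ne j$, $\tau_{ij}$, $\tau_{ji}$ are
different variables, leading to $\dot e^p_{ij} = \dot\lambda\, \partial h/\partial \tau_{ij}$ in index notation. Note the following
relation between tensor and matrix operations: let $(\boldsymbol{\tau})_t$, $(\mathbf{e})_t$ be the stress and strain
tensors and $(\boldsymbol{\tau})_m$, $(\mathbf{e})_m$ the stress and strain matrices as defined in Chapter 18, then
$(\boldsymbol{\tau})_t : (\mathbf{e})_t = (\boldsymbol{\tau})_m^T (\mathbf{e})_m = (\mathbf{e})_m^T (\boldsymbol{\tau})_m$.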
(10)  $\dot\lambda = 0$ if $f < 0$  (elastic deformation),
      $\dot\lambda \ge 0$ if $f = 0$  (plastic deformation),
      $f > 0$  (not allowed),

(11) 


Jj 

Ad 

a. 

■<u 

£--0, 

OT k k 


where $\tau_{ij}$, $\alpha_{ij}$, $e_{ij}$ are the stress, back stress, and strain tensors, a primed
quantity $(\cdot)'$ denotes the deviatoric component of the corresponding
quantity, and $\xi_i$ are internal variables. The yield potential h, the yield surface
f and the back-stress function $g_{ij}$ are functions of $\eta_{ij}$ and the internal variables $\xi_i$. The
flow rule is associated if h = f. The internal variables can be the back
stresses, plastic strains and temperature. Equation (10) written as

     $\dot\lambda \ge 0$,  $\dot\lambda f = 0$,

is known as the Kuhn-Tucker form for loading or unloading. In
elastic-plastic analysis, one seeks the solution in terms of displacements and
stresses that satisfy the equations of motion and the boundary conditions
with Eqs. (1)-(11) as constraints.

The plasticity equations are highly nonlinear and can be solved in most
cases only by numerical means. There is a very large literature on the
subject (Wilkins 1964, Krieg and Key 1976, Krieg and Krieg 1977, Ortiz and
Popov 1985, Rice and Tracey 1973, Nagtegaal et al. 1974). To construct
the incremental solution, consider the interval $(t_n, t_{n+1})$, where t is time
for dynamic problems and a loading parameter that characterizes the
magnitude of the applied loads or prescribed displacements for static problems.
For simplicity we will just call it time in the subsequent discussion. At $t_n$
we assume that the total, plastic, and equivalent plastic strains, the stress,
and the back stress, $\mathbf{e}_n$, $\mathbf{e}^p_n$, $\bar\varepsilon^p_n$, $\boldsymbol{\tau}_n$, $\boldsymbol{\alpha}_n$, are known. We first construct the
updated fields $\mathbf{e}_{n+1}$, $\mathbf{e}^p_{n+1}$, ..., at $t_{n+1}$ in terms of the incremental
displacement $\Delta\mathbf{u}$ in a manner consistent with the constraints of Eqs. (3)-(10),
and then proceed to the next increment.

The updating process is to determine $\mathbf{e}^p_{n+1}$, $\bar\varepsilon^p_{n+1}$ and then derive the other
updated fields based on $\mathbf{e}^p_{n+1}$, $\bar\varepsilon^p_{n+1}$. In principle, $\mathbf{e}^p_{n+1}$, $\bar\varepsilon^p_{n+1}$ and $\boldsymbol{\alpha}_{n+1}$ can
be determined by integrating the flow and hardening rules over the step
$(t_n, t_{n+1})$ for a given yield potential h and yield surface f. To illustrate
the update process, we consider an associated von Mises material with
$\bar\varepsilon^p$ as the only internal variable. If the material is isotropic and satisfies a linear
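kinematic and nonlinear isotropic hardening law, then

(12)  $\dot{\boldsymbol{\alpha}} = 2c\dot\lambda\,\boldsymbol{\eta}/3$,

(13)  $h(\boldsymbol{\eta}, \bar\varepsilon^p) = f(\boldsymbol{\eta}, \bar\varepsilon^p) = \frac{1}{2}\boldsymbol{\eta}^T \mathbf{P} \boldsymbol{\eta} - \frac{1}{3}\sigma_Y^2(\bar\varepsilon^p) = 0$,

where $\sigma_Y$ is the yield stress in a unidirectional test, and

(14a)  $\bar\varepsilon^p = \int_0^t \dot{\bar\varepsilon}^p\, dt$,

(14b)  $\mathbf{P} = \dfrac{1}{3}\begin{bmatrix} 2 & -1 & 0 \\ -1 & 2 & 0 \\ 0 & 0 & 6 \end{bmatrix}$  (plane stress),

(14c)  $\mathbf{P} = \begin{bmatrix} \mathbf{P}_1 & 0 \\ 0 & \mathbf{P}_2 \end{bmatrix}$,  $\mathbf{P}_1 = \dfrac{1}{3}\begin{bmatrix} 2 & -1 & -1 \\ -1 & 2 & -1 \\ -1 & -1 & 2 \end{bmatrix}$,  $\mathbf{P}_2 = \begin{bmatrix} 2 & 0 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & 2 \end{bmatrix}$  (3-D, plane strain).

Note that P is the projection matrix that makes $\mathbf{P}\boldsymbol{\eta}$ deviatoric.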

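Following the approach described in Sec. 6.12 and assuming that

     $f(\boldsymbol{\tau}_n + \mathbf{D}^e \Delta\mathbf{e}, \bar\varepsilon^p_n) > 0$,

i.e., yielding will occur, from Eq. (7), we find

(15)  $\dot{\bar\varepsilon}^p = \dot\lambda \sqrt{\tfrac{2}{3}\boldsymbol{\eta}^T \mathbf{P} \boldsymbol{\eta}} = \tfrac{2}{3}\dot\lambda\,\sigma_Y$.

Differentiating Eq. (13) with respect to t gives the consistency equation

(16)  $\boldsymbol{\eta}^T \mathbf{P} \left( \dot{\boldsymbol{\tau}} - \tfrac{2}{3} c \dot\lambda\, \boldsymbol{\eta} \right) - \tfrac{2}{3}\sigma_Y \dfrac{d\sigma_Y}{d\bar\varepsilon^p}\, \dot{\bar\varepsilon}^p = 0$.

Replacing $\dot{(\cdot)}$ quantities with $\Delta(\cdot)$ in Eqs. (15) and (16), solving for $\Delta\lambda$
and substituting it into Eq. (3) and into $\Delta\boldsymbol{\tau} = \mathbf{D}^e(\Delta\mathbf{e} - \Delta\mathbf{e}^p)$, we obtain

(17)  $\mathbf{D}^{ep} = \left\{ \mathbf{I} + \mathbf{D}^e \mathbf{P}\boldsymbol{\eta}\,(\mathbf{P}\boldsymbol{\eta})^T \Big/ \left[ \tfrac{4}{9}\sigma_Y^2 \left( c + \dfrac{d\sigma_Y}{d\bar\varepsilon^p} \right) \right] \right\}^{-1} \mathbf{D}^e$,

(18)  $\Delta\boldsymbol{\tau} = \mathbf{D}^{ep} \Delta\mathbf{e}$,

(19)  $\Delta\lambda = \boldsymbol{\eta}^T \mathbf{P}\, \Delta\boldsymbol{\tau} \Big/ \left[ \tfrac{4}{9}\sigma_Y^2 \left( c + \dfrac{d\sigma_Y}{d\bar\varepsilon^p} \right) \right]$,

(20)  $\Delta\mathbf{e}^p = \mathbf{P}\boldsymbol{\eta}\, \Delta\lambda$,

(21)  $\Delta\boldsymbol{\alpha} = \tfrac{2}{3}\, c\, \boldsymbol{\eta}\, \Delta\lambda$,

where $\boldsymbol{\eta} = \boldsymbol{\eta}_n$ and $\sigma_Y = \sigma_Y(\bar\varepsilon^p_n)$. The updated fields are simply

(22)  $\boldsymbol{\tau}_{n+1} = \boldsymbol{\tau}_n + \Delta\boldsymbol{\tau}$,  $\mathbf{e}^p_{n+1} = \mathbf{e}^p_n + \Delta\mathbf{e}^p$, ... .

This process is called the tangent modulus approach.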

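For ideal plasticity, i.e., $c = d\sigma_Y/d\bar\varepsilon^p = 0$, we derive $\mathbf{D}^{ep}$ and $\Delta\lambda$
according to Sec. 6.10 [see Eqs. (6.10:8)-(6.10:13)],

(22a)  $\mathbf{D}^{ep} = \mathbf{D}^e - \mathbf{D}^e \mathbf{P}\boldsymbol{\tau}\boldsymbol{\tau}^T \mathbf{P}\mathbf{D}^e / (\boldsymbol{\tau}^T \mathbf{P}\mathbf{D}^e \mathbf{P}\boldsymbol{\tau})$,

       $\Delta\lambda = \boldsymbol{\tau}^T \mathbf{P}\mathbf{D}^e \Delta\mathbf{e} / (\boldsymbol{\tau}^T \mathbf{P}\mathbf{D}^e \mathbf{P}\boldsymbol{\tau})$.

The other equations are in the same form as before.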
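Equation (22a) can be evaluated directly, as in the following sketch for the 3-D case. The helper names, the material constants, and the stress state are hypothetical; the strain matrix is assumed to carry engineering shear components, so the shear entries of $\mathbf{D}^e$ are G.

```python
import numpy as np

def elastic_matrix(E, nu):
    """Isotropic 6 x 6 modulus matrix for strains [e11 e22 e33 2e12 2e23 2e31]."""
    lam = E * nu / ((1.0 + nu) * (1.0 - 2.0 * nu))
    G = E / (2.0 * (1.0 + nu))
    D = np.zeros((6, 6))
    D[:3, :3] = lam + 2.0 * G * np.eye(3)
    D[3:, 3:] = G * np.eye(3)
    return D

def projection_matrix():
    """3-D matrix P of Eq. (14c)."""
    P = np.zeros((6, 6))
    P[:3, :3] = (3.0 * np.eye(3) - np.ones((3, 3))) / 3.0
    P[3:, 3:] = 2.0 * np.eye(3)
    return P

def ideal_plastic_tangent(D, P, tau):
    """Elastic-plastic modulus of Eq. (22a) for ideal plasticity."""
    n = D @ P @ tau                  # D^e P tau
    denom = tau @ P @ n              # tau^T P D^e P tau
    return D - np.outer(n, n) / denom

# Hypothetical material constants and stress state (consistent units).
D = elastic_matrix(200.0e3, 0.3)
P = projection_matrix()
tau = np.array([200.0, 50.0, -30.0, 40.0, 10.0, -20.0])
D_ep = ideal_plastic_tangent(D, P, tau)
d_eps = np.array([1.0e-3, -2.0e-4, 3.0e-4, 5.0e-4, 0.0, -1.0e-4])
d_tau = D_ep @ d_eps
d_lambda = tau @ P @ D @ d_eps / (tau @ P @ D @ P @ tau)
```

A useful check is that the stress increment is tangent to the yield surface, $(\mathbf{P}\boldsymbol{\tau})^T \Delta\boldsymbol{\tau} = 0$, for any strain increment, which follows algebraically from Eq. (22a).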

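For finite $\Delta\mathbf{u}$, the updated fields satisfy the yield condition with an
error of the order $(\Delta u)^2$, i.e., $f(\boldsymbol{\eta}_{n+1}, \bar\varepsilon^p_{n+1}) = O(\Delta u)^2$. We seek improved
updated fields in the form (Simo and Taylor 1986):

(23)  $\Delta\mathbf{e} = \mathbf{d}\,\Delta\mathbf{u} = \mathbf{B}\,\Delta\mathbf{q}$,

(24)  $\Delta\mathbf{e}^p_{n+\alpha} = \Gamma\, \mathbf{P}\boldsymbol{\eta}_{n+\alpha}$,

(25)  $\Delta\boldsymbol{\alpha}_{n+\alpha} = 2c\,\Gamma\, \boldsymbol{\eta}_{n+\alpha}/3$,

(26)  $\Delta\bar\varepsilon^p_{n+\alpha} = \sqrt{2/3}\;\Gamma\, (\boldsymbol{\eta}^T_{n+\alpha} \mathbf{P} \boldsymbol{\eta}_{n+\alpha})^{1/2}$,

(27)  $\mathbf{e}_{n+1} = \mathbf{e}_n + \Delta\mathbf{e} = \mathbf{e}_n + \mathbf{B}\,\Delta\mathbf{q}$,  $\mathbf{e}_{n+\alpha} = \mathbf{e}_n + \alpha\,\Delta\mathbf{e}$,

(28)  $\boldsymbol{\tau}_{n+1} = \boldsymbol{\tau}_n + \mathbf{D}^e(\Delta\mathbf{e} - \Delta\mathbf{e}^p_{n+\alpha})$,
      $\boldsymbol{\tau}_{n+\alpha} = \boldsymbol{\tau}_n + \alpha\,\mathbf{D}^e(\Delta\mathbf{e} - \Delta\mathbf{e}^p_{n+\alpha})$,

(29)  $\boldsymbol{\alpha}_{n+1} = \boldsymbol{\alpha}_n + \Delta\boldsymbol{\alpha}_{n+\alpha}$,  $\boldsymbol{\alpha}_{n+\alpha} = \boldsymbol{\alpha}_n + \alpha\,\Delta\boldsymbol{\alpha}_{n+\alpha}$,

(30)  $\mathbf{e}^p_{n+1} = \mathbf{e}^p_n + \Delta\mathbf{e}^p_{n+\alpha} = \mathbf{e}^p_n + \Gamma\, \mathbf{P}\boldsymbol{\eta}_{n+\alpha}$,  $\mathbf{e}^p_{n+\alpha} = \mathbf{e}^p_n + \alpha\,\Delta\mathbf{e}^p_{n+\alpha}$,

(31)  $\bar\varepsilon^p_{n+1} = \bar\varepsilon^p_n + \Delta\bar\varepsilon^p_{n+\alpha} = \bar\varepsilon^p_n + \Gamma\, (2\boldsymbol{\eta}^T_{n+\alpha} \mathbf{P} \boldsymbol{\eta}_{n+\alpha}/3)^{1/2}$,
      $\bar\varepsilon^p_{n+\alpha} = \bar\varepsilon^p_n + \alpha\,\Delta\bar\varepsilon^p_{n+\alpha}$,

where the scalar $\alpha$ (not to be confused with the back stress $\boldsymbol{\alpha}$) is a constant with value
commonly $1/2 \le \alpha \le 1$, chosen to improve
the accuracy of Eqs. (15)-(17), and $\Gamma$ is a positive constant to be determined
from the yield condition Eq. (13)

(32)  $f(\boldsymbol{\eta}_{n+\alpha}, \bar\varepsilon^p_{n+\alpha}) = \frac{1}{2}\boldsymbol{\eta}^T_{n+\alpha} \mathbf{P} \boldsymbol{\eta}_{n+\alpha} - \frac{1}{3}\sigma_Y^2(\bar\varepsilon^p_{n+\alpha}) = 0$.

In Eq. (32), $\boldsymbol{\eta}_{n+\alpha}$ and $\bar\varepsilon^p_{n+\alpha}$ are defined as

(33)  $\boldsymbol{\eta}_{n+\alpha} = \boldsymbol{\tau}_n + \alpha\,\mathbf{D}^e \Delta\mathbf{e} - \alpha\Gamma\, \mathbf{D}^e \mathbf{P} \boldsymbol{\eta}_{n+\alpha} - \boldsymbol{\alpha}_n - 2\alpha c\,\Gamma\, \boldsymbol{\eta}_{n+\alpha}/3$,

(34)  $\bar\varepsilon^p_{n+\alpha} = \bar\varepsilon^p_n + \alpha\Gamma \sqrt{2\boldsymbol{\eta}^T_{n+\alpha} \mathbf{P} \boldsymbol{\eta}_{n+\alpha}/3}$.

Solving for $\boldsymbol{\eta}_{n+\alpha}$ from Eq. (33) gives

(35)  $\boldsymbol{\eta}_{n+\alpha} = [(1 + 2\alpha c\,\Gamma/3)\mathbf{I} + \alpha\Gamma\, \mathbf{D}^e \mathbf{P}]^{-1} (\boldsymbol{\eta}_n + \alpha\,\mathbf{D}^e \Delta\mathbf{e})$.

A substitution of Eqs. (34) and (35) into Eq. (32) furnishes a nonlinear
algebraic equation for $\Gamma$ to be solved by iteration. One can then update all
fields according to Eqs. (27)-(31), which are second order accurate, i.e.,

     $f(\boldsymbol{\eta}_{n+1}, \bar\varepsilon^p_{n+1}) = O(\Delta u)^3$.

For other yield surfaces, one may not be able to establish a single algebraic
equation for $\Gamma$ from the plastic constraint equations. In this case, one has
to solve for $\Gamma$, $\boldsymbol{\eta}_{n+\alpha}$, $\mathbf{e}^p_{n+\alpha}$, and $\bar\varepsilon^p_{n+\alpha}$ in terms of $\Delta\mathbf{u}$ simultaneously.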
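The scalar equation for $\Gamma$ obtained from Eqs. (32), (34) and (35) can be solved numerically, as sketched below for the 3-D case. The text does not prescribe the iteration; bisection is used here as a simple, robust assumption. The function names, the linear isotropic hardening law, and the numerical values are hypothetical, and $\alpha = 1$ is chosen so that the $n+\alpha$ state coincides with the $n+1$ state.

```python
import numpy as np

def elastic_matrix(E, nu):
    """Isotropic 6 x 6 modulus matrix for strains [e11 e22 e33 2e12 2e23 2e31]."""
    lam = E * nu / ((1.0 + nu) * (1.0 - 2.0 * nu))
    G = E / (2.0 * (1.0 + nu))
    D = np.zeros((6, 6))
    D[:3, :3] = lam + 2.0 * G * np.eye(3)
    D[3:, 3:] = G * np.eye(3)
    return D

def projection_matrix():
    """3-D matrix P of Eq. (14c)."""
    P = np.zeros((6, 6))
    P[:3, :3] = (3.0 * np.eye(3) - np.ones((3, 3))) / 3.0
    P[3:, 3:] = 2.0 * np.eye(3)
    return P

def solve_gamma(eta_n, eps_bar_n, d_eps, D, P, c, sigma_y, alpha=1.0, n_bisect=200):
    """Return (Gamma, eta_{n+alpha}, eps_bar_{n+alpha}) satisfying Eq. (32)."""
    trial = eta_n + alpha * D @ d_eps
    identity = np.eye(len(trial))

    def state(gamma):
        A = (1.0 + 2.0 * alpha * c * gamma / 3.0) * identity + alpha * gamma * D @ P
        eta = np.linalg.solve(A, trial)                                   # Eq. (35)
        eps_bar = eps_bar_n + alpha * gamma * np.sqrt(2.0 * eta @ P @ eta / 3.0)  # Eq. (34)
        return eta, eps_bar

    def f(gamma):
        eta, eps_bar = state(gamma)
        return 0.5 * eta @ P @ eta - sigma_y(eps_bar) ** 2 / 3.0          # Eq. (32)

    if f(0.0) <= 0.0:
        return 0.0, trial, eps_bar_n      # elastic step, no plastic correction
    lo, hi = 0.0, 1.0 / D[3, 3]
    while f(hi) > 0.0:                    # expand until the root is bracketed
        hi *= 2.0
    for _ in range(n_bisect):
        mid = 0.5 * (lo + hi)
        if f(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    gamma = 0.5 * (lo + hi)
    eta, eps_bar = state(gamma)
    return gamma, eta, eps_bar

# Hypothetical data: steel-like constants, linear isotropic hardening.
D = elastic_matrix(200.0e3, 0.3)
P = projection_matrix()
sigma_y = lambda eps_bar: 250.0 + 2000.0 * eps_bar
d_eps = np.array([4.0e-3, -1.2e-3, -1.2e-3, 1.0e-3, 0.0, 0.0])
gamma, eta_new, eps_bar_new = solve_gamma(np.zeros(6), 0.0, d_eps, D, P,
                                          c=1000.0, sigma_y=sigma_y)
f_final = 0.5 * eta_new @ P @ eta_new - sigma_y(eps_bar_new) ** 2 / 3.0
```

After the solve, the yield condition is satisfied at the returned state to within round-off, in contrast to the tangent modulus update, which satisfies it only to order $(\Delta u)^2$.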

For plane strain, $\Delta e_{31} = \Delta e_{32} = \Delta e_{33} = 0$ and $\Delta\mathbf{e}$ is a function of x
and y only. For plane stress, $\eta_{31} = \eta_{32} = \eta_{33} = 0$, and P and $\mathbf{D}^e$ are given
in Eqs. (21.6:14b) and (18.14:23). The solution is also a function of x and
y only. To assure $(\tau_{33})_{n+1} = 0$, we update $e_{33}$ as

     $(e_{33})_{n+1} = -\dfrac{\nu}{E}\,[(\tau_{11})_{n+1} + (\tau_{22})_{n+1}] - [(e^p_{11})_{n+1} + (e^p_{22})_{n+1}]$

for isotropic materials, and according to the appropriate strain-stress
relation for general materials.

Isotropic Materials. It can be shown that, for isotropic materials, 

(35a) D e P = PD e , D e PA = 2GPt?, rj T PD e P?? = 2Grj T Pr ), 
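if the problem is plane strain or 3-dimensional, and

(35b)  $\mathbf{D}^e \mathbf{P}\boldsymbol{\eta} = \dfrac{E}{3(1 - \nu^2)} \begin{Bmatrix} (2 - \nu)\eta_{11} - (1 - 2\nu)\eta_{22} \\ (2 - \nu)\eta_{22} - (1 - 2\nu)\eta_{11} \\ 3(1 - \nu)\eta_{12} \end{Bmatrix}$,

       $\boldsymbol{\eta}^T \mathbf{P}\mathbf{D}^e \mathbf{P}\boldsymbol{\eta} \ne 2G\,\boldsymbol{\eta}^T \mathbf{P}\boldsymbol{\eta}$,

if the problem is plane stress. Then,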

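(36)  $\mathbf{P} = \mathbf{Q}\boldsymbol{\Lambda}_P \mathbf{Q}^T$,

(37)  $\mathbf{D}^e = \mathbf{Q}\boldsymbol{\Lambda}_D \mathbf{Q}^T$,

where

(38)  $\mathbf{Q} = \dfrac{1}{\sqrt{2}} \begin{bmatrix} 1 & -1 & 0 \\ 1 & 1 & 0 \\ 0 & 0 & \sqrt{2} \end{bmatrix}$,

(39)  $\boldsymbol{\Lambda}_P = \mathrm{dia}[1/3 \;\; 1 \;\; 2]$,

(40)  $\boldsymbol{\Lambda}_D = \dfrac{E}{1 - \nu^2}\, \mathrm{dia}\left[ 1 + \nu \;\;\; 1 - \nu \;\;\; \dfrac{1 - \nu}{2} \right]$,

(41)  $\boldsymbol{\Lambda}_D \boldsymbol{\Lambda}_P = \dfrac{E}{1 - \nu^2}\, \mathrm{dia}\left[ \dfrac{1 + \nu}{3} \;\;\; 1 - \nu \;\;\; 1 - \nu \right]$,

for plane stress, and

(42)  $\mathbf{Q} = \begin{bmatrix} \mathbf{Q}_1 & 0 \\ 0 & \mathbf{I} \end{bmatrix}$,  $\mathbf{Q}_1 = \dfrac{1}{\sqrt{6}} \begin{bmatrix} \sqrt{3} & -1 & \sqrt{2} \\ -\sqrt{3} & -1 & \sqrt{2} \\ 0 & 2 & \sqrt{2} \end{bmatrix}$,

(43)  $\boldsymbol{\Lambda}_P = \mathrm{dia}[1 \;\; 1 \;\; 0 \;\; 2 \;\; 2 \;\; 2]$,

(44)  $\boldsymbol{\Lambda}_D = \dfrac{E}{1 + \nu}\, \mathrm{dia}\left[ 1 \;\;\; 1 \;\;\; \dfrac{1 + \nu}{1 - 2\nu} \;\;\; \dfrac{1}{2} \;\;\; \dfrac{1}{2} \;\;\; \dfrac{1}{2} \right]$,

(45)  $\boldsymbol{\Lambda}_D \boldsymbol{\Lambda}_P = \dfrac{E}{1 + \nu}\, \mathrm{dia}[1 \;\; 1 \;\; 0 \;\; 1 \;\; 1 \;\; 1]$,

for 3-dimensional and plane strain problems, in which dia[...] denotes a
diagonal matrix with the diagonal terms given in the bracket and I is a
3x3 identity matrix. A substitution of Eqs. (36)-(45) into Eq. (35) yields

(46)  $\boldsymbol{\eta}_{n+\alpha} = \mathbf{Q}\,[(1 + 2\alpha c\,\Gamma/3)\mathbf{I} + \alpha\Gamma\, \boldsymbol{\Lambda}_D \boldsymbol{\Lambda}_P]^{-1}\, \mathbf{Q}^T (\boldsymbol{\eta}_n + \alpha\,\mathbf{D}^e \Delta\mathbf{e})$,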
in which $\boldsymbol{\Lambda}_D \boldsymbol{\Lambda}_P$ is a diagonal matrix. For pure kinematic hardening
materials, i.e., $\sigma_Y$ is constant, Eq. (32) reduces to a quartic equation in $\Gamma$ for
plane stress. The quartic equation has only one positive root for $\Gamma$ and can
be solved in closed form (Simo and Govindjee 1988).

Fig. 21.6:1. Isotropic hardening simulated by two non-hardening materials:
(a) two ideally plastic materials; (b) equivalent stress-strain relation
of the two overlaid materials.

If the material is also ideal plastic, Eqs. (17) and (19) become 

Op Q 

(46a) D ep - D e - — Pr(Pr) T , AA = r r PAe, 

for plane strain and 3 dimensional cases. Other equations can be derived 
accordingly. For plane stress, D e Pr/ in Eq. (22a) is given by Eq. (35b). 

Sublayer Hardening Model. An alternative way to model isotropic
hardening is to use the overlay technique (Pian 1987). In this approach, two
or more non-hardening materials are subjected to the same strains.
Figure 21.6:1 illustrates a two-nonhardening-material model with the
equivalent stress-strain relation in uniaxial tension being

     $\sigma = E\varepsilon$  for  $\varepsilon < \varepsilon_1$,
     $\sigma = a_2 E(\varepsilon - \varepsilon_1) + \sigma_1$  for  $\varepsilon_1 < \varepsilon < \varepsilon_2$,
     $\sigma = a_1\sigma_1 + a_2\sigma_2$  for  $\varepsilon_2 < \varepsilon$,

in which $a_1$, $a_2$ $(= 1 - a_1)$ are the fractions of the two materials, $\sigma_1$, $\sigma_2$ are
their yield stresses and $\varepsilon_1$, $\varepsilon_2$ are the corresponding strains at the onset of
yielding. The plastic strain is

     $\varepsilon^p_1 = 0$  for  $\varepsilon < \varepsilon_1$,
     $\varepsilon^p_1 = a_1(\varepsilon - \varepsilon_1)$  for  $\varepsilon_1 < \varepsilon$

for material 1,

     $\varepsilon^p_2 = 0$  for  $\varepsilon < \varepsilon_2$,
     $\varepsilon^p_2 = a_2(\varepsilon - \varepsilon_2)$  for  $\varepsilon_2 < \varepsilon$

for material 2.
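The overlay idea can be checked numerically: summing the weighted responses of elastic-ideally plastic sublayers under a common strain reproduces the piecewise-linear relation above. The sketch assumes monotonic uniaxial tension and equal elastic moduli; the function names and numbers are hypothetical.

```python
def overlay_stress(strain, E, fractions, yield_stresses):
    """Weighted sum of elastic-ideally plastic sublayers sharing one strain."""
    return sum(a * min(E * strain, s) for a, s in zip(fractions, yield_stresses))

def piecewise_stress(strain, E, a1, sigma1, sigma2):
    """Equivalent stress-strain relation of the two-sublayer model."""
    a2 = 1.0 - a1
    eps1, eps2 = sigma1 / E, sigma2 / E
    if strain < eps1:
        return E * strain
    if strain < eps2:
        return a2 * E * (strain - eps1) + sigma1
    return a1 * sigma1 + a2 * sigma2

# Hypothetical data: 60% of material 1 (yield 200), 40% of material 2 (yield 400).
E, a1, sigma1, sigma2 = 200.0e3, 0.6, 200.0, 400.0
strains = [0.0005, 0.0015, 0.0030]
overlay = [overlay_stress(e, E, (a1, 1.0 - a1), (sigma1, sigma2)) for e in strains]
direct = [piecewise_stress(e, E, a1, sigma1, sigma2) for e in strains]
```

More sublayers give more break points in the equivalent curve, so a measured hardening curve can be approximated as closely as desired by a piecewise-linear one.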


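Dynamic Analysis. In dynamic analysis, $\Delta\mathbf{u} = \mathbf{u}_{n+1} - \mathbf{u}_n$ can be
calculated from acceleration in explicit integration. The acceleration at $t_{n+1}$
is derived from the equation of motion based on the principle of virtual
work,

(47)  $\delta\Pi = \int_V (\delta\mathbf{u}^T \ddot{\mathbf{u}}\,\rho + \delta\mathbf{e}^T \boldsymbol{\tau} - \rho\,\delta\mathbf{u}^T \mathbf{b})\, dV - \int_{\partial V_\sigma} \delta\mathbf{u}^T \mathbf{T}\, dS = 0$

with $\boldsymbol{\tau}_{n+1}$ from Eq. (28) in terms of $\Delta\mathbf{e}$, $\Delta\mathbf{e}^p_{n+\alpha}$ from Eqs. (23) and (24).
From the acceleration, one determines the new $\Delta\mathbf{u}$ and proceeds to the
next time increment.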


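Static Analysis. For static analyses, $\Delta\mathbf{u}$ is not known a priori at each
increment. It must satisfy the incremental equation of motion, which can
be derived from Eq. (47) in the form

(48)  $\delta\Delta\Pi = \int_V [\delta\mathbf{e}^T \mathbf{D}^e(\Delta\mathbf{e} - \Delta\mathbf{e}^p_{n+\alpha}) - \rho\,\delta\mathbf{u}^T(\mathbf{b}_{n+1} - \mathbf{b}_n)]\, dV - \int_{\partial V_\sigma} \delta\mathbf{u}^T(\mathbf{T}_{n+1} - \mathbf{T}_n)\, dS = 0$,

or

(49)  $\mathbf{K}\,\Delta\mathbf{q} = \mathbf{F}_p(\Delta\mathbf{e}^p_{n+\alpha}) + \Delta\mathbf{F}$,

where $\mathbf{F}_p$ is derived from the volume integration of $\delta\mathbf{e}^T \mathbf{D}^e \Delta\mathbf{e}^p_{n+\alpha}$ and $\Delta\mathbf{F}$
from integrals involving the body force increment $(\mathbf{b}_{n+1} - \mathbf{b}_n)$ and the
prescribed surface traction increment $(\mathbf{T}_{n+1} - \mathbf{T}_n)$ and from prescribed
boundary displacements by enforcing rigid constraints. Equation (49)
differs from the elasticity equation only by the term $\mathbf{F}_p(\Delta\mathbf{e}^p_{n+\alpha})$, which
itself is a function of $\Delta\mathbf{q}$. Recall that $\Delta\mathbf{u} = \mathbf{h}\,\Delta\mathbf{q}$ and $\Delta\mathbf{e} = \mathbf{B}\,\Delta\mathbf{q}$.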

Equation (49) is to be solved by an iterative process together with the
plastic constraints of Eqs. (23)-(31), a set of highly nonlinear coupled equations.
One approach is to start the iteration by assuming $\Delta\mathbf{u}$ equal to that of the
previous step as a predictor, calculating the updated fields according to
Eqs. (23)-(31) subject to the constraint of Eq. (32) for locations where
yielding takes place, evaluating $\mathbf{F}_p(\Delta\mathbf{e}^p_{n+\alpha})$ based on the predicted $\Delta\mathbf{u}$ at
the yielded locations, then solving Eq. (49) for the corrected $\Delta\mathbf{u}$. One
repeats the process until the estimated error for Eq. (32) is within a specified
limit. In this approach, K is the same as that of the elastic solution and
remains constant throughout the entire solution process. All plasticity effects
are contained in $\mathbf{F}_p(\Delta\mathbf{e}^p_{n+\alpha})$. This iteration process is sometimes called the initial
strain approach, i.e., an initial strain (in terms of $\Delta\mathbf{u}$) is first assumed and
corrections are then sought. Various Newton-Raphson schemes have also
been used to determine $\Delta\mathbf{u}$.

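Another approach is to modify Eq. (48) to the form

(50)  $\int_V [\delta\mathbf{e}^T \mathbf{D}^e(\Delta\mathbf{e} - \Delta\mathbf{e}^p)]\, dV = \int_V [\delta\mathbf{e}^T \mathbf{D}^{ep}_n \Delta\mathbf{e}]\, dV$

      $\quad = \int_V [\delta\mathbf{e}^T \mathbf{D}^e(\Delta\mathbf{e}^p_{n+\alpha} - \Delta\mathbf{e}^p)]\, dV + \int_V \rho\,\delta\mathbf{u}^T(\mathbf{b}_{n+1} - \mathbf{b}_n)\, dV + \int_{\partial V_\sigma} \delta\mathbf{u}^T(\mathbf{T}_{n+1} - \mathbf{T}_n)\, dS$

or

(51)  $\mathbf{K}^{ep}_n \Delta\mathbf{q} = \mathbf{F}_p(\Delta\mathbf{e}^p_{n+\alpha} - \Delta\mathbf{e}^p) + \Delta\mathbf{F}$,

where $\mathbf{D}^{ep}_n$ and $\Delta\mathbf{e}^p$ are defined in Eqs. (17) and (20) with subscript n
denoting the values at $t_n$, and $\mathbf{F}_p(\Delta\mathbf{e}^p_{n+\alpha} - \Delta\mathbf{e}^p)$ is derived from the first
integral on the rightmost side of Eq. (50). Thus $\mathbf{K}^{ep}_n$ is derived using $\mathbf{D}^{ep}_n$ rather
than $\mathbf{D}^e$. For general materials, $\mathbf{D}^{ep}_n$ and $\Delta\mathbf{e}^p$ are derivable from formulae
given in Secs. 6.10-6.12. One can use the same iterative method described
earlier in this section to establish the solution for $\Delta\mathbf{q}$, i.e., one starts the
iteration with an assumed $\Delta\mathbf{q}$ to calculate $\mathbf{F}_p(\Delta\mathbf{e}^p_{n+\alpha} - \Delta\mathbf{e}^p)$ and determines
the corrected $\Delta\mathbf{q}$ by solving Eq. (51). The method is called the modified
tangent modulus approach. The conventional tangent modulus approach
assumes $\mathbf{F}_p(\Delta\mathbf{e}^p_{n+\alpha} - \Delta\mathbf{e}^p)$ to be zero. Since $\mathbf{K}^{ep}_n$ depends on the field
values, one must refactor $\mathbf{K}^{ep}_n$ in solving for $\Delta\mathbf{q}$, which is a time-consuming
process. However, the approach often improves the rate of convergence.

One can also update $\mathbf{K}^{ep}_n$ using the current iterated field values during
the iteration for a given load increment. This is done in particular if many
locations change from loading to unloading or vice versa in the step. Such
changes greatly affect the value of $\mathbf{D}^{ep}$ in the evaluation of $\mathbf{K}^{ep}_n$.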

Large Deformation. To avoid confusion between the
definitions of the strain matrix and the strain tensor, we shall use Cartesian
coordinates and the index notation to illustrate the idea. For 3-dimensional
problems, a strain matrix is a column matrix with 6 components
[Eq. (18.16:3)],

     $\mathbf{e}^T = [e_{11}\ e_{22}\ e_{33}\ 2e_{12}\ 2e_{23}\ 2e_{31}]$,

whereas a strain tensor has 9 components $e_{ij}$. If one assumes that the
plasticity constraints Eqs. (24)-(26) hold for the incremental plasticity
fields, i.e.,


(52) 


(Ae5)„ + .=r(|(-) , (Ao iJ )„ + „ = (§^~) 

\ UT 1] / n+a 6 \ or O J n+a 

n \ 1/2 

(AeP)„ +a = ^-Ae'Ae' J + , 


where (Ae(b) n+a denotes the value at t n+a , and the strain increments are 
calculated based on the current configuration, 3 i.e., 


(53) A eij = LDij 


1 <9Alq dAuj 

2 [dxf + ox; 1 


A ttu = 



dAuj 

dXJ 1 


the elastic-plastic analysis carries over to large deformation by properly
defining the stress measure. For example, if the stress increment is based on the
Jaumann rate of the Cauchy stress [Eq. (16.9:20) or (17.5:3)], i.e.,

(54)  $(\Delta\tau_{ij})_{n+1} = D^e_{ijkl}\,[\Delta e_{kl} - (\Delta e^p_{kl})_{n+\alpha}]$,

      $(\Delta\tau_{ij})_{n+\alpha} = \alpha\, D^e_{ijkl}\,[\Delta e_{kl} - (\Delta e^p_{kl})_{n+\alpha}]$,

where $D^e_{ijkl}$ is the incremental elastic modulus matrix, which can be a
function of the deformation gradient and the Cauchy stress at $t_n$, then the
updated Cauchy and back stresses are

(55)  $(\tau_{ij})_{n+1} = (\tau_{ij} + \Delta\Omega_{ik}\tau_{kj} - \tau_{ik}\Delta\Omega_{kj})_n + (\Delta\tau_{ij})_{n+1}$,

(56)  $(\alpha_{ij})_{n+1} = (\alpha_{ij} + \Delta\Omega_{ik}\alpha_{kj} - \alpha_{ik}\Delta\Omega_{kj})_n + (\Delta\alpha_{ij})_{n+\alpha}$.

If the increment of the 2nd Piola-Kirchhoff stress in the updated
Lagrange description is used in the incremental constitutive equation Eq. (54),
3 Por dynamic analysis, Au is calculated from acceleration at each increment. One often 
defines 


A Cij 


1 

2 


where X j n+1/I2 


9A Ui dAuj 

dx n+l/2 + Qx n+ 1/2 
= XV + ±A Ui . 


A fij j 


1 dA m 



dAuj 

ax"- +1/2 
we then calculate the Cauchy and back stress increments according to
Eq. (21.1:15).

For finite deformation, solutions obtained by different objective rates
can be substantially different. Some of the solutions may be physically
unreasonable. For example (Truesdell 1955), when a linear hypoelastic
material is subjected to finite simple shear and the Jaumann rate of the
Cauchy stress is used in the linear incremental constitutive equation, the
shear stress is oscillatory.

For dynamic analysis, the equation of motion has exactly the same form
as Eq. (21.3:1), which is reproduced here in index notation

(57)  $\delta\Pi = \int_V (\delta u_i\, \ddot u_i\, \rho + \delta e_{ij}\tau_{ij} - \rho\,\delta u_i b_i)\, dV - \int_{\partial V_\sigma} \delta u_i T_i\, dS = 0$,

with the updated Cauchy stress and the Jaumann rate defined in Eqs. (55)
and (54). The plasticity effects are contained in the incremental
stress-strain relations in terms of $(\Delta e^p_{kl})_{n+\alpha}$ as given in Eq. (54). Thus once
$(\Delta\tau_{ij})_{n+1}$ and $(\Delta\alpha_{ij})_{n+\alpha}$ are determined from the known $\Delta u_i$, one uses
Eq. (57) to determine the acceleration at $t_{n+1}$ and predict $\Delta u_i$ for the next
increment.

For static analysis, $\Delta u_i$ is not known a priori and must be solved for
from the equation of motion together with the plasticity constraints. The
equation of motion is the same as Eq. (48) with additional terms to account
for large deformation shown below:

(58)  $\delta\Delta\Pi = \int_{V_n} \{ \delta e_{ij}\,[\Delta\Omega_{ik}\tau_{kj} - \tau_{ik}\Delta\Omega_{kj} + (\Delta\tau_{ij})_{n+1}] - \delta u_i\,[(\rho b_i)_{n+1} - (\rho b_i)_n] \}\, dV$

      $\quad - \int_{\partial V_{\sigma n}} \delta u_i\,[(T_i)_{n+1} - (T_i)_n]\, dS = 0$.

It goes without saying that the integrals in Eq. (58) are over the current
configuration. The iteration process for small deformation applies to large
deformation as well.

The integration can be unstable if the incremental step is too large.
Readers are referred to the literature for stability criteria (Cormeau 1975,
Ortiz and Popov 1985). A practical criterion for dynamic analysis is
to limit the time step below the minimum characteristic dimension of all
elements divided by the adiabatic sound speed (Hallquist 1998). The
characteristic dimension is usually defined as the element volume divided
by the largest surface area of the sides of the element.
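The practical criterion above amounts to a one-line computation per element. In the sketch below the element data, the sound speed, and the safety factor `safety` are hypothetical; the safety factor in particular is an assumption added for illustration and is not part of the criterion as stated.

```python
def characteristic_length(volume, face_areas):
    """Element volume divided by the largest face area."""
    return volume / max(face_areas)

def stable_time_step(elements, sound_speed, safety=0.9):
    """Time step bound: smallest characteristic length over the sound speed."""
    shortest = min(characteristic_length(v, areas) for v, areas in elements)
    return safety * shortest / sound_speed

# Hypothetical mesh: a unit cube and a 1 x 1 x 0.5 brick (volume, face areas).
elements = [
    (1.0, [1.0, 1.0, 1.0, 1.0, 1.0, 1.0]),
    (0.5, [1.0, 1.0, 0.5, 0.5, 0.5, 0.5]),
]
dt = stable_time_step(elements, sound_speed=5000.0)
```

The flattest element controls the step: here the brick has a characteristic dimension of 0.5, so a single thin element can govern the cost of the whole explicit analysis.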


21.7. VISCOPLASTICITY 

For viscoplastic materials, assuming linear viscosity and small
deformation for simplicity, we can write the constitutive relations in the form

(1)  $\dot e^p_{ij} = \dfrac{H(f)}{\eta}\, \dfrac{\partial f}{\partial \tau_{ij}}$,

(2)  $\dot\xi_i = \dfrac{H(f)}{\eta}\, U_i(\tau_{ij}, \xi_i)$,

(3)  $\tau_{ij} = D^e_{ijkl}(e_{kl} - e^p_{kl})$,

where $\eta$ is a known "viscosity" parameter, f is the yield function (only
associated materials are considered here), $U_i$ is a known function characterizing
the internal variable $\xi_i$, and H is the Heaviside function defined as

(4)  $H(f) = 0$ if $f \le 0$,
     $H(f) = f$ if $f > 0$.

The internal variable can be the equivalent plastic strain and/or the back
stress. The finite element equation of motion is derived from

     $\int \delta e_{ij}\, \dot\tau_{ij}\, dV - \dot F_i = \int \delta e_{ij}\, D^e_{ijkl}(\dot e_{kl} - \dot e^p_{kl})\, dV - \dot F_i = 0$,

which leads to the finite element equation

(5)  $\mathbf{K}^0_T\, \dot{\mathbf{q}} = \mathbf{F}_p(\dot e^p_{kl}) + \dot{\mathbf{F}}$,

where $\mathbf{K}^0_T$ is the stiffness matrix, which is the same as that of elasticity and is derived from
the terms associated with $\dot e_{kl}$, $\mathbf{F}_p(\dot e^p_{kl})$ is from the terms associated with $\dot e^p_{kl}$,
which contains all the viscoplasticity effects, and $\dot{\mathbf{F}}$ is the rate of applied
load, and is equivalent to $\Delta\mathbf{F}$ of Eq. (21.6:49). Equations (1)-(5) are a
set of first order nonlinear differential equations of $q_i$, $e^p_{kl}$, $\xi_i$, which can be
integrated with the initial conditions or solved by iteration. Note that, as
the stress rate associated with $\dot e^p_{kl}$ always points toward the surface of the
yield potential $\left( -D^e_{ijkl}\, \dot e^p_{kl} = -\dfrac{H(f)}{\eta}\, D^e_{ijkl}\, \dfrac{\partial f}{\partial \tau_{kl}} \right)$ when yielding occurs, from
Eqs. (1) and (3), the stress state lying outside (f > 0) moves toward the
yield surface. The solution approaches the yield surface, i.e., $f(\tau_{ij}, \xi_i) \to 0$
at the steady state.

The main differences between the viscoplasticity and the classical
plasticity theories are that the proportional factor $H(f)/\eta$ for the rates of
variables in viscoplasticity is specified explicitly in terms of the yield function,
and that the time is a physical parameter.
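The approach of the stress state to the yield surface can be seen in a one-dimensional sketch of Eqs. (1), (3) and (4). The yield function $f = |\tau| - \sigma_Y$, the absence of internal variables, the explicit Euler time stepping, and all names and numbers are assumptions for this example.

```python
def relax_stress(E, eta, sigma_y, total_strain, dt, steps):
    """Hold the total strain fixed and integrate the viscoplastic strain in time."""
    eps_p = 0.0
    history = []
    for _ in range(steps):
        tau = E * (total_strain - eps_p)        # Eq. (3), one-dimensional
        f = abs(tau) - sigma_y                  # hypothetical 1-D yield function
        H = f if f > 0.0 else 0.0               # Eq. (4)
        sign = 1.0 if tau >= 0.0 else -1.0      # df/dtau
        history.append(tau)
        eps_p += dt * H / eta * sign            # Eq. (1), explicit Euler step
    return history

# Hypothetical data; the overstress decays at the rate E / eta = 1 per unit time.
stresses = relax_stress(E=200.0e3, eta=2.0e5, sigma_y=250.0,
                        total_strain=0.002, dt=0.01, steps=2000)
```

The stress starts outside the yield surface and decays monotonically toward $\sigma_Y$, which is the steady state f = 0 described above. For this explicit scheme the step must satisfy $\Delta t\, E/\eta < 1$ to avoid overshooting the yield surface.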

21.8. CREEP 

Creep is a time-dependent phenomenon even under a constant load. In
infinitesimal deformation, the total strain is split into the instantaneous
strain $e^e_{ij}$ and the creep strain $e^c_{ij}$ as

(1)  $e_{ij} = e^e_{ij} + e^c_{ij}$.

The constitutive law is defined in the form

(2)  $\dot\tau_{ij} = D^e_{ijkl}(\dot e_{kl} - \dot e^c_{kl})$

for the instantaneous strain rate and

(3)  $\dot e^c_{ij} = X_{ij}(\tau_{kl})$

for the creep rate, where $X_{ij}(\tau_{kl})$ are known functions of the current state
of stress. The finite element equation of motion is the same as Eq. (21.7:5)
with $\mathbf{F}_p$ derived from terms associated with $\dot e^c_{ij}$. The creep equations can be
solved by iterative methods (Mendelson et al. 1959, Greenbaum and
Rubinstein 1968).

In this chapter, we have only briefly touched upon the application of the
finite element method to various nonlinear problems. Readers are referred
to the literature, which is voluminous, for details. Computational mechanics
can provide us with tools to explore the enormous richness of nonlinear
mechanics: large deformation, composite structures, micro- and nanotechnologies,
living cells, DNA, proteins, enzymes and beyond.


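Problem 21.1. Write the incremental energy function Eq. (17.5:25) in matrix
notation

$$W(\mathbf{D}) = \mathbf{D}^T \mathbf{D}^e \mathbf{D}/2 = \bar{\mathbf{D}}^T \bar{\mathbf{D}}^e \bar{\mathbf{D}}/2,$$

where $\mathbf{D}$ is the symmetric velocity-like gradient referred to the
current configuration $\mathbf{x}$, written as
$\mathbf{D}^T = [D_{11}\ D_{22}\ D_{33}\ 2D_{12}\ 2D_{23}\ 2D_{31}]$, and
similarly for $\bar{\mathbf{D}}$ with the $D$'s replaced by $\bar{D}$'s. They
are related by the deformation gradient matrix $\mathbf{F}$ through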


D\1 D\2 

An 


'Du 

Di 2 

D 3 1 

A22 

A 23 

= F 


D 22 

D 2 3 

_sym 

D 33 


sym 


D 33 


The quantities D e , D e are incremental elastic modulus matrices. Show that 


6 - = 7[w (I '- H) + ^ A + G 





0 

1 

r 


'l 

0 

0' 

Qi 

0 " 




„ 1 





, Qi = 

1 

0 

1 

> Q2 ~ ~n 

0 

1 

0 

0 

Q2 





2 





1 

1 

0_ 


_0 

0 

1 
0 C 33 C 22 0 -C 23 0

0 Cn 0 0 -C31 

^ _ 0 — C12 0 0 

-C33/2 C31/2 C 23 /2 ’ 

sym —Ci i /2 C31/2 

-C22/2. 

where $J = \det(\mathbf{F})$, the $C_{ij}$ are the components of the matrix
$\mathbf{C} = \mathbf{F}^T\mathbf{F}$ and, for the energy density function
$W_0$ defined in Eq. (16.11:16a),

G = hCC T = b 4 hCC r , C T = [Cn C 22 C 33 Ci 2 C 23 C 3 i], 


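where the $C$'s in this vector are the components of the inverse of
$\mathbf{C}$. Expressing $\bar{\mathbf{D}}$ in terms of $\mathbf{D}$ in the form
$\bar{\mathbf{D}} = \mathbf{R}\mathbf{D}$, one has
$\mathbf{D}^e = \mathbf{R}^T \bar{\mathbf{D}}^e \mathbf{R}$. Find $\mathbf{R}$
in terms of the components of $\mathbf{F}$.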
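
A numerical check can help when working on Problem 21.1. The sketch below assumes that the two velocity-like gradients are related by the congruence transformation (reference-configuration matrix) = F^T (current-configuration matrix) F, and that both are stored as six-component vectors with doubled shear terms, as in the problem statement. It builds R column by column rather than in closed form, so it does not replace the requested derivation. All function names are hypothetical.

```python
import numpy as np

# Index pairs in the order [11, 22, 33, 12, 23, 31] used in the problem statement.
PAIRS = [(0, 0), (1, 1), (2, 2), (0, 1), (1, 2), (2, 0)]

def to_voigt(M):
    """Symmetric 3x3 matrix -> [M11, M22, M33, 2*M12, 2*M23, 2*M31]."""
    v = np.array([M[i, j] for i, j in PAIRS], dtype=float)
    v[3:] *= 2.0
    return v

def from_voigt(v):
    """Inverse of to_voigt: six-component vector -> symmetric 3x3 matrix."""
    M = np.zeros((3, 3))
    for k, (i, j) in enumerate(PAIRS):
        val = v[k] if i == j else 0.5 * v[k]
        M[i, j] = val
        M[j, i] = val
    return M

def build_R(F):
    """6x6 matrix R with to_voigt(F.T @ D @ F) == R @ to_voigt(D) (assumed relation)."""
    R = np.zeros((6, 6))
    for k in range(6):
        e = np.zeros(6)
        e[k] = 1.0
        R[:, k] = to_voigt(F.T @ from_voigt(e) @ F)   # image of the k-th unit vector
    return R

rng = np.random.default_rng(0)
F = np.eye(3) + 0.1 * rng.standard_normal((3, 3))     # arbitrary deformation gradient
S = rng.standard_normal((3, 3))
D = 0.5 * (S + S.T)                                   # arbitrary symmetric matrix
D_bar = F.T @ D @ F
R = build_R(F)

B = rng.standard_normal((6, 6))
De_bar = B @ B.T + 6.0 * np.eye(6)                    # arbitrary symmetric modulus matrix
De = R.T @ De_bar @ R                                 # transformed modulus matrix
W_current = 0.5 * to_voigt(D) @ De @ to_voigt(D)
W_reference = 0.5 * to_voigt(D_bar) @ De_bar @ to_voigt(D_bar)
```

The two energies agree because the map between the six-component vectors is linear, which is the argument behind the transformation rule for the modulus matrices. A closed-form R obtained by hand can be compared entry by entry against `build_R(F)`.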